Commit graph

729 commits

Author SHA1 Message Date
Oliver
4bdb8c7d6b Add note on code execution to CodeIODataset 2025-02-25 22:39:06 +00:00
Oliver
ef2f8d1978 Move data file & load into memory on first object creation 2025-02-25 22:36:38 +00:00
Oliver
f895a458c7 Register CodeIODataset 2025-02-24 18:28:35 +00:00
Oliver
efbcfb6eed Initial scoring algo for codeio 2025-02-24 18:27:53 +00:00
Oliver
5a222a398b Add tiny sample dataset & efficient sampling 2025-02-24 17:58:31 +00:00
Oliver
7ff162e9bb Remove outdated comment 2025-02-23 22:24:13 +00:00
Oliver
c0923a6fb8 Add validation 2025-02-23 22:23:45 +00:00
Oliver
3a5dc2080f Merge branch 'main' into codeio-sampler 2025-02-23 20:28:06 +00:00
Oliver
40d7dfdb5f Add input prediction 2025-02-23 20:27:27 +00:00
Andreas Koepf
0a487030ec minor arc_1d tweaks 2025-02-23 16:37:40 +01:00
Andreas Koepf
696769a3d6 remove unnecessary checks, use tuples 2025-02-23 13:17:48 +01:00
Andreas Koepf
e444bbf7a1 fix index out of range of arc_1d dataset (#190) 2025-02-23 12:51:41 +01:00
Andreas Koepf
a1a305c8d7 dev minor version one ahead of PyPI released version 2025-02-22 16:54:05 +01:00
Oliver
489dea7267 Draft CodeIO-derived reasoning problems dataset 2025-02-22 00:56:52 +00:00
Oliver
378cba2de1 Outline CodeIO dataset classes 2025-02-22 00:21:17 +00:00
Andreas Koepf
74f590e24f more native type hints 2025-02-21 21:23:14 +01:00
Andreas Koepf
d27ec36c94 add markdown tripple backtick code block for emoji_mystry hint 2025-02-21 21:06:07 +01:00
Andreas Köpf
e41b86ec36 Merge branch 'main' into feat/emoji-mystery 2025-02-21 20:58:39 +01:00
joesharratt1229
f7be02abfc added answer format spec in prompt 2025-02-21 18:03:05 +00:00
joesharratt1229
425ae24f3b added emoji dataset 2025-02-21 17:57:41 +00:00
Andreas Köpf
82839dec96 Merge pull request #181 from open-thought/rich/bitwise
Add Bitwise Arithmetic
2025-02-21 17:27:45 +01:00
Andreas Köpf
de362fb76f Merge pull request #182 from zafstojano/env/binary-alternation
feat(env): Binary Alternation
2025-02-21 17:27:16 +01:00
Andreas Koepf (aider)
5f9d5c0e0f docs: Add comment explaining automatic base detection in int() conversion 2025-02-21 17:16:11 +01:00
Andreas Koepf (aider)
196d236978 docs: Add detailed comments for BitwiseArithmeticConfig and BitwiseArithmeticDataset 2025-02-21 17:14:00 +01:00
Andreas Koepf
aa37fbc2cf refactor: Update type hints and remove unused imports in bitwise_arithmetic.py 2025-02-21 17:13:36 +01:00
Andreas Koepf (aider)
253fd55a00 feat: Add typing hints and improve difficulty parameter documentation in bitwise_arithmetic.py 2025-02-21 17:11:40 +01:00
Rich Jones
aaff230dff lint 2025-02-21 17:09:19 +01:00
Rich Jones
217771a1b0 ensure arbitrary bit depth and signed values 2025-02-21 16:52:26 +01:00
Andreas Köpf
32d319e291 Merge pull request #180 from Adefioye/list-functions
Add induction-based tasks for list functions
2025-02-21 16:20:49 +01:00
AhmedSaif2
75cbfb8783 fix parameter name in compute_decimal_reward docstring 2025-02-21 17:01:59 +02:00
abdulhakeem
a5e3ae6528 clean up 2025-02-21 08:54:55 -06:00
Andreas Koepf
222d5ebf94 reactivate default imports for PropositionalLogicDataset 2025-02-21 15:41:04 +01:00
Andreas Köpf
78b2b518d9 Merge branch 'main' into fix/prop_logix 2025-02-21 15:38:29 +01:00
Andreas Köpf
28dc0932c4 Merge pull request #178 from olliestanley/feature/unsloth-train
Add minimal working GRPO training example with Unsloth
2025-02-21 15:37:24 +01:00
Andreas Köpf
1e0f67f7a2 Merge pull request #175 from AhmedSaif2/fix-format
Add score_answer function to handle comma-formatted numbers
2025-02-21 15:36:21 +01:00
Andreas Koepf
476e37e70b use Decimal class for numeric comparison e.g. +0123.100 == 123.1 2025-02-21 15:36:06 +01:00
Andreas Koepf
ff5b210106 use native types List->list, Dict->dict, Set->set, Tuple->tuple 2025-02-21 15:15:38 +01:00
AhmedSaif2
6b5c7a8637 add a helper function to handle redundant code 2025-02-21 15:54:00 +02:00
Zafir Stojanovski
a168605fc7 pre-commit 2025-02-21 13:39:05 +01:00
Zafir Stojanovski
6c46b93ae2 binary alternation 2025-02-21 13:09:21 +01:00
Rich Jones
b9ab1cb2ae clean up comments 2025-02-21 12:17:21 +01:00
Rich Jones
cc451adb17 add to init 2025-02-21 12:07:17 +01:00
Rich Jones
1733927ed9 add bitwise arithmetic 2025-02-21 12:02:41 +01:00
abdulhakeem
b34db81272 Commit more changes 2025-02-21 00:37:29 -06:00
joesharratt1229
16c69b3b7a moved trivial check 2025-02-21 00:20:00 +00:00
joesharratt1229
f61a4569ff reimplemented prop logic 2025-02-20 23:59:31 +00:00
Oliver
31941d09e6 Answer scoring fixes to address edge cases 2025-02-20 22:04:01 +00:00
Andreas Koepf
5e7e205639 update GALLERY.my, bump version 2025-02-20 23:03:54 +01:00
Andreas Köpf
07587d1647 Merge branch 'main' into env/rotten-oranges 2025-02-20 22:51:07 +01:00
Andreas Köpf
d902debf7e Merge pull request #172 from open-thought/rich/jugs
Add Water Jug Puzzles
2025-02-20 22:48:12 +01:00