1239 commits

Author SHA1 Message Date
Oliver
081f84dec6 Add steps to synthesize CoTs with DeepSeekV3 2025-02-21 23:36:19 +00:00
Oliver
cce6002c70 Improve prompt for better LLM adherence 2025-02-21 23:00:48 +00:00
Andreas Koepf
eeb9fa31d5 more native type hints 2025-02-21 21:23:14 +01:00
Andreas Köpf
90a1181285 Merge pull request #185 from joesharratt1229/feat/emoji-mystery: Implements #173 2025-02-21 21:09:26 +01:00
Andreas Koepf
51808210aa add markdown tripple backtick code block for emoji_mystry hint 2025-02-21 21:06:07 +01:00
Andreas Köpf
c56045b9a7 Merge branch 'main' into feat/emoji-mystery 2025-02-21 20:58:39 +01:00
Oliver
cb1f634078 Prompt tweak 2025-02-21 18:34:13 +00:00
joesharratt1229
1fb73011f8 added answer format spec in prompt 2025-02-21 18:03:05 +00:00
joesharratt1229
9b9554e489 added tests 2025-02-21 17:58:13 +00:00
joesharratt1229
5e64d1c24c added emoji dataset 2025-02-21 17:57:41 +00:00
Oliver
a0ccfa5144 Merge branch 'main' into codeio-experiments 2025-02-21 17:25:08 +00:00
Andreas Koepf
97b30f5f53 update GALLERY.md 2025-02-21 17:30:33 +01:00
Andreas Köpf
1c6359f1f3 Merge pull request #181 from open-thought/rich/bitwise: Add Bitwise Arithmetic 2025-02-21 17:27:45 +01:00
Andreas Köpf
2947038557 Merge pull request #182 from zafstojano/env/binary-alternation: feat(env): Binary Alternation 2025-02-21 17:27:16 +01:00
Andreas Koepf (aider)
af4d79e947 fix: Handle negative hex number prefix variations in bitwise arithmetic test 2025-02-21 17:23:50 +01:00
Andreas Koepf (aider)
e846c53347 test: Update bitwise arithmetic difficulty levels to [1, 2, 3] 2025-02-21 17:22:36 +01:00
Andreas Koepf (aider)
660f7e6f03 test: Add comprehensive unit tests for BitwiseArithmeticDataset 2025-02-21 17:21:00 +01:00
Andreas Koepf (aider)
bae97aa795 docs: Add comment explaining automatic base detection in int() conversion 2025-02-21 17:16:11 +01:00
Andreas Koepf (aider)
5ff957a766 docs: Add detailed comments for BitwiseArithmeticConfig and BitwiseArithmeticDataset 2025-02-21 17:14:00 +01:00
Andreas Koepf
44f4cc08eb refactor: Update type hints and remove unused imports in bitwise_arithmetic.py 2025-02-21 17:13:36 +01:00
Andreas Koepf (aider)
c91d13bd08 feat: Add typing hints and improve difficulty parameter documentation in bitwise_arithmetic.py 2025-02-21 17:11:40 +01:00
Rich Jones
1cf6821f17 lint 2025-02-21 17:09:19 +01:00
Rich Jones
c1b26cf184 ensure arbitrary bit depth and signed values 2025-02-21 16:52:26 +01:00
Andreas Köpf
700aab6114 Merge pull request #180 from Adefioye/list-functions: Add induction-based tasks for list functions 2025-02-21 16:20:49 +01:00
Andreas Köpf
bad2abf63e Merge pull request #184 from AhmedSaif2/main: fix parameter name in compute_decimal_reward docstring 2025-02-21 16:15:05 +01:00
AhmedSaif2
5d3bfda677 fix parameter name in compute_decimal_reward docstring 2025-02-21 17:01:59 +02:00
abdulhakeem
a5e88dbd2e clean up 2025-02-21 08:54:55 -06:00
Andreas Köpf
ef33dbc077 Merge pull request #183 from open-thought/rich/rdmepolish: Enhance README friendliness 2025-02-21 15:45:56 +01:00
Andreas Köpf
88474a98d9 Merge pull request #179 from joesharratt1229/fix/prop_logix: Fix/prop logix 2025-02-21 15:43:13 +01:00
Andreas Koepf
7f30e711e5 reactivate default imports for PropositionalLogicDataset 2025-02-21 15:41:04 +01:00
Andreas Köpf
802b8c4bed Merge branch 'main' into fix/prop_logix 2025-02-21 15:38:29 +01:00
Andreas Köpf
b59ccdefa2 Merge pull request #178 from olliestanley/feature/unsloth-train: Add minimal working GRPO training example with Unsloth 2025-02-21 15:37:24 +01:00
Andreas Köpf
a6a5d30f1c Merge pull request #175 from AhmedSaif2/fix-format: Add score_answer function to handle comma-formatted numbers 2025-02-21 15:36:21 +01:00
Andreas Koepf
acde58a200 use Decimal class for numeric comparison e.g. +0123.100 == 123.1 2025-02-21 15:36:06 +01:00
Andreas Koepf
3e7ff3b084 use native types List->list, Dict->dict, Set->set, Tuple->tuple 2025-02-21 15:15:38 +01:00
Rich Jones
0d84a8ecf5 fix mkd 2025-02-21 15:14:36 +01:00
Rich Jones
0730664eb6 link to eval readme 2025-02-21 15:13:29 +01:00
Rich Jones
84583b450a edits 2025-02-21 15:09:11 +01:00
Rich Jones
2cb395ba30 polish readme a bit 2025-02-21 15:04:04 +01:00
AhmedSaif2
5d02064b5a add a helper function to handle redundant code 2025-02-21 15:54:00 +02:00
AhmedSaif2
5c45e55340 extend format tests to allow questions that ends with question marks 2025-02-21 15:50:03 +02:00
Zafir Stojanovski
77789257d3 include pre-parsed responses in json 2025-02-21 13:50:48 +01:00
Zafir Stojanovski
96464388bb pre-commit 2025-02-21 13:39:05 +01:00
Zafir Stojanovski
941085e0c5 binary alternation 2025-02-21 13:09:21 +01:00
Rich Jones
b6c7ceabb2 clean up comments 2025-02-21 12:17:21 +01:00
Rich Jones
ee9202d63d add to init 2025-02-21 12:07:17 +01:00
Rich Jones
17088e9b42 add bitwise arithmetic 2025-02-21 12:02:41 +01:00
abdulhakeem
13a696979f Remove play file 2025-02-21 00:51:04 -06:00
abdulhakeem
624594bb1a Commit more changes 2025-02-21 00:37:29 -06:00
joesharratt1229
5fb655e390 moved trivial check 2025-02-21 00:20:00 +00:00