Commit graph

989 commits

Author SHA1 Message Date
Zafir Stojanovski
f19498edb8 async 2025-02-24 22:07:35 +01:00
Zafir Stojanovski
9a27b80fd1 generate inputs synchronously 2025-02-24 15:58:06 +01:00
Zafir Stojanovski
5a23ba6b7c pre-commit 2025-02-23 13:11:31 +01:00
Zafir Stojanovski
0d07746a4e sampling code 2025-02-23 00:40:11 +01:00
Zafir Stojanovski
e84cec26ed greedy coreset sampling 2025-02-22 16:15:14 +01:00
Zafir Stojanovski
e9ff3a1ee2 exploratory notebook 2025-02-22 00:46:33 +01:00
Andreas Koepf
74f590e24f more native type hints 2025-02-21 21:23:14 +01:00
Andreas Köpf
ae26704d05 Merge pull request #185 from joesharratt1229/feat/emoji-mystery (Implements #173) 2025-02-21 21:09:26 +01:00
Andreas Koepf
d27ec36c94 add markdown tripple backtick code block for emoji_mystry hint 2025-02-21 21:06:07 +01:00
Andreas Köpf
e41b86ec36 Merge branch 'main' into feat/emoji-mystery 2025-02-21 20:58:39 +01:00
joesharratt1229
f7be02abfc added answer format spec in prompt 2025-02-21 18:03:05 +00:00
joesharratt1229
650387f748 added tests 2025-02-21 17:58:13 +00:00
joesharratt1229
425ae24f3b added emoji dataset 2025-02-21 17:57:41 +00:00
Andreas Koepf
71ecc0cfca update GALLERY.md 2025-02-21 17:30:33 +01:00
Andreas Köpf
82839dec96 Merge pull request #181 from open-thought/rich/bitwise (Add Bitwise Arithmetic) 2025-02-21 17:27:45 +01:00
Andreas Köpf
de362fb76f Merge pull request #182 from zafstojano/env/binary-alternation (feat(env): Binary Alternation) 2025-02-21 17:27:16 +01:00
Andreas Koepf (aider)
5fb26fc709 fix: Handle negative hex number prefix variations in bitwise arithmetic test 2025-02-21 17:23:50 +01:00
Andreas Koepf (aider)
2abe783be4 test: Update bitwise arithmetic difficulty levels to [1, 2, 3] 2025-02-21 17:22:36 +01:00
Andreas Koepf (aider)
5b233ce9cc test: Add comprehensive unit tests for BitwiseArithmeticDataset 2025-02-21 17:21:00 +01:00
Andreas Koepf (aider)
5f9d5c0e0f docs: Add comment explaining automatic base detection in int() conversion 2025-02-21 17:16:11 +01:00
Andreas Koepf (aider)
196d236978 docs: Add detailed comments for BitwiseArithmeticConfig and BitwiseArithmeticDataset 2025-02-21 17:14:00 +01:00
Andreas Koepf
aa37fbc2cf refactor: Update type hints and remove unused imports in bitwise_arithmetic.py 2025-02-21 17:13:36 +01:00
Andreas Koepf (aider)
253fd55a00 feat: Add typing hints and improve difficulty parameter documentation in bitwise_arithmetic.py 2025-02-21 17:11:40 +01:00
Rich Jones
aaff230dff lint 2025-02-21 17:09:19 +01:00
Rich Jones
217771a1b0 ensure arbitrary bit depth and signed values 2025-02-21 16:52:26 +01:00
Andreas Köpf
32d319e291 Merge pull request #180 from Adefioye/list-functions (Add induction-based tasks for list functions) 2025-02-21 16:20:49 +01:00
Andreas Köpf
d24293358e Merge pull request #184 from AhmedSaif2/main (fix parameter name in compute_decimal_reward docstring) 2025-02-21 16:15:05 +01:00
AhmedSaif2
75cbfb8783 fix parameter name in compute_decimal_reward docstring 2025-02-21 17:01:59 +02:00
abdulhakeem
a5e3ae6528 clean up 2025-02-21 08:54:55 -06:00
Andreas Köpf
ee2d04439f Merge pull request #183 from open-thought/rich/rdmepolish (Enhance README friendliness) 2025-02-21 15:45:56 +01:00
Andreas Köpf
6d22bb6f27 Merge pull request #179 from joesharratt1229/fix/prop_logix (Fix/prop logix) 2025-02-21 15:43:13 +01:00
Andreas Koepf
222d5ebf94 reactivate default imports for PropositionalLogicDataset 2025-02-21 15:41:04 +01:00
Andreas Köpf
78b2b518d9 Merge branch 'main' into fix/prop_logix 2025-02-21 15:38:29 +01:00
Andreas Köpf
28dc0932c4 Merge pull request #178 from olliestanley/feature/unsloth-train (Add minimal working GRPO training example with Unsloth) 2025-02-21 15:37:24 +01:00
Andreas Köpf
1e0f67f7a2 Merge pull request #175 from AhmedSaif2/fix-format (Add score_answer function to handle comma-formatted numbers) 2025-02-21 15:36:21 +01:00
Andreas Koepf
476e37e70b use Decimal class for numeric comparison e.g. +0123.100 == 123.1 2025-02-21 15:36:06 +01:00
Andreas Koepf
ff5b210106 use native types List->list, Dict->dict, Set->set, Tuple->tuple 2025-02-21 15:15:38 +01:00
Rich Jones
eb72b055b2 fix mkd 2025-02-21 15:14:36 +01:00
Rich Jones
745bea8a6e link to eval readme 2025-02-21 15:13:29 +01:00
Rich Jones
f142ea7cd3 edits 2025-02-21 15:09:11 +01:00
Rich Jones
4fb0a71381 polish readme a bit 2025-02-21 15:04:04 +01:00
AhmedSaif2
6b5c7a8637 add a helper function to handle redundant code 2025-02-21 15:54:00 +02:00
AhmedSaif2
065d3d0afc extend format tests to allow questions that ends with question marks 2025-02-21 15:50:03 +02:00
Zafir Stojanovski
0391a99446 include pre-parsed responses in json 2025-02-21 13:50:48 +01:00
Zafir Stojanovski
a168605fc7 pre-commit 2025-02-21 13:39:05 +01:00
Zafir Stojanovski
6c46b93ae2 binary alternation 2025-02-21 13:09:21 +01:00
Rich Jones
b9ab1cb2ae clean up comments 2025-02-21 12:17:21 +01:00
Rich Jones
cc451adb17 add to init 2025-02-21 12:07:17 +01:00
Rich Jones
1733927ed9 add bitwise arithmetic 2025-02-21 12:02:41 +01:00
abdulhakeem
e80e67d5af Remove play file 2025-02-21 00:51:04 -06:00