Andreas Köpf
|
b71a051f6a
|
Merge pull request #193 from open-thought/190_fix_arc_1d_out_of_range
Fix index out of range for arc_1d dataset
|
2025-02-23 13:20:08 +01:00 |
|
Andreas Koepf
|
696769a3d6
|
remove unnecessary checks, use tuples
|
2025-02-23 13:17:48 +01:00 |
|
Andreas Koepf
|
f600c7eb30
|
add arc_1d size range test
|
2025-02-23 12:58:51 +01:00 |
|
Andreas Koepf
|
e444bbf7a1
|
fix index out of range of arc_1d dataset (#190)
|
2025-02-23 12:51:41 +01:00 |
|
Andreas Koepf
|
a1a305c8d7
|
dev minor version one ahead of PyPI released version
|
2025-02-22 16:54:05 +01:00 |
|
Andreas Köpf
|
5c73043a1e
|
Merge pull request #176 from olliestanley/codeio-experiments
Experiments with CodeI/O techniques for synthesising reasoning data
|
2025-02-22 16:24:17 +01:00 |
|
Oliver
|
94cd3c4d43
|
Add steps to synthesize CoTs with DeepSeekV3
|
2025-02-21 23:36:19 +00:00 |
|
Oliver
|
3297fc1bc0
|
Improve prompt for better LLM adherence
|
2025-02-21 23:00:48 +00:00 |
|
Andreas Koepf
|
74f590e24f
|
more native type hints
|
2025-02-21 21:23:14 +01:00 |
|
Andreas Köpf
|
ae26704d05
|
Merge pull request #185 from joesharratt1229/feat/emoji-mystery
Implements #173
|
2025-02-21 21:09:26 +01:00 |
|
Andreas Koepf
|
d27ec36c94
|
add markdown tripple backtick code block for emoji_mystry hint
|
2025-02-21 21:06:07 +01:00 |
|
Andreas Köpf
|
e41b86ec36
|
Merge branch 'main' into feat/emoji-mystery
|
2025-02-21 20:58:39 +01:00 |
|
Oliver
|
fc2c43b7d3
|
Prompt tweak
|
2025-02-21 18:34:13 +00:00 |
|
joesharratt1229
|
f7be02abfc
|
added answer format spec in prompt
|
2025-02-21 18:03:05 +00:00 |
|
joesharratt1229
|
650387f748
|
added tests
|
2025-02-21 17:58:13 +00:00 |
|
joesharratt1229
|
425ae24f3b
|
added emoji dataset
|
2025-02-21 17:57:41 +00:00 |
|
Oliver
|
c2b42d2717
|
Merge branch 'main' into codeio-experiments
|
2025-02-21 17:25:08 +00:00 |
|
Andreas Koepf
|
71ecc0cfca
|
update GALLERY.md
|
2025-02-21 17:30:33 +01:00 |
|
Andreas Köpf
|
82839dec96
|
Merge pull request #181 from open-thought/rich/bitwise
Add Bitwise Arithmetic
|
2025-02-21 17:27:45 +01:00 |
|
Andreas Köpf
|
de362fb76f
|
Merge pull request #182 from zafstojano/env/binary-alternation
feat(env): Binary Alternation
|
2025-02-21 17:27:16 +01:00 |
|
Andreas Koepf (aider)
|
5fb26fc709
|
fix: Handle negative hex number prefix variations in bitwise arithmetic test
|
2025-02-21 17:23:50 +01:00 |
|
Andreas Koepf (aider)
|
2abe783be4
|
test: Update bitwise arithmetic difficulty levels to [1, 2, 3]
|
2025-02-21 17:22:36 +01:00 |
|
Andreas Koepf (aider)
|
5b233ce9cc
|
test: Add comprehensive unit tests for BitwiseArithmeticDataset
|
2025-02-21 17:21:00 +01:00 |
|
Andreas Koepf (aider)
|
5f9d5c0e0f
|
docs: Add comment explaining automatic base detection in int() conversion
|
2025-02-21 17:16:11 +01:00 |
|
Andreas Koepf (aider)
|
196d236978
|
docs: Add detailed comments for BitwiseArithmeticConfig and BitwiseArithmeticDataset
|
2025-02-21 17:14:00 +01:00 |
|
Andreas Koepf
|
aa37fbc2cf
|
refactor: Update type hints and remove unused imports in bitwise_arithmetic.py
|
2025-02-21 17:13:36 +01:00 |
|
Andreas Koepf (aider)
|
253fd55a00
|
feat: Add typing hints and improve difficulty parameter documentation in bitwise_arithmetic.py
|
2025-02-21 17:11:40 +01:00 |
|
Rich Jones
|
aaff230dff
|
lint
|
2025-02-21 17:09:19 +01:00 |
|
Rich Jones
|
217771a1b0
|
ensure arbitrary bit depth and signed values
|
2025-02-21 16:52:26 +01:00 |
|
Andreas Köpf
|
32d319e291
|
Merge pull request #180 from Adefioye/list-functions
Add induction-based tasks for list functions
|
2025-02-21 16:20:49 +01:00 |
|
Andreas Köpf
|
d24293358e
|
Merge pull request #184 from AhmedSaif2/main
fix parameter name in compute_decimal_reward docstring
|
2025-02-21 16:15:05 +01:00 |
|
AhmedSaif2
|
75cbfb8783
|
fix parameter name in compute_decimal_reward docstring
|
2025-02-21 17:01:59 +02:00 |
|
abdulhakeem
|
a5e3ae6528
|
clean up
|
2025-02-21 08:54:55 -06:00 |
|
Andreas Köpf
|
ee2d04439f
|
Merge pull request #183 from open-thought/rich/rdmepolish
Enhance README friendliness
|
2025-02-21 15:45:56 +01:00 |
|
Andreas Köpf
|
6d22bb6f27
|
Merge pull request #179 from joesharratt1229/fix/prop_logix
Fix/prop logix
|
2025-02-21 15:43:13 +01:00 |
|
Andreas Koepf
|
222d5ebf94
|
reactivate default imports for PropositionalLogicDataset
|
2025-02-21 15:41:04 +01:00 |
|
Andreas Köpf
|
78b2b518d9
|
Merge branch 'main' into fix/prop_logix
|
2025-02-21 15:38:29 +01:00 |
|
Andreas Köpf
|
28dc0932c4
|
Merge pull request #178 from olliestanley/feature/unsloth-train
Add minimal working GRPO training example with Unsloth
|
2025-02-21 15:37:24 +01:00 |
|
Andreas Köpf
|
1e0f67f7a2
|
Merge pull request #175 from AhmedSaif2/fix-format
Add score_answer function to handle comma-formatted numbers
|
2025-02-21 15:36:21 +01:00 |
|
Andreas Koepf
|
476e37e70b
|
use Decimal class for numeric comparison e.g. +0123.100 == 123.1
|
2025-02-21 15:36:06 +01:00 |
|
Andreas Koepf
|
ff5b210106
|
use native types List->list, Dict->dict, Set->set, Tuple->tuple
|
2025-02-21 15:15:38 +01:00 |
|
Rich Jones
|
eb72b055b2
|
fix mkd
|
2025-02-21 15:14:36 +01:00 |
|
Rich Jones
|
745bea8a6e
|
link to eval readme
|
2025-02-21 15:13:29 +01:00 |
|
Rich Jones
|
f142ea7cd3
|
edits
|
2025-02-21 15:09:11 +01:00 |
|
Rich Jones
|
4fb0a71381
|
polish readme a bit
|
2025-02-21 15:04:04 +01:00 |
|
AhmedSaif2
|
6b5c7a8637
|
add a helper function to handle redundant code
|
2025-02-21 15:54:00 +02:00 |
|
AhmedSaif2
|
065d3d0afc
|
extend format tests to allow questions that ends with question marks
|
2025-02-21 15:50:03 +02:00 |
|
Zafir Stojanovski
|
0391a99446
|
include pre-parsed responses in json
|
2025-02-21 13:50:48 +01:00 |
|
Zafir Stojanovski
|
a168605fc7
|
pre-commit
|
2025-02-21 13:39:05 +01:00 |
|
Zafir Stojanovski
|
6c46b93ae2
|
binary alternation
|
2025-02-21 13:09:21 +01:00 |
|