Commit graph

891 commits

Author SHA1 Message Date
Oliver
5fa06c961f Fix 2025-02-26 11:17:23 +00:00
Andreas Köpf
48f082663a
Fix PoolMatrixConfigs::score_answer(), add unit tests (#215) 2025-02-26 00:43:18 +01:00
Andreas Koepf
bba128ffd0 fix score_answer of pool_matrix (if -> elif), remove print 2025-02-25 23:43:29 +01:00
Andreas Koepf
f9e8f8b064 add try-except to GraphColorDataset.score_answer() 2025-02-25 23:43:29 +01:00
Andreas Koepf
65d17b9850 add None/empty check to score_answer of cryptarithm 2025-02-25 23:43:29 +01:00
Oliver
aa6759c160 Merge branch 'main' into codeio-sampler 2025-02-25 22:41:47 +00:00
Oliver
81c77a495d Add note on code execution to CodeIODataset 2025-02-25 22:39:06 +00:00
Oliver
0252dd905f Move data file & load into memory on first object creation 2025-02-25 22:36:38 +00:00
vncntt
5f01049607
Add KnightsKnavesDataset (knights_knaves)
Adapted code from https://github.com/AlphaPav/mem-kk-logic/blob/main/data_prep/lib_kk.py

---------

Co-authored-by: Andreas Koepf (aider) <andreas.koepf@provisio.com>
2025-02-25 20:15:38 +01:00
Oliver
fe502d5eb2 Register CodeIODataset 2025-02-24 18:28:35 +00:00
Oliver
43daec67ea Initial scoring algo for codeio 2025-02-24 18:27:53 +00:00
Oliver
1795c8ea7a Add tiny sample dataset & efficient sampling 2025-02-24 17:58:31 +00:00
Oliver
7b5a12a92c Remove outdated comment 2025-02-23 22:24:13 +00:00
Oliver
e07287e1f9 Add validation 2025-02-23 22:23:45 +00:00
Andreas Koepf
b5f6f7d753 bump version, update gallery 2025-02-23 22:36:39 +01:00
Andreas Köpf
d115655f0a
Merge pull request #191 from zafstojano/env/shortest-path
feat(env): Shortest Path
2025-02-23 22:28:43 +01:00
Andreas Koepf
45e452bff6 reduce size of default shortest_path maze grid 2025-02-23 22:27:17 +01:00
Oliver
342902683f Merge branch 'main' into codeio-sampler 2025-02-23 20:28:06 +00:00
Oliver
f787069fd2 Add input prediction 2025-02-23 20:27:27 +00:00
Zafir Stojanovski
c5f37d5e9f predict actual path 2025-02-23 18:24:23 +01:00
Andreas Koepf
469934d9b7 minor arc_1d tweaks 2025-02-23 16:37:40 +01:00
Andreas Koepf
ec3050a4f6 remove unnecessary checks, use tuples 2025-02-23 13:17:48 +01:00
Andreas Koepf
7a45b14a49 fix index out of range of arc_1d dataset (#190) 2025-02-23 12:51:41 +01:00
Zafir Stojanovski
97b3097984 shortest path 2025-02-23 11:25:00 +01:00
Andreas Koepf
e4102a44f6 dev minor version one ahead of PyPI released version 2025-02-22 16:54:05 +01:00
Oliver
e718168428 Draft CodeIO-derived reasoning problems dataset 2025-02-22 00:56:52 +00:00
Oliver
563480329e Outline CodeIO dataset classes 2025-02-22 00:21:17 +00:00
Andreas Koepf
eeb9fa31d5 more native type hints 2025-02-21 21:23:14 +01:00
Andreas Koepf
51808210aa add markdown tripple backtick code block for emoji_mystry hint 2025-02-21 21:06:07 +01:00
Andreas Köpf
c56045b9a7
Merge branch 'main' into feat/emoji-mystery 2025-02-21 20:58:39 +01:00
joesharratt1229
1fb73011f8 added answer format spec in prompt 2025-02-21 18:03:05 +00:00
joesharratt1229
5e64d1c24c added emoji dataset 2025-02-21 17:57:41 +00:00
Andreas Köpf
1c6359f1f3
Merge pull request #181 from open-thought/rich/bitwise
Add Bitwise Arithmetic
2025-02-21 17:27:45 +01:00
Andreas Köpf
2947038557
Merge pull request #182 from zafstojano/env/binary-alternation
feat(env): Binary Alternation
2025-02-21 17:27:16 +01:00
Andreas Koepf (aider)
bae97aa795 docs: Add comment explaining automatic base detection in int() conversion 2025-02-21 17:16:11 +01:00
Andreas Koepf (aider)
5ff957a766 docs: Add detailed comments for BitwiseArithmeticConfig and BitwiseArithmeticDataset 2025-02-21 17:14:00 +01:00
Andreas Koepf
44f4cc08eb refactor: Update type hints and remove unused imports in bitwise_arithmetic.py 2025-02-21 17:13:36 +01:00
Andreas Koepf (aider)
c91d13bd08 feat: Add typing hints and improve difficulty parameter documentation in bitwise_arithmetic.py 2025-02-21 17:11:40 +01:00
Rich Jones
1cf6821f17 lint 2025-02-21 17:09:19 +01:00
Rich Jones
c1b26cf184 ensure arbitrary bit depth and signed values 2025-02-21 16:52:26 +01:00
Andreas Köpf
700aab6114
Merge pull request #180 from Adefioye/list-functions
Add induction-based tasks for list functions
2025-02-21 16:20:49 +01:00
AhmedSaif2
5d3bfda677 fix parameter name in compute_decimal_reward docstring 2025-02-21 17:01:59 +02:00
abdulhakeem
a5e88dbd2e clean up 2025-02-21 08:54:55 -06:00
Andreas Koepf
7f30e711e5 reactivate default imports for PropositionalLogicDataset 2025-02-21 15:41:04 +01:00
Andreas Köpf
802b8c4bed
Merge branch 'main' into fix/prop_logix 2025-02-21 15:38:29 +01:00
Andreas Köpf
b59ccdefa2
Merge pull request #178 from olliestanley/feature/unsloth-train
Add minimal working GRPO training example with Unsloth
2025-02-21 15:37:24 +01:00
Andreas Köpf
a6a5d30f1c
Merge pull request #175 from AhmedSaif2/fix-format
Add score_answer function to handle comma-formatted numbers
2025-02-21 15:36:21 +01:00
Andreas Koepf
acde58a200 use Decimal class for numeric comparison e.g. +0123.100 == 123.1 2025-02-21 15:36:06 +01:00
Andreas Koepf
3e7ff3b084 use native types List->list, Dict->dict, Set->set, Tuple->tuple 2025-02-21 15:15:38 +01:00
AhmedSaif2
5d02064b5a add a helper function to handle redundant code 2025-02-21 15:54:00 +02:00