Commit graph

27 commits

Author SHA1 Message Date
Denini Gabriel
02b7fac863
fix encoding to be able to run on win (#502) 2025-08-18 09:19:45 +01:00
joesharratt1229
4b60c32978
Curr exp (#487)
* began curr exp

* added holdout words

* updated config

* added context

* updated base curriculum

* updaed

* updated curriculum

* updated

* updated

* updated automatic flag

* updated ray trainer

* update
2025-07-25 20:38:47 +01:00
Zafir Stojanovski
dafdee621e
fix(env): Unify CodeIO datasets (#405)
* unify codeio

* filtered for libraries not present in reasoning-gym
2025-04-02 22:40:03 +02:00
joesharratt1229
43c739cb3e
Feat/curr adj (#394) 2025-04-02 06:39:14 +01:00
Roman Machacek
2c52f33c3a
CodeIO HQ Dataset (#382)
* ADD: CodeIO high quality dataset

Based on the dataset for CodeI/O. Annotated using Qwen-Coder and filtered based on the various metrics resulting in high quality filtered dataset, where approx 50% of the original data is kept.

* ADD: Compressed version

* Delete pure json version
2025-04-01 22:34:33 +02:00
Oliver Stanley
f14662e213
Add a few new CodeI/O samples, resolve numeric answer scoring bug (#332)
* add handful of codeio samples

* scoring fix
2025-03-11 23:55:33 +01:00
Adefioye
c8c3930797
Add ACRE(Abstract Causal REasoning Beyond Covariation) python generators (#199)
* Add acre python generators
* acre: improved prompt & formatting of examples, support arbitrary sizes

---------

Co-authored-by: Andreas Koepf <andreas.koepf@provisio.com>
2025-03-10 00:09:54 +01:00
Oliver Stanley
d1e505a8e9
First version of CodeI/O reasoning data (#264)
* notebook for prepping first set of raw code files
* updated codeio processing notebook for repo-level processing
* fix for edge case in codeio scoring
* Add reformat notebook
* filtering pass
* add non-determinism filtering
* Tweak CodeIODataset & include first real data
* add basic codeio test, metadata
2025-03-05 22:34:11 +01:00
Oliver
0252dd905f Move data file & load into memory on first object creation 2025-02-25 22:36:38 +00:00
Andreas Koepf
8512fa2297 add sampled subset of rush hour database 2025-02-14 11:10:30 +01:00
Andreas Köpf
3f6b2fc807
Add Coaching & ScoreBoard class (result tracking) (#72)
* feat: Add Coach and ScoreBoard classes for performance tracking and difficulty adjustment
* feat: Add GroupedScores class to wrap aggregated scores
* refactor: Create ScoreStats class with tuple-based score statistics
* feat: Add unit test for Coach with CompositeDataset and multiple datasets
* fix: Add difficulty metadata to leg counting dataset
* feat: Add clear() method to ScoreBoard to reset all stored data
* feat: Add __len__ method to ScoreBoard to return number of scores
* feat: Add update_dataset_config method to CompositeDataset
* cleanup __init__ & imports
2025-02-06 23:15:28 +01:00
Zafir Stojanovski
5cc45d6939 delete words_alpha.txt 2025-02-06 10:12:25 +01:00
Zafir Stojanovski
76a3d4761c generate all english anagrams 2025-02-05 16:25:23 +01:00
Cavit Erginsoy
aff0fecef4 lint 2025-02-03 11:35:30 +00:00
Cavit Erginsoy
9b1068ea39 Merge remote-tracking branch 'upstream/main' 2025-02-03 07:44:32 +00:00
Cavit Erginsoy
ade33e1a22 filtered out lesser known words to aid model learning ease 2025-02-03 07:19:30 +00:00
Cavit Erginsoy
372e778c26 improved word quality, removed extremly rares 2025-02-02 19:24:53 +00:00
Andreas Koepf
5ae329becd lint 2025-01-30 23:14:32 +01:00
Cavit Erginsoy
df3c4580ee INIT 2025-01-30 21:32:46 +00:00
Andreas Koepf
0f494ebff1 rename static.py -> wordle_words.py 2025-01-30 01:06:52 +01:00
Andreas Koepf
fc775eda7e lint, seed & size for figlet 2025-01-30 00:58:34 +01:00
Rich Jones
cbf5087887 figlet fonts 2025-01-29 20:38:38 +01:00
Andreas Koepf
20069b2a7d formatting 2025-01-24 10:34:07 +01:00
Andreas Koepf (aider)
c05a938ab7 refactor: Replace deprecated open_text() with files() API 2025-01-23 19:19:14 +01:00
Andreas Koepf
f78bb85a85 add plaintext of book In the year 2889 by Michel & Jules Verne (project Gutenberg eBook #19362) 2025-01-23 19:01:44 +01:00
Andreas Koepf (aider)
746e05aa3d feat: Add data file loading utilities to reasoning_gym.data 2025-01-23 18:58:58 +01:00
Andreas Koepf (aider)
22538be4c5 feat: Add data file packaging configuration and __init__.py for reasoning_gym.data 2025-01-23 18:57:34 +01:00