Oliver Stanley
7475a20700
include ranges rather than sampled values in difficulty metadata dicts ( #387 )
...
* update difficulty metadata for logic datasets
* update difficulty metadata for graph datasets
* update difficulty metadata for geometry datasets
* update difficulty metadata for games datasets
* update difficulty metadata for cognition datasets
* update difficulty metadata for arithmetic datasets
* update difficulty metadata for arc datasets
* update difficulty metadata for algorithmic datasets
* update difficulty metadata for algebra datasets
* use tuples
* update tests
* update tests
2025-03-20 10:27:03 +01:00
joesharratt1229
6354ca5d35
added rearc curr ( #358 )
2025-03-13 21:14:03 +01:00
Andreas Köpf
5d7fbac0ad
Minor question template & score_answer improvements ( #261 )
...
* math prompt improvements
* ignore brackets in complex_arithmetic results
* improve additional instruction in prompt of polynomial_equations
* more strict tests for score_answer in polynomial_equations
* simplify special reward handling
* fix test_intermediate_integration
* fix sokoban dataset
* add common dataset score_answer consistency test
2025-03-04 21:55:09 +01:00
Andreas Koepf
127f505798
add ArcAgiDataset class, fix score_entry() metadata params
2025-02-08 23:18:18 +01:00
Andreas Koepf
d0ee809757
re-arc cleanup
2025-02-08 19:07:28 +01:00
joesharratt1229
5d17a6b31c
added testing of rearc
2025-02-08 11:42:49 +00:00