Commit graph

10 commits

Author SHA1 Message Date
Andreas Köpf
b2904ccab9 Minor question template & score_answer improvements (#261)
* math prompt improvements
* ignore brackets in complex_arithmetic results
* improve additional instruction in prompt of polynomial_equations
* more strict tests for score_answer in polynomial_equations
* simplify special reward handling
* fix test_intermediate_integration
* fix sokoban dataset
* add common dataset score_answer consistency test
2025-03-04 21:55:09 +01:00
Zafir Stojanovski
2f9d94c1e7 fix: Unify Prompts (#254)
* remove cot
* fix prompt template
* fix pool matrix
* spiral matrix fixed
2025-03-03 21:55:53 +01:00
Andreas Koepf
74f590e24f more native type hints 2025-02-21 21:23:14 +01:00
Andreas Koepf
ff5b210106 use native types List->list, Dict->dict, Set->set, Tuple->tuple 2025-02-21 15:15:38 +01:00
Zafir Stojanovski
4b71eb2da9 improve template 2025-02-16 11:02:51 +01:00
Rich Jones
02c1fdf304 explicitly handle multiple solutions 2025-02-14 12:07:53 +01:00
Andreas Koepf
4e49806d22 add ArcAgiDataset class, fix score_entry() metadata params 2025-02-08 23:18:18 +01:00
Joe Norton
ff8f627f8d add palindrome score_answer
add palindrome score_answer & test
2025-02-02 18:04:47 -08:00
Joe Norton
7f34d98c25 lint 2025-01-31 18:45:52 -08:00
Joe Norton
0cc2645027 add palindrome_generation 2025-01-31 18:45:52 -08:00