10 commits

Author SHA1 Message Date
Andreas Köpf
5d7fbac0ad
Minor question template & score_answer improvements (#261)
* math prompt improvements
* ignore brackets in complex_arithmetic results
* improve additional instruction in prompt of polynomial_equations
* more strict tests for score_answer in polynomial_equations
* simplify special reward handling
* fix test_intermediate_integration
* fix sokoban dataset
* add common dataset score_answer consistency test
2025-03-04 21:55:09 +01:00
Zafir Stojanovski
01e1c8f9af
fix: Unify Prompts (#254)
* remove cot
* fix prompt template
* fix pool matrix
* spiral matrix fixed
2025-03-03 21:55:53 +01:00
Andreas Koepf
eeb9fa31d5 more native type hints 2025-02-21 21:23:14 +01:00
Andreas Koepf
3e7ff3b084 use native types List->list, Dict->dict, Set->set, Tuple->tuple 2025-02-21 15:15:38 +01:00
Zafir Stojanovski
edecfef50d improve template 2025-02-16 11:02:51 +01:00
Rich Jones
acdc08bf94 explicitly handle multiple solutions 2025-02-14 12:07:53 +01:00
Andreas Koepf
127f505798 add ArcAgiDataset class, fix score_entry() metadata params 2025-02-08 23:18:18 +01:00
Joe Norton
731d36f43f add palindrome score_answer
add palindrome score_answer & test
2025-02-02 18:04:47 -08:00
Joe Norton
d0d84ae82a lint 2025-01-31 18:45:52 -08:00
Joe Norton
f75d9a2829 add palindrome_generation 2025-01-31 18:45:52 -08:00