Commit graph

55 commits

Author SHA1 Message Date
Zafir Stojanovski
a48ff14507 add difficulty where possible (#274) 2025-03-07 19:01:26 +01:00
Andreas Köpf
c2263979bc Basic curriculum (#198)
* feat: Add optional curriculum support to dataset registration and creation
* docs: Add docstrings to create_curriculum() and register_dataset()
* feat: Add curriculum configuration classes for CurriculumExperiment
* feat: Add weight parameter to CurriculumAttributeConfig and use in DatasetSpec
* refactor: Simplify CurriculumAttributeConfig with "*" attribute level support
* test: Add unit tests for CurriculumExperiment class
* feat: Add from_yaml() method to CurriculumExperimentConfig with unit test
2025-03-07 11:22:12 +01:00
Zafir Stojanovski
f843ac1b82 shortest path curriculum (#271) 2025-03-05 22:46:10 +01:00
Zafir Stojanovski
a048084009 largest island curriculum (#270) 2025-03-05 22:45:35 +01:00
Zafir Stojanovski
84158df1c7 feat(env): Course Schedule Curriculum (#266)
* course schedule curriculum

* update levels

* update comments

* lint
2025-03-05 22:42:46 +01:00
Andreas Köpf
b2904ccab9 Minor question template & score_answer improvements (#261)
* math prompt improvements
* ignore brackets in complex_arithmetic results
* improve additional instruction in prompt of polynomial_equations
* more strict tests for score_answer in polynomial_equations
* simplify special reward handling
* fix test_intermediate_integration
* fix sokoban dataset
* add common dataset score_answer consistency test
2025-03-04 21:55:09 +01:00
joesharratt1229
bf24999bb0 implemented family_relationships score ans (#260) 2025-03-04 21:37:57 +01:00
Rich Jones
e3b7365f50 Game of Life partial scoring and rule-clarification (#258)
* partial scoring and rule clarification
* better ql scoring
* word seq reverse typos
2025-03-03 22:22:39 +01:00
Zafir Stojanovski
2f9d94c1e7 fix: Unify Prompts (#254)
* remove cot
* fix prompt template
* fix pool matrix
* spiral matrix fixed
2025-03-03 21:55:53 +01:00
Andreas Koepf
0b8c4bce0c reduce size of default shortest_path maze grid 2025-02-23 22:27:17 +01:00
Zafir Stojanovski
915a0f1f51 predict actual path 2025-02-23 18:24:23 +01:00
Zafir Stojanovski
df914dfb49 shortest path 2025-02-23 11:25:00 +01:00
Andreas Koepf
ff5b210106 use native types List->list, Dict->dict, Set->set, Tuple->tuple 2025-02-21 15:15:38 +01:00
Oliver
0de0044d52 Formatting/scoring improvements for BF & family 2025-02-17 19:08:15 +00:00
Zafir Stojanovski
2dd53241e5 remove unmodified dunder methods 2025-02-06 22:56:11 +01:00
Zafir Stojanovski
0b8fb1f6aa typo 2025-02-05 12:08:18 +01:00
Zafir Stojanovski
0de4d5ca4c course schedule 2025-02-04 23:50:24 +01:00
Zafir Stojanovski
739418c2b1 pre-commit 2025-02-03 23:25:01 +01:00
Zafir Stojanovski
ff8573b830 add clarifications for bounds 2025-02-03 23:21:58 +01:00
Zafir Stojanovski
14e68a76cb added largest island code 2025-02-03 22:46:06 +01:00
Andreas Koepf
6ecd25c283 add quantum lock answer format hint 2025-02-02 22:35:43 +01:00
Andreas Koepf
ccf282cc90 post merge lint 2025-02-02 10:04:18 +01:00
Andreas Koepf (aider)
2c979c3913 refactor: Use field default_factory TimeIntervalsConfig, AdvancedGeometryConfig 2025-02-02 09:55:51 +01:00
Andreas Koepf
25540b6634 lint 2025-01-30 22:55:04 +01:00
Andreas Köpf
fb8e0f21af Merge branch 'main' into miserlou/bfi 2025-01-30 22:45:01 +01:00
Andreas Koepf
28a7f7f532 add simple dataset gallery generation script 2025-01-30 22:30:26 +01:00
Rich Jones
645aa13a15 init definitions 2025-01-30 17:15:48 +01:00
Andreas Koepf
0f5fc0fb93 add algorithmic verification hint in README, lint 2025-01-30 10:14:54 +01:00
Andreas Koepf (aider)
22303dedb6 fix: Add validate() method to QuantumLockConfig 2025-01-30 01:22:46 +01:00
Andreas Koepf
3bc1291db2 refactor: Improve QuantumLock dataset with type hints, random seed, and code structure 2025-01-30 01:22:43 +01:00
Rich Jones
e99a9c59c3 Add QL puzz 2025-01-29 23:33:39 +01:00
Rich Jones
451af16f98 initial puzzle 2025-01-29 23:25:59 +01:00
Andreas Koepf (aider)
f23c52116a feat: Add mother-in-law and father-in-law relationship detection 2025-01-27 21:24:35 +01:00
Andreas Koepf
9227d438ab add uncle, aunt & niece and nephew family relationships 2025-01-27 21:19:48 +01:00
Andreas Koepf (aider)
54fc49c02c feat: Randomly assign children to multiple couples in family generation 2025-01-27 20:54:58 +01:00
Andreas Koepf
59042db1c2 refactor: Rename paternal_aunt variable to aunt for consistency 2025-01-27 20:54:56 +01:00
Andreas Koepf (aider)
b4c77768ed feat: Add paternal aunt and her husband to family generation 2025-01-27 20:50:22 +01:00
Andreas Koepf (aider)
6669acc568 feat: Add paternal uncle and aunt to family generation process 2025-01-27 20:48:28 +01:00
Andreas Koepf (aider)
84c0107ce0 feat: Add separate maternal and paternal grandparents to family relationships 2025-01-27 20:41:13 +01:00
Andreas Koepf (aider)
9f878e0244 feat: Add aunt, uncle, niece, and nephew relationships to family graph 2025-01-27 20:28:18 +01:00
Andreas Koepf
c3b6af35f0 min python 3.11 to support StrEnum 2025-01-26 22:17:43 +01:00
Andreas Koepf
ad9f0d265c fix unit tests, lower python dependency to 3.9 2025-01-26 16:55:17 +01:00
Andreas Koepf
519e411fa5 add reasoning_gym.create_dataset({name}, ...) global factory function 2025-01-25 00:58:34 +01:00
Andreas Koepf
0d2d8ba6a0 pass config to ProceduralDataset base 2025-01-25 00:23:05 +01:00
Andreas Koepf
669bd97066 cleanup 2025-01-24 17:39:37 +01:00
Andreas Koepf (aider)
53f7a9238c refactor: Use StrEnum and lowercase values for Gender and Relationship enums 2025-01-24 17:25:35 +01:00
Andreas Koepf
5c5d46b4bd formatting, cleanup 2025-01-24 17:12:42 +01:00
Andreas Koepf (aider)
49ac56831f feat: Add 10 modern female names to default name list 2025-01-24 17:11:09 +01:00
Andreas Koepf (aider)
1033598de0 feat: Add 10 modern male names to default name list 2025-01-24 17:10:02 +01:00
Andreas Koepf (aider)
22427cce2e feat: Add 20 more male and female names to default name lists 2025-01-24 17:08:53 +01:00