Zafir Stojanovski
a48ff14507
add difficulty where possible ( #274 )
2025-03-07 19:01:26 +01:00
Andreas Köpf
c2263979bc
Basic curriculum ( #198 )
...
* feat: Add optional curriculum support to dataset registration and creation
* docs: Add docstrings to create_curriculum() and register_dataset()
* feat: Add curriculum configuration classes for CurriculumExperiment
* feat: Add weight parameter to CurriculumAttributeConfig and use in DatasetSpec
* refactor: Simplify CurriculumAttributeConfig with "*" attribute level support
* test: Add unit tests for CurriculumExperiment class
* feat: Add from_yaml() method to CurriculumExperimentConfig with unit test
2025-03-07 11:22:12 +01:00
Zafir Stojanovski
f843ac1b82
shortest path curriculum ( #271 )
2025-03-05 22:46:10 +01:00
Zafir Stojanovski
a048084009
largest island curriculum ( #270 )
2025-03-05 22:45:35 +01:00
Zafir Stojanovski
84158df1c7
feat(env): Course Schedule Curriculum ( #266 )
...
* course schedule curriculum
* update levels
* update comments
* lint
2025-03-05 22:42:46 +01:00
Andreas Köpf
b2904ccab9
Minor question template & score_answer improvements ( #261 )
...
* math prompt improvements
* ignore brackets in complex_arithmetic results
* improve additional instruction in prompt of polynomial_equations
* more strict tests for score_answer in polynomial_equations
* simplify special reward handling
* fix test_intermediate_integration
* fix sokoban dataset
* add common dataset score_answer consistency test
2025-03-04 21:55:09 +01:00
joesharratt1229
bf24999bb0
implemented family_relationships score ans ( #260 )
2025-03-04 21:37:57 +01:00
Rich Jones
e3b7365f50
Game of Life partial scoring and rule-clarification ( #258 )
...
* partial scoring and rule clarification
* better ql scoring
* word seq reverse typos
2025-03-03 22:22:39 +01:00
Zafir Stojanovski
2f9d94c1e7
fix: Unify Prompts ( #254 )
...
* remove cot
* fix prompt template
* fix pool matrix
* spiral matrix fixed
2025-03-03 21:55:53 +01:00
Andreas Koepf
0b8c4bce0c
reduce size of default shortest_path maze grid
2025-02-23 22:27:17 +01:00
Zafir Stojanovski
915a0f1f51
predict actual path
2025-02-23 18:24:23 +01:00
Zafir Stojanovski
df914dfb49
shortest path
2025-02-23 11:25:00 +01:00
Andreas Koepf
ff5b210106
use native types List->list, Dict->dict, Set->set, Tuple->tuple
2025-02-21 15:15:38 +01:00
Oliver
0de0044d52
Formatting/scoring improvements for BF & family
2025-02-17 19:08:15 +00:00
Zafir Stojanovski
2dd53241e5
remove unmodified dunder methods
2025-02-06 22:56:11 +01:00
Zafir Stojanovski
0b8fb1f6aa
typo
2025-02-05 12:08:18 +01:00
Zafir Stojanovski
0de4d5ca4c
course schedule
2025-02-04 23:50:24 +01:00
Zafir Stojanovski
739418c2b1
pre-commit
2025-02-03 23:25:01 +01:00
Zafir Stojanovski
ff8573b830
add clarifications for bounds
2025-02-03 23:21:58 +01:00
Zafir Stojanovski
14e68a76cb
added largest island code
2025-02-03 22:46:06 +01:00
Andreas Koepf
6ecd25c283
add quantum lock answer format hint
2025-02-02 22:35:43 +01:00
Andreas Koepf
ccf282cc90
post merge lint
2025-02-02 10:04:18 +01:00
Andreas Koepf (aider)
2c979c3913
refactor: Use field default_factory TimeIntervalsConfig, AdvancedGeometryConfig
2025-02-02 09:55:51 +01:00
Andreas Koepf
25540b6634
lint
2025-01-30 22:55:04 +01:00
Andreas Köpf
fb8e0f21af
Merge branch 'main' into miserlou/bfi
2025-01-30 22:45:01 +01:00
Andreas Koepf
28a7f7f532
add simple dataset gallery generation script
2025-01-30 22:30:26 +01:00
Rich Jones
645aa13a15
init definitions
2025-01-30 17:15:48 +01:00
Andreas Koepf
0f5fc0fb93
add algorithmic verification hint in README, lint
2025-01-30 10:14:54 +01:00
Andreas Koepf (aider)
22303dedb6
fix: Add validate() method to QuantumLockConfig
2025-01-30 01:22:46 +01:00
Andreas Koepf
3bc1291db2
refactor: Improve QuantumLock dataset with type hints, random seed, and code structure
2025-01-30 01:22:43 +01:00
Rich Jones
e99a9c59c3
Add QL puzz
2025-01-29 23:33:39 +01:00
Rich Jones
451af16f98
initial puzzle
2025-01-29 23:25:59 +01:00
Andreas Koepf (aider)
f23c52116a
feat: Add mother-in-law and father-in-law relationship detection
2025-01-27 21:24:35 +01:00
Andreas Koepf
9227d438ab
add uncle, aunt & niece and nephew family relationships
2025-01-27 21:19:48 +01:00
Andreas Koepf (aider)
54fc49c02c
feat: Randomly assign children to multiple couples in family generation
2025-01-27 20:54:58 +01:00
Andreas Koepf
59042db1c2
refactor: Rename paternal_aunt variable to aunt for consistency
2025-01-27 20:54:56 +01:00
Andreas Koepf (aider)
b4c77768ed
feat: Add paternal aunt and her husband to family generation
2025-01-27 20:50:22 +01:00
Andreas Koepf (aider)
6669acc568
feat: Add paternal uncle and aunt to family generation process
2025-01-27 20:48:28 +01:00
Andreas Koepf (aider)
84c0107ce0
feat: Add separate maternal and paternal grandparents to family relationships
2025-01-27 20:41:13 +01:00
Andreas Koepf (aider)
9f878e0244
feat: Add aunt, uncle, niece, and nephew relationships to family graph
2025-01-27 20:28:18 +01:00
Andreas Koepf
c3b6af35f0
min python 3.11 to support StrEnum
2025-01-26 22:17:43 +01:00
Andreas Koepf
ad9f0d265c
fix unit tests, lower python dependency to 3.9
2025-01-26 16:55:17 +01:00
Andreas Koepf
519e411fa5
add reasoning_gym.create_dataset({name}, ...) global factory function
2025-01-25 00:58:34 +01:00
Andreas Koepf
0d2d8ba6a0
pass config to ProceduralDataset base
2025-01-25 00:23:05 +01:00
Andreas Koepf
669bd97066
cleanup
2025-01-24 17:39:37 +01:00
Andreas Koepf (aider)
53f7a9238c
refactor: Use StrEnum and lowercase values for Gender and Relationship enums
2025-01-24 17:25:35 +01:00
Andreas Koepf
5c5d46b4bd
formatting, cleanup
2025-01-24 17:12:42 +01:00
Andreas Koepf (aider)
49ac56831f
feat: Add 10 modern female names to default name list
2025-01-24 17:11:09 +01:00
Andreas Koepf (aider)
1033598de0
feat: Add 10 modern male names to default name list
2025-01-24 17:10:02 +01:00
Andreas Koepf (aider)
22427cce2e
feat: Add 20 more male and female names to default name lists
2025-01-24 17:08:53 +01:00