mirror of
https://github.com/open-thought/reasoning-gym.git
synced 2026-04-19 12:58:07 +00:00
Commit graph
Select branches
Hide pull requests
Miserlou-patch-1
collision-check
curriculum
curriculum_refactor
external-bench-cfg
feat/curr-adj
feat/curriculum-exp
feat/game-eval-new
feat/multi-curriculum-exp
feat/ollie/tweak-length-reward
feat/unsloth-example
fix-slow-games
fix/update-training-readme
get_batch_epoch_param
main
revert_breaking_commits
rich/docsandstuff
rich/fixdice
rich/genbench
rich/qwentweak
rich/verl
#1
#100
#101
#102
#105
#106
#108
#109
#11
#110
#111
#112
#113
#114
#115
#116
#117
#118
#119
#12
#120
#123
#124
#125
#126
#127
#128
#129
#130
#131
#132
#133
#134
#135
#136
#137
#138
#14
#140
#141
#142
#143
#144
#145
#146
#147
#148
#149
#150
#151
#152
#154
#155
#158
#159
#161
#164
#165
#166
#167
#168
#169
#170
#172
#174
#175
#176
#177
#178
#179
#180
#181
#182
#183
#184
#185
#186
#188
#191
#193
#194
#195
#196
#197
#198
#199
#204
#205
#206
#21
#212
#215
#216
#217
#219
#22
#220
#23
#231
#232
#233
#235
#236
#237
#238
#239
#240
#241
#242
#243
#244
#246
#247
#248
#249
#250
#251
#252
#254
#255
#256
#257
#258
#259
#260
#261
#262
#263
#264
#265
#266
#267
#268
#269
#270
#271
#272
#273
#274
#275
#276
#277
#278
#279
#28
#280
#281
#282
#283
#284
#286
#287
#288
#289
#290
#291
#292
#293
#294
#295
#296
#297
#298
#299
#30
#300
#301
#302
#303
#304
#305
#306
#307
#308
#309
#31
#310
#311
#312
#313
#314
#315
#316
#317
#318
#319
#32
#320
#321
#322
#323
#324
#325
#326
#327
#328
#329
#33
#330
#331
#332
#333
#334
#335
#336
#337
#338
#339
#340
#341
#342
#343
#344
#345
#346
#347
#348
#349
#350
#351
#352
#353
#354
#355
#356
#357
#358
#359
#360
#361
#362
#363
#364
#365
#366
#367
#368
#369
#370
#371
#372
#373
#374
#375
#376
#378
#379
#38
#380
#381
#382
#383
#384
#386
#387
#388
#389
#39
#391
#392
#393
#394
#395
#396
#397
#398
#399
#40
#400
#401
#402
#403
#404
#405
#406
#407
#41
#410
#411
#412
#413
#414
#415
#416
#418
#42
#420
#422
#423
#424
#425
#426
#427
#43
#430
#431
#432
#433
#434
#435
#436
#437
#438
#439
#441
#442
#443
#444
#445
#446
#447
#448
#449
#450
#451
#454
#458
#459
#46
#460
#461
#462
#463
#464
#465
#47
#471
#471
#472
#473
#475
#476
#477
#479
#48
#480
#481
#482
#483
#485
#486
#487
#488
#489
#49
#490
#490
#491
#492
#493
#494
#495
#495
#497
#498
#499
#5
#50
#500
#502
#503
#504
#505
#506
#507
#508
#509
#51
#510
#511
#513
#514
#515
#517
#518
#519
#52
#523
#524
#525
#526
#527
#55
#57
#59
#60
#61
#62
#63
#64
#65
#66
#67
#68
#70
#71
#72
#73
#74
#75
#76
#77
#78
#79
#8
#80
#81
#82
#84
#85
#86
#87
#88
#89
#90
#91
#92
#93
#94
#97
#99
v0.1.20
v0.1.22
v0.1.23
v0.1.24
v0.1.25
-
6aec30c6ed
fix: Correct 1-digit number test ranges in chain sum test
Andreas Koepf (aider)
2025-01-23 12:14:15 +01:00 -
95fdca5a92
feat: Add special case handling for min_digits=1 in ChainSum generation
Andreas Koepf (aider)
2025-01-23 12:07:56 +01:00 -
516d4d20d4
feat: Add special case handling for min_digits=1 in ChainSum generation
Andreas Koepf (aider)
2025-01-23 12:07:56 +01:00 -
e1734b0b11
refactor: Move min_value and max_value calculations to __getitem__
Andreas Koepf (aider)
2025-01-23 12:05:55 +01:00 -
4777e6b435
refactor: Move min_value and max_value calculations to __getitem__
Andreas Koepf (aider)
2025-01-23 12:05:55 +01:00 -
1ff01627cc
feat: Implement allow_negation to generate both positive and negative numbers in ChainSum
Andreas Koepf (aider)
2025-01-23 12:01:21 +01:00 -
d2825f41ce
feat: Implement allow_negation to generate both positive and negative numbers in ChainSum
Andreas Koepf (aider)
2025-01-23 12:01:21 +01:00 -
358829cc60
refactor: Replace Random import with random module and update type hints
Andreas Koepf
2025-01-23 12:01:20 +01:00 -
c3bce305c1
refactor: Replace Random import with random module and update type hints
Andreas Koepf
2025-01-23 12:01:20 +01:00 -
654bed3146
refactor: Simplify ChainSum random number generation with base seed
Andreas Koepf (aider)
2025-01-23 11:56:36 +01:00 -
4aeb76ae8c
refactor: Simplify ChainSum random number generation with base seed
Andreas Koepf (aider)
2025-01-23 11:56:36 +01:00 -
d46f338160
feat: Add digit-based number range generation for chain sum tasks
Andreas Koepf (aider)
2025-01-23 11:46:55 +01:00 -
626fd78bda
feat: Add digit-based number range generation for chain sum tasks
Andreas Koepf (aider)
2025-01-23 11:46:55 +01:00 -
3d9b48ba31
feat: Add ChainSum class for generating simple arithmetic tasks
Andreas Koepf (aider)
2025-01-23 11:40:00 +01:00 -
c8aa98f4e8
feat: Add ChainSum class for generating simple arithmetic tasks
Andreas Koepf (aider)
2025-01-23 11:40:00 +01:00 -
37338d2107
chore: Add __init__.py to arithmetic module
Andreas Koepf
2025-01-23 11:39:59 +01:00 -
a1bd52b75e
chore: Add __init__.py to arithmetic module
Andreas Koepf
2025-01-23 11:39:59 +01:00 -
6b41e3cc47
fix: Prevent infinite loop in arithmetic dataset space generation
Andreas Koepf (aider)
2025-01-23 11:34:18 +01:00 -
fbba398c91
fix: Prevent infinite loop in arithmetic dataset space generation
Andreas Koepf (aider)
2025-01-23 11:34:18 +01:00 -
473a2be31d
feat: Unify arithmetic task generation with configurable dataset class
Andreas Koepf (aider)
2025-01-23 11:30:09 +01:00 -
40596262e1
feat: Unify arithmetic task generation with configurable dataset class
Andreas Koepf (aider)
2025-01-23 11:30:09 +01:00 -
88e438fa68
feat: Add basic arithmetic module to reasoning_gym
Andreas Koepf
2025-01-23 11:30:03 +01:00 -
8a6364a791
feat: Add basic arithmetic module to reasoning_gym
Andreas Koepf
2025-01-23 11:30:03 +01:00 -
b42ae939fd
fix README
Andreas Koepf
2025-01-23 11:04:41 +01:00 -
d4628270bb
fix README
Andreas Koepf
2025-01-23 11:04:41 +01:00 -
82afdb26e9
feat: Add Black and isort pre-commit hooks with line length configuration
Andreas Koepf (aider)
2025-01-23 11:02:13 +01:00 -
324e84897a
feat: Add Black and isort pre-commit hooks with line length configuration
Andreas Koepf (aider)
2025-01-23 11:02:13 +01:00 -
64641c4ee0
chore: Update project homepage and bug tracker URLs
Andreas Koepf (aider)
2025-01-23 10:58:29 +01:00 -
8d53002e1b
chore: Update project homepage and bug tracker URLs
Andreas Koepf (aider)
2025-01-23 10:58:29 +01:00 -
f98c16a6d3
chore: Update license to Apache 2.0 and author information
Andreas Koepf (aider)
2025-01-23 10:55:01 +01:00 -
ba45a1e6a0
chore: Update license to Apache 2.0 and author information
Andreas Koepf (aider)
2025-01-23 10:55:01 +01:00 -
4dc36f79e7
build: Initialize reasoning_gym package structure with packaging and development setup
Andreas Koepf (aider)
2025-01-23 10:50:54 +01:00 -
3f80fd7b80
build: Initialize reasoning_gym package structure with packaging and development setup
Andreas Koepf (aider)
2025-01-23 10:50:54 +01:00 -
32818d7cfd
chore: Add .gitignore with .aider and .env files
Andreas Koepf
2025-01-23 10:50:53 +01:00 -
530cb523c8
chore: Add .gitignore with .aider and .env files
Andreas Koepf
2025-01-23 10:50:53 +01:00 -
60ad5347f3
create inital README
Andreas Koepf
2025-01-23 09:40:28 +01:00 -
1b413e0b73
create inital README
Andreas Koepf
2025-01-23 09:40:28 +01:00 -
2e57412bef
Initial commit
Andreas Köpf
2025-01-23 09:39:53 +01:00 -
f5a31bed8dInitial commit
Andreas Köpf
2025-01-23 09:39:53 +01:00