Alvarez
|
d762c229e2
|
Update instructions.py
|
2026-02-27 10:23:47 +01:00 |
|
dmahan93
|
7ceed9b6d9
|
Merge pull request #388 from milord12345/fix/replace-print-with-logger-reasoning-gym
refactor: replace print statements with self.logger in reasoning_gym_environment.py
|
2026-02-24 14:24:12 -06:00 |
|
dmahan93
|
329a233bba
|
Merge pull request #389 from CreeptoGengar/fix/validate-without-train
fix: handle validation without training
|
2026-02-23 14:21:40 -06:00 |
|
pre-commit-ci[bot]
|
a930d3db12
|
[pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
|
2026-02-21 19:25:14 +00:00 |
|
VolodymyrBg
|
7e5ddbce06
|
fix: add try/finally to guarantee gym environment cleanup
|
2026-02-21 21:23:46 +02:00 |
|
pre-commit-ci[bot]
|
929980185d
|
[pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
|
2026-02-21 13:54:38 +00:00 |
|
Gengar
|
34c8c87f0f
|
fix: handle validation without training
Added validation functionality to the training process and refactored validation method to use a dedicated validator instance.
|
2026-02-21 15:53:37 +02:00 |
|
pre-commit-ci[bot]
|
623dadc5cd
|
[pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
|
2026-02-20 16:43:18 +00:00 |
|
milord1234
|
853703ffc5
|
refactor: replace print statements with self.logger in reasoning_gym_environment.py
Replace 20 print() calls with appropriate logging levels:
- Error messages -> self.logger.error()
- Warnings -> self.logger.warning()
- Info/status messages -> self.logger.info()
- Debug messages -> self.logger.debug()
Left 2 top-level print() calls untouched (no logger access).
|
2026-02-20 19:57:43 +03:30 |
|
dmahan93
|
708b42a00f
|
Merge pull request #378 from johnh4098/add-regex-generation-env
Add regex generation environment for community
|
2026-02-18 12:37:32 -08:00 |
|
pre-commit-ci[bot]
|
53a69d30e1
|
[pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
|
2026-02-11 19:47:28 +00:00 |
|
johnh4098
|
86d5163316
|
Add regex generation environment for community
|
2026-02-11 23:04:47 +03:30 |
|
victlop
|
4c18b883c5
|
Merge branch 'main' into cleanup/remove-redundant-import-comments
|
2026-02-11 19:14:59 +03:30 |
|
victlop
|
a1823f99fb
|
chore: remove redundant inline comments from swe_rl_env.py imports
|
2026-02-11 19:08:47 +03:30 |
|
dmahan93
|
1580ab5934
|
Merge pull request #365 from alireza78a/fix/replace-debug-prints-with-logger
fix: replace debug print statements with logger
|
2026-02-09 21:01:38 -08:00 |
|
Alireza
|
6b92ee16ec
|
fix duplicate code + add safety checks
|
2026-02-09 10:58:49 +03:30 |
|
alireza78a
|
1303cb59e8
|
fix: replace debug print statements with logger in dataset_env and infinimath_env
|
2026-02-07 14:51:33 +00:00 |
|
Teknium
|
462abbebf7
|
Merge pull request #339 from VolodymyrBg/bg
chore: fix typos
|
2026-01-31 09:03:17 -08:00 |
|
Teknium
|
efc85528bc
|
Merge pull request #338 from windlgrass/fix-init-current-item
fix: initialize current_item in __init__ to prevent AttributeError
|
2026-01-31 09:02:06 -08:00 |
|
Teknium
|
8b22416dd4
|
Merge branch 'main' into fix-duplicate-code
|
2026-01-31 08:52:43 -08:00 |
|
VolodymyrBg
|
f285bbd417
|
Update refusalbench_environment.py
|
2026-01-29 12:43:15 +02:00 |
|
VolodymyrBg
|
94f29eac18
|
Update simpleqa_eval.py
|
2026-01-29 12:42:28 +02:00 |
|
VolodymyrBg
|
347edc9188
|
Update instructions.py
|
2026-01-29 12:31:52 +02:00 |
|
VolodymyrBg
|
466fd96b41
|
Update patient.py
|
2026-01-29 12:16:31 +02:00 |
|
VolodymyrBg
|
39f3509965
|
Update instruction_following_algorithm_environment.py
|
2026-01-29 11:22:05 +02:00 |
|
Wind
|
eb5be87f81
|
Update dataset_env.py
|
2026-01-29 15:16:34 +07:00 |
|
Wind
|
6c2f1ac408
|
Update dataset_env.py
|
2026-01-29 15:16:05 +07:00 |
|
Wind
|
2607942ffa
|
Update dataset_env.py
|
2026-01-29 15:11:31 +07:00 |
|
dmahan93
|
e8fd85429f
|
Merge pull request #323 from NousResearch/pre-commit-ci-update-config
[pre-commit.ci] pre-commit autoupdate
|
2026-01-26 11:02:44 -08:00 |
|
dmahan93
|
b8ec055942
|
Merge pull request #324 from DeVikingMark/fix/gradient-quantile-prefix
fix: use correct prefix for gradient quantiles with NaN/Inf
|
2026-01-26 11:01:36 -08:00 |
|
dmahan93
|
cf2b280d52
|
Merge pull request #325 from crStiv/typo
fix: multiple typos of different importance
|
2026-01-26 11:00:44 -08:00 |
|
pre-commit-ci[bot]
|
2be7442dd5
|
[pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
|
2026-01-26 16:41:26 +00:00 |
|
Wind
|
42601e2325
|
Update instructions_utils.py
|
2026-01-26 17:24:12 +07:00 |
|
Wind
|
7feb826fed
|
Update instructions_registry.py
|
2026-01-26 17:23:39 +07:00 |
|
Wind
|
883043de49
|
Update instructions.py
|
2026-01-26 17:14:57 +07:00 |
|
dmahan93
|
5af29933a7
|
Merge pull request #305 from alt-glitch/sid/verifiers
Verifiers Integration
|
2026-01-23 10:21:49 -08:00 |
|
balyan.sid@gmail.com
|
4ba69d3a80
|
revert to using evalbase
|
2026-01-23 23:41:32 +05:30 |
|
balyan.sid@gmail.com
|
5a20abdce7
|
switch eval to use managed server adapter impl. moved managed server
adapter
|
2026-01-23 23:26:29 +05:30 |
|
Siddharth Balyan
|
32d12c05c3
|
Merge branch 'main' into sid/verifiers
|
2026-01-23 21:57:13 +05:30 |
|
Wind
|
4f24688d18
|
Update coding_server.py
|
2026-01-22 15:19:28 +07:00 |
|
Teknium
|
faf84d241c
|
Merge branch 'main' into patch-1
|
2026-01-21 05:55:56 -08:00 |
|
crStiv
|
b44eca5a5e
|
Fix typo in TODO comment in plot.py
|
2026-01-20 00:14:51 +02:00 |
|
crStiv
|
8edfbe1de4
|
Fix typo in error message for resume type
|
2026-01-20 00:12:12 +02:00 |
|
crStiv
|
abc9ad3c73
|
Fix typos in comments for clarity
|
2026-01-20 00:07:50 +02:00 |
|
crStiv
|
ee97038408
|
Fix typos in instruction description methods
Corrected typos in the docstring for build_description and another function.
|
2026-01-19 23:58:55 +02:00 |
|
crStiv
|
3db6276299
|
Fix typo in README.md for GamePigeon
|
2026-01-19 23:50:20 +02:00 |
|
crStiv
|
31266ba5b9
|
Fix typo in fight commentator prompt
|
2026-01-19 23:45:52 +02:00 |
|
crStiv
|
d8f29a6026
|
Fix typo in fight commentator prompt
|
2026-01-19 23:45:39 +02:00 |
|
Ragnar
|
5c8ee88f0f
|
Update callbacks.py
|
2026-01-19 20:39:21 +02:00 |
|
Siddharth Balyan
|
7f28c52994
|
Merge branch 'main' into sid/verifiers
|
2026-01-16 11:50:27 +05:30 |
|