Commit graph

1320 commits

Author SHA1 Message Date
dmahan93
31a1cd1a8e
Merge pull request #355 from Ridwannurudeen/docs/improve-setup-and-troubleshooting
[docs] Clarify prerequisites, fix Python version inconsistency, and add troubleshooting section
2026-02-09 20:58:49 -08:00
dmahan93
17015f5f96
Merge pull request #373 from NousResearch/add-tokenizer-config-to-servers
add tokenizer name config to set the vllm/sglang tokenizer
2026-02-09 20:47:51 -08:00
pre-commit-ci[bot]
41df2a3701
[pre-commit.ci] pre-commit autoupdate
updates:
- [github.com/astral-sh/ruff-pre-commit: v0.14.14 → v0.15.0](https://github.com/astral-sh/ruff-pre-commit/compare/v0.14.14...v0.15.0)
2026-02-09 23:25:15 +00:00
Dakota
7d6aeb9bbf add tokenizer name config to set the vllm/sglang tokenizer to something different if needed 2026-02-09 15:26:29 -06:00
dmahan93
13f282aabc
Merge pull request #370 from alireza78a/fix/minor-bug-fixes
fix duplicate code + add safety checks
2026-02-09 13:10:28 -08:00
Alireza
6b92ee16ec fix duplicate code + add safety checks 2026-02-09 10:58:49 +03:30
Ridwan Nurudeen
b03b09febc
Merge branch 'main' into docs/improve-setup-and-troubleshooting 2026-02-07 19:33:44 +01:00
alireza78a
1303cb59e8 fix: replace debug print statements with logger in dataset_env and infinimath_env 2026-02-07 14:51:33 +00:00
Ansul
3b9b67a3ad
Merge branch 'main' into fix/trl-vllm-completion-test 2026-02-06 02:13:29 +05:30
ansulx
d97f366ae0 Add regression test for TRL vLLM completion wrapper
Ensure the TRL vLLM completion wrapper returns a Completion with text so issue #183 stays covered.
2026-02-06 01:57:16 +05:30
dmahan93
7da681ec46
Merge pull request #359 from NousResearch/add-dummy-managed-server-for-openai
Add dummy openai managed server
2026-02-04 14:28:22 -08:00
Dakota
9ff24bf370 change to 128 tokens to support low length rejection 2026-02-04 16:23:30 -06:00
Dakota
10f651289c Add dummy openai managed server 2026-02-04 15:16:36 -06:00
Ridwan Nurudeen
cc4b1f61a3
Revert badge change per reviewer request 2026-02-02 22:05:09 +01:00
Ridwannurudeen
5e2e84835b [docs] Clarify prerequisites, fix Python version inconsistency, and add troubleshooting section 2026-02-01 23:39:37 +01:00
Teknium
462abbebf7
Merge pull request #339 from VolodymyrBg/bg
chore: fix typos
2026-01-31 09:03:17 -08:00
Teknium
efc85528bc
Merge pull request #338 from windlgrass/fix-init-current-item
fix: initialize current_item in __init__ to prevent AttributeError
2026-01-31 09:02:06 -08:00
Teknium
a2330dc099
Merge pull request #334 from HusseinAdeiza/fix-typos-docs
Fix typos in SLURM.md
2026-01-31 08:59:53 -08:00
Teknium
c2f0de563e
Merge branch 'main' into fix-typos-docs 2026-01-31 08:57:23 -08:00
Teknium
4bbea4ec8e
Merge pull request #330 from windlgrass/fix-duplicate-code
fix: remove duplicate code in instruction files
2026-01-31 08:55:26 -08:00
Teknium
8b22416dd4
Merge branch 'main' into fix-duplicate-code 2026-01-31 08:52:43 -08:00
VolodymyrBg
f285bbd417
Update refusalbench_environment.py 2026-01-29 12:43:15 +02:00
VolodymyrBg
94f29eac18
Update simpleqa_eval.py 2026-01-29 12:42:28 +02:00
VolodymyrBg
347edc9188
Update instructions.py 2026-01-29 12:31:52 +02:00
VolodymyrBg
466fd96b41
Update patient.py 2026-01-29 12:16:31 +02:00
VolodymyrBg
39f3509965
Update instruction_following_algorithm_environment.py 2026-01-29 11:22:05 +02:00
VolodymyrBg
1eb0d72099
Update FAQ.md 2026-01-29 10:43:47 +02:00
VolodymyrBg
e0744adf28
Update README.md 2026-01-29 10:23:53 +02:00
VolodymyrBg
dd02df0d76
Update base.py 2026-01-29 10:22:51 +02:00
Wind
eb5be87f81
Update dataset_env.py 2026-01-29 15:16:34 +07:00
Wind
6c2f1ac408
Update dataset_env.py 2026-01-29 15:16:05 +07:00
VolodymyrBg
77a3505955
Update test_openai_api_workarounds.py 2026-01-29 10:13:50 +02:00
Wind
2607942ffa
Update dataset_env.py 2026-01-29 15:11:31 +07:00
HusseinAdeiza
91cf15b933 Fix typos in SLURM.md 2026-01-28 16:48:39 +01:00
dmahan93
e8fd85429f
Merge pull request #323 from NousResearch/pre-commit-ci-update-config
[pre-commit.ci] pre-commit autoupdate
2026-01-26 11:02:44 -08:00
dmahan93
b8ec055942
Merge pull request #324 from DeVikingMark/fix/gradient-quantile-prefix
fix: use correct prefix for gradient quantiles with NaN/Inf
2026-01-26 11:01:36 -08:00
dmahan93
cf2b280d52
Merge pull request #325 from crStiv/typo
fix: multiple typos of different importance
2026-01-26 11:00:44 -08:00
dmahan93
7134d08a46
Merge pull request #329 from windlgrass/fix-typos-in-instructions-docstrings
fix: correct typos in instructions.py
2026-01-26 10:59:03 -08:00
pre-commit-ci[bot]
2be7442dd5 [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
2026-01-26 16:41:26 +00:00
pre-commit-ci[bot]
9115df2895
[pre-commit.ci] pre-commit autoupdate
updates:
- [github.com/psf/black-pre-commit-mirror: 25.12.0 → 26.1.0](https://github.com/psf/black-pre-commit-mirror/compare/25.12.0...26.1.0)
- [github.com/astral-sh/ruff-pre-commit: v0.14.11 → v0.14.14](https://github.com/astral-sh/ruff-pre-commit/compare/v0.14.11...v0.14.14)
2026-01-26 16:40:38 +00:00
Wind
42601e2325
Update instructions_utils.py 2026-01-26 17:24:12 +07:00
Wind
7feb826fed
Update instructions_registry.py 2026-01-26 17:23:39 +07:00
Wind
883043de49
Update instructions.py 2026-01-26 17:14:57 +07:00
dmahan93
5af29933a7
Merge pull request #305 from alt-glitch/sid/verifiers
Verifiers Integration
2026-01-23 10:21:49 -08:00
balyan.sid@gmail.com
4ba69d3a80 revert to using evalbase 2026-01-23 23:41:32 +05:30
balyan.sid@gmail.com
5a20abdce7 switch eval to use managed server adapter impl. moved managed server adapter
2026-01-23 23:26:29 +05:30
dmahan93
1e3be64b5f
Merge pull request #327 from windlgrass/fix-max-token-length-typo
fix: typo in max_token_length
2026-01-23 09:55:36 -08:00
Siddharth Balyan
32d12c05c3
Merge branch 'main' into sid/verifiers 2026-01-23 21:57:13 +05:30
Wind
4f24688d18
Update coding_server.py 2026-01-22 15:19:28 +07:00
Teknium
1f814a5c10
Merge pull request #289 from GHOryy5/patch-1
Prevent hangs in kernel evaluation by bounding worker waits
2026-01-21 09:04:50 -08:00