dmahan93
|
31a1cd1a8e
|
Merge pull request #355 from Ridwannurudeen/docs/improve-setup-and-troubleshooting
[docs] Clarify prerequisites, fix Python version inconsistency, and add troubleshooting section
|
2026-02-09 20:58:49 -08:00 |
|
dmahan93
|
17015f5f96
|
Merge pull request #373 from NousResearch/add-tokenizer-config-to-servers
add tokenizer name config to set the vllm/sglang tokenizer
|
2026-02-09 20:47:51 -08:00 |
|
pre-commit-ci[bot]
|
41df2a3701
|
[pre-commit.ci] pre-commit autoupdate
updates:
- [github.com/astral-sh/ruff-pre-commit: v0.14.14 → v0.15.0](https://github.com/astral-sh/ruff-pre-commit/compare/v0.14.14...v0.15.0)
|
2026-02-09 23:25:15 +00:00 |
|
Dakota
|
7d6aeb9bbf
|
add tokenizer name config to set the vllm/sglang tokenizer to something different if needed
|
2026-02-09 15:26:29 -06:00 |
|
dmahan93
|
13f282aabc
|
Merge pull request #370 from alireza78a/fix/minor-bug-fixes
fix duplicate code + add safety checks
|
2026-02-09 13:10:28 -08:00 |
|
Alireza
|
6b92ee16ec
|
fix duplicate code + add safety checks
|
2026-02-09 10:58:49 +03:30 |
|
Ridwan Nurudeen
|
b03b09febc
|
Merge branch 'main' into docs/improve-setup-and-troubleshooting
|
2026-02-07 19:33:44 +01:00 |
|
alireza78a
|
1303cb59e8
|
fix: replace debug print statements with logger in dataset_env and infinimath_env
|
2026-02-07 14:51:33 +00:00 |
|
Ansul
|
3b9b67a3ad
|
Merge branch 'main' into fix/trl-vllm-completion-test
|
2026-02-06 02:13:29 +05:30 |
|
ansulx
|
d97f366ae0
|
Add regression test for TRL vLLM completion wrapper
Ensure the TRL vLLM completion wrapper returns a Completion with text so issue #183 stays covered.
|
2026-02-06 01:57:16 +05:30 |
|
dmahan93
|
7da681ec46
|
Merge pull request #359 from NousResearch/add-dummy-managed-server-for-openai
Add dummy openai managed server
|
2026-02-04 14:28:22 -08:00 |
|
Dakota
|
9ff24bf370
|
change to 128 tokens to support low length rejection
|
2026-02-04 16:23:30 -06:00 |
|
Dakota
|
10f651289c
|
Add dummy openai managed server
|
2026-02-04 15:16:36 -06:00 |
|
Ridwan Nurudeen
|
cc4b1f61a3
|
Revert badge change per reviewer request
|
2026-02-02 22:05:09 +01:00 |
|
Ridwannurudeen
|
5e2e84835b
|
[docs] Clarify prerequisites, fix Python version inconsistency, and add troubleshooting section
|
2026-02-01 23:39:37 +01:00 |
|
Teknium
|
462abbebf7
|
Merge pull request #339 from VolodymyrBg/bg
chore: fix typos
|
2026-01-31 09:03:17 -08:00 |
|
Teknium
|
efc85528bc
|
Merge pull request #338 from windlgrass/fix-init-current-item
fix: initialize current_item in __init__ to prevent AttributeError
|
2026-01-31 09:02:06 -08:00 |
|
Teknium
|
a2330dc099
|
Merge pull request #334 from HusseinAdeiza/fix-typos-docs
Fix typos in SLURM.md
|
2026-01-31 08:59:53 -08:00 |
|
Teknium
|
c2f0de563e
|
Merge branch 'main' into fix-typos-docs
|
2026-01-31 08:57:23 -08:00 |
|
Teknium
|
4bbea4ec8e
|
Merge pull request #330 from windlgrass/fix-duplicate-code
fix: remove duplicate code in instruction files
|
2026-01-31 08:55:26 -08:00 |
|
Teknium
|
8b22416dd4
|
Merge branch 'main' into fix-duplicate-code
|
2026-01-31 08:52:43 -08:00 |
|
VolodymyrBg
|
f285bbd417
|
Update refusalbench_environment.py
|
2026-01-29 12:43:15 +02:00 |
|
VolodymyrBg
|
94f29eac18
|
Update simpleqa_eval.py
|
2026-01-29 12:42:28 +02:00 |
|
VolodymyrBg
|
347edc9188
|
Update instructions.py
|
2026-01-29 12:31:52 +02:00 |
|
VolodymyrBg
|
466fd96b41
|
Update patient.py
|
2026-01-29 12:16:31 +02:00 |
|
VolodymyrBg
|
39f3509965
|
Update instruction_following_algorithm_environment.py
|
2026-01-29 11:22:05 +02:00 |
|
VolodymyrBg
|
1eb0d72099
|
Update FAQ.md
|
2026-01-29 10:43:47 +02:00 |
|
VolodymyrBg
|
e0744adf28
|
Update README.md
|
2026-01-29 10:23:53 +02:00 |
|
VolodymyrBg
|
dd02df0d76
|
Update base.py
|
2026-01-29 10:22:51 +02:00 |
|
Wind
|
eb5be87f81
|
Update dataset_env.py
|
2026-01-29 15:16:34 +07:00 |
|
Wind
|
6c2f1ac408
|
Update dataset_env.py
|
2026-01-29 15:16:05 +07:00 |
|
VolodymyrBg
|
77a3505955
|
Update test_openai_api_workarounds.py
|
2026-01-29 10:13:50 +02:00 |
|
Wind
|
2607942ffa
|
Update dataset_env.py
|
2026-01-29 15:11:31 +07:00 |
|
HusseinAdeiza
|
91cf15b933
|
Fix typos in SLURM.md
|
2026-01-28 16:48:39 +01:00 |
|
dmahan93
|
e8fd85429f
|
Merge pull request #323 from NousResearch/pre-commit-ci-update-config
[pre-commit.ci] pre-commit autoupdate
|
2026-01-26 11:02:44 -08:00 |
|
dmahan93
|
b8ec055942
|
Merge pull request #324 from DeVikingMark/fix/gradient-quantile-prefix
fix: use correct prefix for gradient quantiles with NaN/Inf
|
2026-01-26 11:01:36 -08:00 |
|
dmahan93
|
cf2b280d52
|
Merge pull request #325 from crStiv/typo
fix: multiple typos of different importance
|
2026-01-26 11:00:44 -08:00 |
|
dmahan93
|
7134d08a46
|
Merge pull request #329 from windlgrass/fix-typos-in-instructions-docstrings
fix: correct typos in instructions.py
|
2026-01-26 10:59:03 -08:00 |
|
pre-commit-ci[bot]
|
2be7442dd5
|
[pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
|
2026-01-26 16:41:26 +00:00 |
|
pre-commit-ci[bot]
|
9115df2895
|
[pre-commit.ci] pre-commit autoupdate
updates:
- [github.com/psf/black-pre-commit-mirror: 25.12.0 → 26.1.0](https://github.com/psf/black-pre-commit-mirror/compare/25.12.0...26.1.0)
- [github.com/astral-sh/ruff-pre-commit: v0.14.11 → v0.14.14](https://github.com/astral-sh/ruff-pre-commit/compare/v0.14.11...v0.14.14)
|
2026-01-26 16:40:38 +00:00 |
|
Wind
|
42601e2325
|
Update instructions_utils.py
|
2026-01-26 17:24:12 +07:00 |
|
Wind
|
7feb826fed
|
Update instructions_registry.py
|
2026-01-26 17:23:39 +07:00 |
|
Wind
|
883043de49
|
Update instructions.py
|
2026-01-26 17:14:57 +07:00 |
|
dmahan93
|
5af29933a7
|
Merge pull request #305 from alt-glitch/sid/verifiers
Verifiers Integration
|
2026-01-23 10:21:49 -08:00 |
|
balyan.sid@gmail.com
|
4ba69d3a80
|
revert to using evalbase
|
2026-01-23 23:41:32 +05:30 |
|
balyan.sid@gmail.com
|
5a20abdce7
|
switch eval to use managed server adapter impl. moved managed server adapter
|
2026-01-23 23:26:29 +05:30 |
|
dmahan93
|
1e3be64b5f
|
Merge pull request #327 from windlgrass/fix-max-token-length-typo
fix: typo in max_token_length
|
2026-01-23 09:55:36 -08:00 |
|
Siddharth Balyan
|
32d12c05c3
|
Merge branch 'main' into sid/verifiers
|
2026-01-23 21:57:13 +05:30 |
|
Wind
|
4f24688d18
|
Update coding_server.py
|
2026-01-22 15:19:28 +07:00 |
|
Teknium
|
1f814a5c10
|
Merge pull request #289 from GHOryy5/patch-1
Prevent hangs in kernel evaluation by bounding worker waits
|
2026-01-21 09:04:50 -08:00 |
|