Commit graph

1306 commits

Author SHA1 Message Date
Jai Suphavadeeprasit
3de03d6db3 single copy 2026-02-13 11:26:25 -05:00
Jai Suphavadeeprasit
5ba06c7d4a threading 2026-02-13 11:26:25 -05:00
Jai Suphavadeeprasit
ca1ec60869 improve default 2026-02-13 11:26:25 -05:00
Jai Suphavadeeprasit
eed13670de better debugging 2026-02-13 11:26:25 -05:00
Jai Suphavadeeprasit
3ac4a64f6f patching problem 2026-02-13 11:26:25 -05:00
Jai Suphavadeeprasit
5af1a4a974 basic changes 2026-02-13 11:26:25 -05:00
Jai Suphavadeeprasit
007f4f275d changes 2026-02-13 11:26:25 -05:00
Jai Suphavadeeprasit
80f67f979a error handling 2026-02-13 11:26:25 -05:00
Jai Suphavadeeprasit
9e53076a82 param locations update 2026-02-13 11:26:25 -05:00
Jai Suphavadeeprasit
197fce640f daemon errors 2026-02-13 11:26:25 -05:00
Jai Suphavadeeprasit
3995e0af7d monkey patch fixes 2026-02-13 11:26:25 -05:00
Jai Suphavadeeprasit
7b975f3adc changes based on torchtitan 2 2026-02-13 11:26:25 -05:00
Jai Suphavadeeprasit
53b29472b4 changes based on torchtitan 2026-02-13 11:26:25 -05:00
Jai Suphavadeeprasit
078dd4a333 Cleanup 2026-02-13 11:26:25 -05:00
Jai Suphavadeeprasit
39e94c4278 weight updates async 2026-02-13 11:26:25 -05:00
Jai Suphavadeeprasit
b3874b658a vllm underlying weights 2026-02-13 11:26:25 -05:00
Jai Suphavadeeprasit
b0d35be8a4 IPC updates 2026-02-13 11:26:25 -05:00
Jai Suphavadeeprasit
e278978fa1 health changes 2026-02-13 11:26:25 -05:00
Jai Suphavadeeprasit
f51ae77f54 add missing parameter 2026-02-13 11:26:25 -05:00
Jai Suphavadeeprasit
d6f389f86f readme updates 2026-02-13 11:26:25 -05:00
Jai Suphavadeeprasit
88ccaa0ea5 standardize the training approach 2026-02-13 11:26:25 -05:00
Jai Suphavadeeprasit
ebdbc54842 tracking 2026-02-13 11:26:25 -05:00
Jai Suphavadeeprasit
9498d9576f training bug 2026-02-13 11:26:25 -05:00
Jai Suphavadeeprasit
d978eff127 smol changes 2026-02-13 11:26:25 -05:00
Jai Suphavadeeprasit
adc3ae712b design choice - LoRA and shared vLLM through the bridge 2026-02-13 11:26:25 -05:00
Jai Suphavadeeprasit
22648bd912 gradient checkpointing issue for LoRAs 2026-02-13 11:26:25 -05:00
Jai Suphavadeeprasit
1e7b7cf841 stuff 2026-02-13 11:26:25 -05:00
Jai Suphavadeeprasit
db7414329b generate endpoint with logprobs 2026-02-13 11:26:25 -05:00
Jai Suphavadeeprasit
e956af11a2 changes 2026-02-13 11:26:25 -05:00
Jai Suphavadeeprasit
426c0fac4c local version 2026-02-13 11:26:25 -05:00
Jai Suphavadeeprasit
7b143a7d68 correction 2026-02-13 11:26:25 -05:00
Jai Suphavadeeprasit
3ed23058c3 initial commit 2026-02-13 11:26:22 -05:00
Jai Suphavadeeprasit
407a22ba12 Save the eval to the disk 2026-02-13 11:25:49 -05:00
dmahan93
81b2d4daab
Merge pull request #375 from NousResearch/pre-commit-ci-update-config
[pre-commit.ci] pre-commit autoupdate
2026-02-09 21:09:44 -08:00
dmahan93
9ffd4de275
Merge pull request #362 from ansulx/fix/trl-vllm-completion-test
Add regression test for TRL vLLM completion wrapper
2026-02-09 21:06:12 -08:00
dmahan93
1580ab5934
Merge pull request #365 from alireza78a/fix/replace-debug-prints-with-logger
fix: replace debug print statements with logger
2026-02-09 21:01:38 -08:00
dmahan93
31a1cd1a8e
Merge pull request #355 from Ridwannurudeen/docs/improve-setup-and-troubleshooting
[docs] Clarify prerequisites, fix Python version inconsistency, and add troubleshooting section
2026-02-09 20:58:49 -08:00
dmahan93
17015f5f96
Merge pull request #373 from NousResearch/add-tokenizer-config-to-servers
add tokenizer name config to set the vllm/sglang tokenizer
2026-02-09 20:47:51 -08:00
pre-commit-ci[bot]
41df2a3701
[pre-commit.ci] pre-commit autoupdate
updates:
- [github.com/astral-sh/ruff-pre-commit: v0.14.14 → v0.15.0](https://github.com/astral-sh/ruff-pre-commit/compare/v0.14.14...v0.15.0)
2026-02-09 23:25:15 +00:00
Dakota
7d6aeb9bbf add tokenizer name config to set the vllm/sglang tokenizer to something different if needed 2026-02-09 15:26:29 -06:00
dmahan93
13f282aabc
Merge pull request #370 from alireza78a/fix/minor-bug-fixes
fix duplicate code + add safety checks
2026-02-09 13:10:28 -08:00
Alireza
6b92ee16ec fix duplicate code + add safety checks 2026-02-09 10:58:49 +03:30
Ridwan Nurudeen
b03b09febc Merge branch 'main' into docs/improve-setup-and-troubleshooting 2026-02-07 19:33:44 +01:00
alireza78a
1303cb59e8 fix: replace debug print statements with logger in dataset_env and infinimath_env 2026-02-07 14:51:33 +00:00
Ansul
3b9b67a3ad Merge branch 'main' into fix/trl-vllm-completion-test 2026-02-06 02:13:29 +05:30
ansulx
d97f366ae0 Add regression test for TRL vLLM completion wrapper
Ensure the TRL vLLM completion wrapper returns a Completion with text so issue #183 stays covered.
2026-02-06 01:57:16 +05:30
dmahan93
7da681ec46
Merge pull request #359 from NousResearch/add-dummy-managed-server-for-openai
Add dummy openai managed server
2026-02-04 14:28:22 -08:00
Dakota
9ff24bf370 change to 128 tokens to support low length rejection 2026-02-04 16:23:30 -06:00
Dakota
10f651289c Add dummy openai managed server 2026-02-04 15:16:36 -06:00
Ridwan Nurudeen
cc4b1f61a3 Revert badge change per reviewer request 2026-02-02 22:05:09 +01:00