Commit graph

1580 commits

Author SHA1 Message Date
Jai Suphavadeeprasit
862cd3667d clean logging 2026-03-13 12:38:52 -04:00
Jai Suphavadeeprasit
600c54f5f8 clean log 2026-03-13 12:12:33 -04:00
pre-commit-ci[bot]
d1b0dee8f7 [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
2026-03-13 15:14:09 +00:00
Jai Suphavadeeprasit
d8857eb69f investigating weird training issue 2026-03-13 11:06:02 -04:00
Jai Suphavadeeprasit
3df0e45659 investigating weird training issue 2026-03-13 11:06:02 -04:00
Jai Suphavadeeprasit
690e670e64 investigating weird training issue 2026-03-13 11:06:02 -04:00
Jai Suphavadeeprasit
a43b0b7e72 training kernel 2026-03-13 11:06:02 -04:00
Jai Suphavadeeprasit
7ec622a098 training ideas 2026-03-13 11:06:02 -04:00
Jai Suphavadeeprasit
c26432b963 training kernel 2026-03-13 11:06:02 -04:00
Jai Suphavadeeprasit
62ef2fcc2e training kernel 2026-03-13 11:06:02 -04:00
Jai Suphavadeeprasit
a54dfe7a13 tokenizer bug 2026-03-13 11:06:02 -04:00
Jai Suphavadeeprasit
c37516b289 tokenizer bug 2026-03-13 11:06:02 -04:00
Jai Suphavadeeprasit
fd5b426f9f tokenizer bug 2026-03-13 11:06:02 -04:00
Jai Suphavadeeprasit
34a39367dc tokenizer bug 2026-03-13 11:06:02 -04:00
Jai Suphavadeeprasit
8a348beccd tokenizer bug 2026-03-13 11:06:02 -04:00
Jai Suphavadeeprasit
2f371e03fc tokenizer bug 2026-03-13 11:06:02 -04:00
Jai Suphavadeeprasit
b457a678ce tokenizer bug 2026-03-13 11:06:02 -04:00
Jai Suphavadeeprasit
3a440f847c tokenizer bug 2026-03-13 11:06:02 -04:00
Jai Suphavadeeprasit
c275687fba tokenizer bug 2026-03-13 11:06:02 -04:00
Jai Suphavadeeprasit
f1cfc137ec tokenizer bug 2026-03-13 11:06:02 -04:00
Jai Suphavadeeprasit
78c0a6d082 tokenizer bug 2026-03-13 11:06:02 -04:00
Jai Suphavadeeprasit
98a5d3b334 testing config 2026-03-13 11:06:02 -04:00
Jai Suphavadeeprasit
82be871979 testing config 2026-03-13 11:06:02 -04:00
Jai Suphavadeeprasit
abba562d4a testing config 2026-03-13 11:06:02 -04:00
Jai Suphavadeeprasit
e79af5ff69 testing config 2026-03-13 11:06:02 -04:00
Jai Suphavadeeprasit
e84686b4fd remove enforce eager 2026-03-13 11:06:02 -04:00
Jai Suphavadeeprasit
057c9fe870 shorten worker timeout 2026-03-13 11:06:02 -04:00
Jai Suphavadeeprasit
d1fd89f992 non blocking test 2026-03-13 11:06:02 -04:00
Jai Suphavadeeprasit
09ad401995 sneaky bug logging 2026-03-13 11:06:02 -04:00
Jai Suphavadeeprasit
64794e7c72 sneaky bug 2026-03-13 11:06:00 -04:00
Jai Suphavadeeprasit
bb2736db4e next 2026-03-13 11:05:40 -04:00
Jai Suphavadeeprasit
4f33ab8bf4 apparently not so easy 2026-03-13 11:04:57 -04:00
Jai Suphavadeeprasit
81f90a67b5 forgot something easy 2026-03-13 11:04:57 -04:00
Jai Suphavadeeprasit
e5633527ba quicker training 2026-03-13 11:04:57 -04:00
Jai Suphavadeeprasit
985311eb94 trial 2026-03-13 11:04:57 -04:00
Jai Suphavadeeprasit
ad364ac771 increase timeout cause vllm is super slow all of a sudden 2026-03-13 11:04:57 -04:00
Jai Suphavadeeprasit
d5ca760f36 command change 2026-03-13 11:04:57 -04:00
Jai Suphavadeeprasit
530fed2877 testing set up 2026-03-13 11:04:57 -04:00
Jai Suphavadeeprasit
f44eb810bf teacher env init 2026-03-13 11:04:57 -04:00
dmahan93
c421582b6f Merge pull request #408 from daspartho/verl-integration-fixes
fix: re-append stop string in math training path
2026-03-10 23:08:58 -05:00
dmahan93
1d78069b5d Bump version from 0.3.0 to 0.4.0 2026-03-09 23:17:01 -05:00
dmahan93
6facf0add5 Merge pull request #405 from NousResearch/add-openai-endpoint-for-managed-server
add tool call parsing based on vllm impl and an openai server endpoint
2026-03-09 23:16:00 -05:00
dmahan93
f198c1738e Merge conflict commit 2026-03-09 23:13:43 -05:00
Partho Das
632ab0161c Revert "rm hardcoded same score check"
This reverts commit f02c24204d.
2026-03-10 01:42:44 +05:30
dmahan93
c0db13978a Merge pull request #409 from NousResearch/pre-commit-ci-update-config
[pre-commit.ci] pre-commit autoupdate
2026-03-09 14:43:08 -05:00
pre-commit-ci[bot]
880bb4a632 [pre-commit.ci] pre-commit autoupdate
updates:
- [github.com/psf/black-pre-commit-mirror: 26.1.0 → 26.3.0](https://github.com/psf/black-pre-commit-mirror/compare/26.1.0...26.3.0)
- [github.com/astral-sh/ruff-pre-commit: v0.15.4 → v0.15.5](https://github.com/astral-sh/ruff-pre-commit/compare/v0.15.4...v0.15.5)
- [github.com/codespell-project/codespell: v2.4.1 → v2.4.2](https://github.com/codespell-project/codespell/compare/v2.4.1...v2.4.2)
2026-03-09 16:44:14 +00:00
Partho Das
cdc23ba5dc Revert "allow serve openai overrides"
This reverts commit bd98a82bbc.
2026-03-08 04:42:09 +05:30
Partho Das
cd3a9163c7 Revert "eval max_token_length consistent with training config"
This reverts commit 5f52befd38.
2026-03-08 04:42:02 +05:30
J-SUPHA
1f676f2185 Merge pull request #406 from NousResearch/logprobsfn
Unified get_logprobs interface across the server stack
2026-03-05 17:36:22 -05:00
Jai Suphavadeeprasit
eb50099361 test_get_logprobs_input_ids_only_passthrough 2026-03-05 17:04:45 -05:00