Jai Suphavadeeprasit
a43b0b7e72
training kernel
2026-03-13 11:06:02 -04:00
Jai Suphavadeeprasit
7ec622a098
training ideas
2026-03-13 11:06:02 -04:00
Jai Suphavadeeprasit
c26432b963
training kernel
2026-03-13 11:06:02 -04:00
Jai Suphavadeeprasit
62ef2fcc2e
training kernel
2026-03-13 11:06:02 -04:00
Jai Suphavadeeprasit
a54dfe7a13
tokenizer bug
2026-03-13 11:06:02 -04:00
Jai Suphavadeeprasit
c37516b289
tokenizer bug
2026-03-13 11:06:02 -04:00
Jai Suphavadeeprasit
fd5b426f9f
tokenizer bug
2026-03-13 11:06:02 -04:00
Jai Suphavadeeprasit
34a39367dc
tokenizer bug
2026-03-13 11:06:02 -04:00
Jai Suphavadeeprasit
8a348beccd
tokenizer bug
2026-03-13 11:06:02 -04:00
Jai Suphavadeeprasit
2f371e03fc
tokenizer bug
2026-03-13 11:06:02 -04:00
Jai Suphavadeeprasit
b457a678ce
tokenizer bug
2026-03-13 11:06:02 -04:00
Jai Suphavadeeprasit
3a440f847c
tokenizer bug
2026-03-13 11:06:02 -04:00
Jai Suphavadeeprasit
c275687fba
tokenizer bug
2026-03-13 11:06:02 -04:00
Jai Suphavadeeprasit
f1cfc137ec
tokenizer bug
2026-03-13 11:06:02 -04:00
Jai Suphavadeeprasit
78c0a6d082
tokenizer bug
2026-03-13 11:06:02 -04:00
Jai Suphavadeeprasit
98a5d3b334
testing config
2026-03-13 11:06:02 -04:00
Jai Suphavadeeprasit
82be871979
testing config
2026-03-13 11:06:02 -04:00
Jai Suphavadeeprasit
abba562d4a
testing config
2026-03-13 11:06:02 -04:00
Jai Suphavadeeprasit
e79af5ff69
testing config
2026-03-13 11:06:02 -04:00
Jai Suphavadeeprasit
e84686b4fd
remove enforce eager
2026-03-13 11:06:02 -04:00
Jai Suphavadeeprasit
057c9fe870
shorten worker timeout
2026-03-13 11:06:02 -04:00
Jai Suphavadeeprasit
d1fd89f992
non blocking test
2026-03-13 11:06:02 -04:00
Jai Suphavadeeprasit
09ad401995
sneaky bug logging
2026-03-13 11:06:02 -04:00
Jai Suphavadeeprasit
64794e7c72
sneaky bug
2026-03-13 11:06:00 -04:00
Jai Suphavadeeprasit
bb2736db4e
next
2026-03-13 11:05:40 -04:00
Jai Suphavadeeprasit
4f33ab8bf4
apparently not so easy
2026-03-13 11:04:57 -04:00
Jai Suphavadeeprasit
81f90a67b5
forgot something easy
2026-03-13 11:04:57 -04:00
Jai Suphavadeeprasit
e5633527ba
quicker training
2026-03-13 11:04:57 -04:00
Jai Suphavadeeprasit
985311eb94
trial
2026-03-13 11:04:57 -04:00
Jai Suphavadeeprasit
ad364ac771
increase timeout cause vllm is super slow all of a sudden
2026-03-13 11:04:57 -04:00
Jai Suphavadeeprasit
d5ca760f36
command change
2026-03-13 11:04:57 -04:00
Jai Suphavadeeprasit
530fed2877
testing set up
2026-03-13 11:04:57 -04:00
Jai Suphavadeeprasit
f44eb810bf
teacher env init
2026-03-13 11:04:57 -04:00
dmahan93
c421582b6f
Merge pull request #408 from daspartho/verl-integration-fixes
...
fix: re-append stop string in math training path
2026-03-10 23:08:58 -05:00
dmahan93
1d78069b5d
Bump version from 0.3.0 to 0.4.0
2026-03-09 23:17:01 -05:00
dmahan93
6facf0add5
Merge pull request #405 from NousResearch/add-openai-endpoint-for-managed-server
...
add tool call parsing based on vllm impl and an openai server endpoint
2026-03-09 23:16:00 -05:00
dmahan93
f198c1738e
Merge conflict commit
2026-03-09 23:13:43 -05:00
Partho Das
632ab0161c
Revert "rm hardcoded same score check"
...
This reverts commit f02c24204d .
2026-03-10 01:42:44 +05:30
dmahan93
c0db13978a
Merge pull request #409 from NousResearch/pre-commit-ci-update-config
...
[pre-commit.ci] pre-commit autoupdate
2026-03-09 14:43:08 -05:00
pre-commit-ci[bot]
880bb4a632
[pre-commit.ci] pre-commit autoupdate
...
updates:
- [github.com/psf/black-pre-commit-mirror: 26.1.0 → 26.3.0](https://github.com/psf/black-pre-commit-mirror/compare/26.1.0...26.3.0 )
- [github.com/astral-sh/ruff-pre-commit: v0.15.4 → v0.15.5](https://github.com/astral-sh/ruff-pre-commit/compare/v0.15.4...v0.15.5 )
- [github.com/codespell-project/codespell: v2.4.1 → v2.4.2](https://github.com/codespell-project/codespell/compare/v2.4.1...v2.4.2 )
2026-03-09 16:44:14 +00:00
Partho Das
cdc23ba5dc
Revert "allow serve openai overrides"
...
This reverts commit bd98a82bbc .
2026-03-08 04:42:09 +05:30
Partho Das
cd3a9163c7
Revert "eval max_token_length consistent with training config"
...
This reverts commit 5f52befd38 .
2026-03-08 04:42:02 +05:30
J-SUPHA
1f676f2185
Merge pull request #406 from NousResearch/logprobsfn
...
Unified get_logprobs interface across the server stack
2026-03-05 17:36:22 -05:00
Jai Suphavadeeprasit
eb50099361
test_get_logprobs_input_ids_only_passthrough
2026-03-05 17:04:45 -05:00
0xbyt4
4d8e9b8086
fix: use sys.executable instead of hardcoded "python" in tests
...
Tests that launch the API server via subprocess used a hardcoded
"python" command which fails on systems where only "python3" is
available (e.g. macOS). Using sys.executable ensures the same
interpreter running pytest is used for subprocesses.
Fixes 36 test errors on macOS environments.
2026-03-05 17:04:45 -05:00
pre-commit-ci[bot]
b166c3a9d9
[pre-commit.ci] auto fixes from pre-commit.com hooks
...
for more information, see https://pre-commit.ci
2026-03-05 20:48:17 +00:00
Jai Suphavadeeprasit
b91922082e
managed_Server pass through and centralize sem logic
2026-03-05 15:46:33 -05:00
dmahan93
f4875c5dc6
make preserve thinking optional
2026-03-04 15:44:12 -06:00
Jai Suphavadeeprasit
c85a3e5ee7
readme language
2026-03-03 23:44:29 -05:00
Jai Suphavadeeprasit
1a3d9ee664
testing
2026-03-03 23:38:04 -05:00