Commit graph

1572 commits

Author SHA1 Message Date
Jai Suphavadeeprasit
c26432b963 training kernel 2026-03-13 11:06:02 -04:00
Jai Suphavadeeprasit
62ef2fcc2e training kernel 2026-03-13 11:06:02 -04:00
Jai Suphavadeeprasit
a54dfe7a13 tokenizer bug 2026-03-13 11:06:02 -04:00
Jai Suphavadeeprasit
c37516b289 tokenizer bug 2026-03-13 11:06:02 -04:00
Jai Suphavadeeprasit
fd5b426f9f tokenizer bug 2026-03-13 11:06:02 -04:00
Jai Suphavadeeprasit
34a39367dc tokenizer bug 2026-03-13 11:06:02 -04:00
Jai Suphavadeeprasit
8a348beccd tokenizer bug 2026-03-13 11:06:02 -04:00
Jai Suphavadeeprasit
2f371e03fc tokenizer bug 2026-03-13 11:06:02 -04:00
Jai Suphavadeeprasit
b457a678ce tokenizer bug 2026-03-13 11:06:02 -04:00
Jai Suphavadeeprasit
3a440f847c tokenizer bug 2026-03-13 11:06:02 -04:00
Jai Suphavadeeprasit
c275687fba tokenizer bug 2026-03-13 11:06:02 -04:00
Jai Suphavadeeprasit
f1cfc137ec tokenizer bug 2026-03-13 11:06:02 -04:00
Jai Suphavadeeprasit
78c0a6d082 tokenizer bug 2026-03-13 11:06:02 -04:00
Jai Suphavadeeprasit
98a5d3b334 testing config 2026-03-13 11:06:02 -04:00
Jai Suphavadeeprasit
82be871979 testing config 2026-03-13 11:06:02 -04:00
Jai Suphavadeeprasit
abba562d4a testing config 2026-03-13 11:06:02 -04:00
Jai Suphavadeeprasit
e79af5ff69 testing config 2026-03-13 11:06:02 -04:00
Jai Suphavadeeprasit
e84686b4fd remove enforce eager 2026-03-13 11:06:02 -04:00
Jai Suphavadeeprasit
057c9fe870 shorten worker timeout 2026-03-13 11:06:02 -04:00
Jai Suphavadeeprasit
d1fd89f992 non blocking test 2026-03-13 11:06:02 -04:00
Jai Suphavadeeprasit
09ad401995 sneaky bug logging 2026-03-13 11:06:02 -04:00
Jai Suphavadeeprasit
64794e7c72 sneaky bug 2026-03-13 11:06:00 -04:00
Jai Suphavadeeprasit
bb2736db4e next 2026-03-13 11:05:40 -04:00
Jai Suphavadeeprasit
4f33ab8bf4 apparently not so easy 2026-03-13 11:04:57 -04:00
Jai Suphavadeeprasit
81f90a67b5 forgot something easy 2026-03-13 11:04:57 -04:00
Jai Suphavadeeprasit
e5633527ba quicker training 2026-03-13 11:04:57 -04:00
Jai Suphavadeeprasit
985311eb94 trial 2026-03-13 11:04:57 -04:00
Jai Suphavadeeprasit
ad364ac771 increase timeout cause vllm is super slow all of a sudden 2026-03-13 11:04:57 -04:00
Jai Suphavadeeprasit
d5ca760f36 command change 2026-03-13 11:04:57 -04:00
Jai Suphavadeeprasit
530fed2877 testing set up 2026-03-13 11:04:57 -04:00
Jai Suphavadeeprasit
f44eb810bf teacher env init 2026-03-13 11:04:57 -04:00
dmahan93
c421582b6f
Merge pull request #408 from daspartho/verl-integration-fixes
fix: re-append stop string in math training path
2026-03-10 23:08:58 -05:00
dmahan93
1d78069b5d
Bump version from 0.3.0 to 0.4.0 2026-03-09 23:17:01 -05:00
dmahan93
6facf0add5
Merge pull request #405 from NousResearch/add-openai-endpoint-for-managed-server
add tool call parsing based on vllm impl and an openai server endpoint
2026-03-09 23:16:00 -05:00
dmahan93
f198c1738e Merge conflict commit 2026-03-09 23:13:43 -05:00
Partho Das
632ab0161c Revert "rm hardcoded same score check"
This reverts commit f02c24204d.
2026-03-10 01:42:44 +05:30
dmahan93
c0db13978a
Merge pull request #409 from NousResearch/pre-commit-ci-update-config
[pre-commit.ci] pre-commit autoupdate
2026-03-09 14:43:08 -05:00
pre-commit-ci[bot]
880bb4a632
[pre-commit.ci] pre-commit autoupdate
updates:
- [github.com/psf/black-pre-commit-mirror: 26.1.0 → 26.3.0](https://github.com/psf/black-pre-commit-mirror/compare/26.1.0...26.3.0)
- [github.com/astral-sh/ruff-pre-commit: v0.15.4 → v0.15.5](https://github.com/astral-sh/ruff-pre-commit/compare/v0.15.4...v0.15.5)
- [github.com/codespell-project/codespell: v2.4.1 → v2.4.2](https://github.com/codespell-project/codespell/compare/v2.4.1...v2.4.2)
2026-03-09 16:44:14 +00:00
Partho Das
cdc23ba5dc Revert "allow serve openai overrides"
This reverts commit bd98a82bbc.
2026-03-08 04:42:09 +05:30
Partho Das
cd3a9163c7 Revert "eval max_token_length consistent with training config"
This reverts commit 5f52befd38.
2026-03-08 04:42:02 +05:30
J-SUPHA
1f676f2185
Merge pull request #406 from NousResearch/logprobsfn
Unified get_logprobs interface across the server stack
2026-03-05 17:36:22 -05:00
Jai Suphavadeeprasit
eb50099361 test_get_logprobs_input_ids_only_passthrough 2026-03-05 17:04:45 -05:00
0xbyt4
4d8e9b8086 fix: use sys.executable instead of hardcoded "python" in tests
Tests that launch the API server via subprocess used a hardcoded
"python" command which fails on systems where only "python3" is
available (e.g. macOS). Using sys.executable ensures the same
interpreter running pytest is used for subprocesses.

Fixes 36 test errors on macOS environments.
2026-03-05 17:04:45 -05:00
pre-commit-ci[bot]
b166c3a9d9 [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
2026-03-05 20:48:17 +00:00
Jai Suphavadeeprasit
b91922082e managed_Server pass through and centralize sem logic 2026-03-05 15:46:33 -05:00
dmahan93
f4875c5dc6 make preserve thinking optional 2026-03-04 15:44:12 -06:00
Jai Suphavadeeprasit
c85a3e5ee7 readme language 2026-03-03 23:44:29 -05:00
Jai Suphavadeeprasit
1a3d9ee664 testing 2026-03-03 23:38:04 -05:00
pre-commit-ci[bot]
efc90bfb1b [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
2026-03-04 04:18:12 +00:00
Jai Suphavadeeprasit
1eeb31065f fixing comments 2026-03-03 23:16:05 -05:00