Commit graph

1541 commits

Author SHA1 Message Date
dmahan93
c421582b6f
Merge pull request #408 from daspartho/verl-integration-fixes
fix: re-append stop string in math training path
2026-03-10 23:08:58 -05:00
dmahan93
1d78069b5d
Bump version from 0.3.0 to 0.4.0 2026-03-09 23:17:01 -05:00
dmahan93
6facf0add5
Merge pull request #405 from NousResearch/add-openai-endpoint-for-managed-server
add tool call parsing based on vllm impl and an openai server endpoint
2026-03-09 23:16:00 -05:00
dmahan93
f198c1738e Merge conflict commit 2026-03-09 23:13:43 -05:00
Partho Das
632ab0161c Revert "rm hardcoded same score check"
This reverts commit f02c24204d.
2026-03-10 01:42:44 +05:30
dmahan93
c0db13978a
Merge pull request #409 from NousResearch/pre-commit-ci-update-config
[pre-commit.ci] pre-commit autoupdate
2026-03-09 14:43:08 -05:00
pre-commit-ci[bot]
880bb4a632
[pre-commit.ci] pre-commit autoupdate
updates:
- [github.com/psf/black-pre-commit-mirror: 26.1.0 → 26.3.0](https://github.com/psf/black-pre-commit-mirror/compare/26.1.0...26.3.0)
- [github.com/astral-sh/ruff-pre-commit: v0.15.4 → v0.15.5](https://github.com/astral-sh/ruff-pre-commit/compare/v0.15.4...v0.15.5)
- [github.com/codespell-project/codespell: v2.4.1 → v2.4.2](https://github.com/codespell-project/codespell/compare/v2.4.1...v2.4.2)
2026-03-09 16:44:14 +00:00
Partho Das
cdc23ba5dc Revert "allow serve openai overrides"
This reverts commit bd98a82bbc.
2026-03-08 04:42:09 +05:30
Partho Das
cd3a9163c7 Revert "eval max_token_length consistent with training config"
This reverts commit 5f52befd38.
2026-03-08 04:42:02 +05:30
J-SUPHA
1f676f2185
Merge pull request #406 from NousResearch/logprobsfn
Unified get_logprobs interface across the server stack
2026-03-05 17:36:22 -05:00
Jai Suphavadeeprasit
eb50099361 test_get_logprobs_input_ids_only_passthrough 2026-03-05 17:04:45 -05:00
0xbyt4
4d8e9b8086 fix: use sys.executable instead of hardcoded "python" in tests
Tests that launch the API server via subprocess used a hardcoded
"python" command which fails on systems where only "python3" is
available (e.g. macOS). Using sys.executable ensures the same
interpreter running pytest is used for subprocesses.

Fixes 36 test errors on macOS environments.
2026-03-05 17:04:45 -05:00
pre-commit-ci[bot]
b166c3a9d9 [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
2026-03-05 20:48:17 +00:00
Jai Suphavadeeprasit
b91922082e managed_Server pass through and centralize sem logic 2026-03-05 15:46:33 -05:00
dmahan93
f4875c5dc6 make preserve thinking optional 2026-03-04 15:44:12 -06:00
Jai Suphavadeeprasit
c85a3e5ee7 readme language 2026-03-03 23:44:29 -05:00
Jai Suphavadeeprasit
1a3d9ee664 testing 2026-03-03 23:38:04 -05:00
pre-commit-ci[bot]
efc90bfb1b [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
2026-03-04 04:18:12 +00:00
Jai Suphavadeeprasit
1eeb31065f fixing comments 2026-03-03 23:16:05 -05:00
Jai Suphavadeeprasit
51088ac24d add tests 2026-03-03 23:08:40 -05:00
pre-commit-ci[bot]
8f304d44fd [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
2026-03-04 03:08:19 +00:00
Jai Suphavadeeprasit
5aaf7a346c prompt logprobs simplicity 2026-03-03 22:06:49 -05:00
Jai Suphavadeeprasit
f1c20591b6 prompt logprobs 2026-03-03 21:58:05 -05:00
Jai Suphavadeeprasit
439b9b129b prompt logprobs 2026-03-03 21:58:05 -05:00
dmahan93
173bf7f208
Merge pull request #399 from 0xbyt4/bugfix/fix-python-hardcoded-in-tests
fix: use sys.executable instead of hardcoded "python" in tests
2026-03-03 19:56:27 -06:00
dmahan93
12d61d197f add env using the tool api stuff 2026-03-03 19:51:30 -06:00
dmahan93
c8eb63f33d readme updates for tool calling 2026-03-03 12:22:10 -06:00
pre-commit-ci[bot]
e98100e5f6 [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
2026-03-03 17:21:00 +00:00
Jai Suphavadeeprasit
323a8a2601 readme updates 2026-03-03 12:19:55 -05:00
Jai Suphavadeeprasit
b9291aa29f init commit 2026-03-03 11:32:09 -05:00
Partho Das
5f52befd38 eval max_token_length consistent with training config
instead of hardcoding, follows other envs pattern
2026-03-03 18:03:04 +05:30
dmahan93
8f21bb57ed add better warning message 2026-03-02 23:21:25 -06:00
dmahan93
add42a2afb add tool call parsing based on vllm impl and an openai server endpoint 2026-03-02 23:17:13 -06:00
dmahan93
887a94374c
Merge pull request #322 from NousResearch/pipelineRL
Pipeline rl
2026-03-02 21:02:48 -06:00
pre-commit-ci[bot]
b795d48a06 [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
2026-03-02 22:47:08 +00:00
dmahan93
be73d92723
Merge branch 'main' into pipelineRL 2026-03-02 16:43:32 -06:00
dmahan93
5235a9edca
Merge pull request #404 from NousResearch/pre-commit-ci-update-config
[pre-commit.ci] pre-commit autoupdate
2026-03-02 16:25:37 -06:00
dmahan93
3645107c42
Merge pull request #402 from NousResearch/add-new-precommit
add code-spell and secrects precommit
2026-03-02 16:25:26 -06:00
Jai Suphavadeeprasit
8d29f49a58 more terminal changes 2026-03-02 14:40:55 -05:00
pre-commit-ci[bot]
a41f75fc5f
[pre-commit.ci] pre-commit autoupdate
updates:
- [github.com/astral-sh/ruff-pre-commit: v0.15.2 → v0.15.4](https://github.com/astral-sh/ruff-pre-commit/compare/v0.15.2...v0.15.4)
2026-03-02 16:44:06 +00:00
Jai Suphavadeeprasit
2f01720899 more readme changes 2026-03-02 11:39:45 -05:00
Jai Suphavadeeprasit
585244559e more readme changes 2026-03-02 11:18:52 -05:00
Jai Suphavadeeprasit
4a7da8049f README changes 2026-03-02 11:18:52 -05:00
pre-commit-ci[bot]
91afc9e46e [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
2026-03-02 11:18:52 -05:00
Jai Suphavadeeprasit
d2ea8cd612 remove KL 2026-03-02 11:18:52 -05:00
Jai Suphavadeeprasit
dbf6026165 remove reqs and update community readme 2026-03-02 11:18:52 -05:00
Jai Suphavadeeprasit
fb3228f669 add this to our pyproject 2026-03-02 11:18:52 -05:00
Jai Suphavadeeprasit
ce85c7d95e H100 bug fixes 2026-03-02 11:18:52 -05:00
Jai Suphavadeeprasit
624b3cdabe feedback fixes: shared layers + hard coded values + warmup steps 2026-03-02 11:18:52 -05:00
Jai Suphavadeeprasit
e1f9b926bb script test 2026-03-02 11:18:52 -05:00