dmahan93
c421582b6f
Merge pull request #408 from daspartho/verl-integration-fixes
...
fix: re-append stop string in math training path
2026-03-10 23:08:58 -05:00
dmahan93
1d78069b5d
Bump version from 0.3.0 to 0.4.0
2026-03-09 23:17:01 -05:00
dmahan93
6facf0add5
Merge pull request #405 from NousResearch/add-openai-endpoint-for-managed-server
...
add tool call parsing based on vllm impl and an openai server endpoint
2026-03-09 23:16:00 -05:00
dmahan93
f198c1738e
Merge conflict commit
2026-03-09 23:13:43 -05:00
Partho Das
632ab0161c
Revert "rm hardcoded same score check"
...
This reverts commit f02c24204d .
2026-03-10 01:42:44 +05:30
dmahan93
c0db13978a
Merge pull request #409 from NousResearch/pre-commit-ci-update-config
...
[pre-commit.ci] pre-commit autoupdate
2026-03-09 14:43:08 -05:00
pre-commit-ci[bot]
880bb4a632
[pre-commit.ci] pre-commit autoupdate
...
updates:
- [github.com/psf/black-pre-commit-mirror: 26.1.0 → 26.3.0](https://github.com/psf/black-pre-commit-mirror/compare/26.1.0...26.3.0 )
- [github.com/astral-sh/ruff-pre-commit: v0.15.4 → v0.15.5](https://github.com/astral-sh/ruff-pre-commit/compare/v0.15.4...v0.15.5 )
- [github.com/codespell-project/codespell: v2.4.1 → v2.4.2](https://github.com/codespell-project/codespell/compare/v2.4.1...v2.4.2 )
2026-03-09 16:44:14 +00:00
Partho Das
cdc23ba5dc
Revert "allow serve openai overrides"
...
This reverts commit bd98a82bbc .
2026-03-08 04:42:09 +05:30
Partho Das
cd3a9163c7
Revert "eval max_token_length consistent with training config"
...
This reverts commit 5f52befd38 .
2026-03-08 04:42:02 +05:30
J-SUPHA
1f676f2185
Merge pull request #406 from NousResearch/logprobsfn
...
Unified get_logprobs interface across the server stack
2026-03-05 17:36:22 -05:00
Jai Suphavadeeprasit
eb50099361
test_get_logprobs_input_ids_only_passthrough
2026-03-05 17:04:45 -05:00
0xbyt4
4d8e9b8086
fix: use sys.executable instead of hardcoded "python" in tests
...
Tests that launch the API server via subprocess used a hardcoded
"python" command which fails on systems where only "python3" is
available (e.g. macOS). Using sys.executable ensures the same
interpreter running pytest is used for subprocesses.
Fixes 36 test errors on macOS environments.
2026-03-05 17:04:45 -05:00
pre-commit-ci[bot]
b166c3a9d9
[pre-commit.ci] auto fixes from pre-commit.com hooks
...
for more information, see https://pre-commit.ci
2026-03-05 20:48:17 +00:00
Jai Suphavadeeprasit
b91922082e
managed_Server pass through and centralize sem logic
2026-03-05 15:46:33 -05:00
dmahan93
f4875c5dc6
make preserve thinking optional
2026-03-04 15:44:12 -06:00
Jai Suphavadeeprasit
c85a3e5ee7
readme language
2026-03-03 23:44:29 -05:00
Jai Suphavadeeprasit
1a3d9ee664
testing
2026-03-03 23:38:04 -05:00
pre-commit-ci[bot]
efc90bfb1b
[pre-commit.ci] auto fixes from pre-commit.com hooks
...
for more information, see https://pre-commit.ci
2026-03-04 04:18:12 +00:00
Jai Suphavadeeprasit
1eeb31065f
fixing comments
2026-03-03 23:16:05 -05:00
Jai Suphavadeeprasit
51088ac24d
add tests
2026-03-03 23:08:40 -05:00
pre-commit-ci[bot]
8f304d44fd
[pre-commit.ci] auto fixes from pre-commit.com hooks
...
for more information, see https://pre-commit.ci
2026-03-04 03:08:19 +00:00
Jai Suphavadeeprasit
5aaf7a346c
prompt logprobs simplicity
2026-03-03 22:06:49 -05:00
Jai Suphavadeeprasit
f1c20591b6
prompt logprobs
2026-03-03 21:58:05 -05:00
Jai Suphavadeeprasit
439b9b129b
prompt logprobs
2026-03-03 21:58:05 -05:00
dmahan93
173bf7f208
Merge pull request #399 from 0xbyt4/bugfix/fix-python-hardcoded-in-tests
...
fix: use sys.executable instead of hardcoded "python" in tests
2026-03-03 19:56:27 -06:00
dmahan93
12d61d197f
add env using the tool api stuff
2026-03-03 19:51:30 -06:00
dmahan93
c8eb63f33d
readme updates for tool calling
2026-03-03 12:22:10 -06:00
pre-commit-ci[bot]
e98100e5f6
[pre-commit.ci] auto fixes from pre-commit.com hooks
...
for more information, see https://pre-commit.ci
2026-03-03 17:21:00 +00:00
Jai Suphavadeeprasit
323a8a2601
readme updates
2026-03-03 12:19:55 -05:00
Jai Suphavadeeprasit
b9291aa29f
init commit
2026-03-03 11:32:09 -05:00
Partho Das
5f52befd38
eval max_token_length consistent with training config
...
instead of hardcoding, follows other envs pattern
2026-03-03 18:03:04 +05:30
dmahan93
8f21bb57ed
add better warning message
2026-03-02 23:21:25 -06:00
dmahan93
add42a2afb
add tool call parsing based on vllm impl and an openai server endpoint
2026-03-02 23:17:13 -06:00
dmahan93
887a94374c
Merge pull request #322 from NousResearch/pipelineRL
...
Pipeline rl
2026-03-02 21:02:48 -06:00
pre-commit-ci[bot]
b795d48a06
[pre-commit.ci] auto fixes from pre-commit.com hooks
...
for more information, see https://pre-commit.ci
2026-03-02 22:47:08 +00:00
dmahan93
be73d92723
Merge branch 'main' into pipelineRL
2026-03-02 16:43:32 -06:00
dmahan93
5235a9edca
Merge pull request #404 from NousResearch/pre-commit-ci-update-config
...
[pre-commit.ci] pre-commit autoupdate
2026-03-02 16:25:37 -06:00
dmahan93
3645107c42
Merge pull request #402 from NousResearch/add-new-precommit
...
add code-spell and secrects precommit
2026-03-02 16:25:26 -06:00
Jai Suphavadeeprasit
8d29f49a58
more terminal changes
2026-03-02 14:40:55 -05:00
pre-commit-ci[bot]
a41f75fc5f
[pre-commit.ci] pre-commit autoupdate
...
updates:
- [github.com/astral-sh/ruff-pre-commit: v0.15.2 → v0.15.4](https://github.com/astral-sh/ruff-pre-commit/compare/v0.15.2...v0.15.4 )
2026-03-02 16:44:06 +00:00
Jai Suphavadeeprasit
2f01720899
more readme changes
2026-03-02 11:39:45 -05:00
Jai Suphavadeeprasit
585244559e
more readme changes
2026-03-02 11:18:52 -05:00
Jai Suphavadeeprasit
4a7da8049f
README changes
2026-03-02 11:18:52 -05:00
pre-commit-ci[bot]
91afc9e46e
[pre-commit.ci] auto fixes from pre-commit.com hooks
...
for more information, see https://pre-commit.ci
2026-03-02 11:18:52 -05:00
Jai Suphavadeeprasit
d2ea8cd612
remove KL
2026-03-02 11:18:52 -05:00
Jai Suphavadeeprasit
dbf6026165
remove reqs and update community readme
2026-03-02 11:18:52 -05:00
Jai Suphavadeeprasit
fb3228f669
add this to our pyproject
2026-03-02 11:18:52 -05:00
Jai Suphavadeeprasit
ce85c7d95e
H100 bug fixes
2026-03-02 11:18:52 -05:00
Jai Suphavadeeprasit
624b3cdabe
feedback fixes: shared layers + hard coded values + warmup steps
2026-03-02 11:18:52 -05:00
Jai Suphavadeeprasit
e1f9b926bb
script test
2026-03-02 11:18:52 -05:00