Commit graph

282 commits

Author SHA1 Message Date
dmahan93
f198c1738e Merge conflict commit 2026-03-09 23:13:43 -05:00
Jai Suphavadeeprasit
eb50099361 test_get_logprobs_input_ids_only_passthrough 2026-03-05 17:04:45 -05:00
0xbyt4
4d8e9b8086 fix: use sys.executable instead of hardcoded "python" in tests
Tests that launch the API server via subprocess used a hardcoded
"python" command which fails on systems where only "python3" is
available (e.g. macOS). Using sys.executable ensures the same
interpreter running pytest is used for subprocesses.

Fixes 36 test errors on macOS environments.
2026-03-05 17:04:45 -05:00
pre-commit-ci[bot]
b166c3a9d9 [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
2026-03-05 20:48:17 +00:00
Jai Suphavadeeprasit
b91922082e managed_Server pass through and centralize sem logic 2026-03-05 15:46:33 -05:00
dmahan93
f4875c5dc6 make preserve thinking optional 2026-03-04 15:44:12 -06:00
Jai Suphavadeeprasit
c85a3e5ee7 readme language 2026-03-03 23:44:29 -05:00
Jai Suphavadeeprasit
1a3d9ee664 testing 2026-03-03 23:38:04 -05:00
pre-commit-ci[bot]
efc90bfb1b [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
2026-03-04 04:18:12 +00:00
Jai Suphavadeeprasit
1eeb31065f fixing comments 2026-03-03 23:16:05 -05:00
Jai Suphavadeeprasit
51088ac24d add tests 2026-03-03 23:08:40 -05:00
pre-commit-ci[bot]
8f304d44fd [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
2026-03-04 03:08:19 +00:00
Jai Suphavadeeprasit
5aaf7a346c prompt logprobs simplicity 2026-03-03 22:06:49 -05:00
Jai Suphavadeeprasit
f1c20591b6 prompt logprobs 2026-03-03 21:58:05 -05:00
Jai Suphavadeeprasit
439b9b129b prompt logprobs 2026-03-03 21:58:05 -05:00
dmahan93
12d61d197f add env using the tool api stuff 2026-03-03 19:51:30 -06:00
dmahan93
c8eb63f33d readme updates for tool calling 2026-03-03 12:22:10 -06:00
pre-commit-ci[bot]
e98100e5f6 [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
2026-03-03 17:21:00 +00:00
Jai Suphavadeeprasit
323a8a2601 readme updates 2026-03-03 12:19:55 -05:00
Jai Suphavadeeprasit
b9291aa29f init commit 2026-03-03 11:32:09 -05:00
dmahan93
8f21bb57ed add better warning message 2026-03-02 23:21:25 -06:00
dmahan93
add42a2afb add tool call parsing based on vllm impl and an openai server endpoint 2026-03-02 23:17:13 -06:00
pre-commit-ci[bot]
fe2fd3d824 [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
2026-03-02 11:18:52 -05:00
Jai Suphavadeeprasit
5640d7de25 error handling 2026-03-02 11:18:51 -05:00
J-SUPHA
b763b4e20d
Merge pull request #387 from NousResearch/opd-filtered
Opd filtered
2026-02-27 21:40:03 -05:00
pre-commit-ci[bot]
216c1f5899 [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
2026-02-27 21:17:58 +00:00
Jai Suphavadeeprasit
35587cbdc0 logger changes 2026-02-27 16:17:03 -05:00
pre-commit-ci[bot]
64d3ee1bd6 [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
2026-02-27 18:16:06 +00:00
Jai Suphavadeeprasit
f343b24a6a narrow down scope 2026-02-27 11:14:42 -05:00
Ocheretovich Oksana
aec5552db6 fix: pass num_steps to register_to_api
Signed-off-by: Ocheretovich Oksana <ocheretovich@gmail.com>
2026-02-24 11:22:18 +02:00
Jai Suphavadeeprasit
e5297148f9 dynamic system prompt fixed 2026-02-20 14:50:43 -05:00
Jai Suphavadeeprasit
fc248dd65b clean 2026-02-20 12:01:50 -05:00
Jai Suphavadeeprasit
55f7cbd091 dynamic system prompts 2026-02-20 03:14:05 -05:00
Jai Suphavadeeprasit
e615eb1f50 assertions 2026-02-20 02:16:49 -05:00
Jai Suphavadeeprasit
559d649a26 proper fallback 2026-02-20 01:45:41 -05:00
Jai Suphavadeeprasit
3910a58f9b refactor base 2026-02-20 01:45:41 -05:00
Jai Suphavadeeprasit
1c90fc71b0 on policy clean up 2026-02-20 01:45:41 -05:00
Jai Suphavadeeprasit
79e392c446 post merge changes 2026-02-20 01:45:41 -05:00
Jai Suphavadeeprasit
c89854a350 debug changes 2026-02-20 01:45:41 -05:00
Jai Suphavadeeprasit
0510ca9b72 found bug 2026-02-20 01:45:41 -05:00
Jai Suphavadeeprasit
fb23014dcc base env debugging 2026-02-20 01:45:41 -05:00
Jai Suphavadeeprasit
ea2b388435 base env debugging 2026-02-20 01:45:41 -05:00
Jai Suphavadeeprasit
e814007575 base env debugging 2026-02-20 01:45:41 -05:00
Jai Suphavadeeprasit
b492ac4fce on policy changes 2026-02-20 01:45:41 -05:00
Jai Suphavadeeprasit
6bc962c746 initial commit 2026-02-20 01:45:41 -05:00
dmahan93
9ffd4de275
Merge pull request #362 from ansulx/fix/trl-vllm-completion-test
Add regression test for TRL vLLM completion wrapper
2026-02-09 21:06:12 -08:00
Dakota
7d6aeb9bbf add tokenizer name config to set the vllm/sglang tokenizer to something different if needed 2026-02-09 15:26:29 -06:00
Alireza
6b92ee16ec fix duplicate code + add safety checks 2026-02-09 10:58:49 +03:30
Ansul
3b9b67a3ad
Merge branch 'main' into fix/trl-vllm-completion-test 2026-02-06 02:13:29 +05:30
ansulx
d97f366ae0 Add regression test for TRL vLLM completion wrapper
Ensure the TRL vLLM completion wrapper returns a Completion with text so issue #183 stays covered.
2026-02-06 01:57:16 +05:30