dmahan93
f198c1738e
Merge conflict commit
2026-03-09 23:13:43 -05:00
Jai Suphavadeeprasit
eb50099361
test_get_logprobs_input_ids_only_passthrough
2026-03-05 17:04:45 -05:00
0xbyt4
4d8e9b8086
fix: use sys.executable instead of hardcoded "python" in tests
Tests that launch the API server via subprocess used a hardcoded
"python" command which fails on systems where only "python3" is
available (e.g. macOS). Using sys.executable ensures the same
interpreter running pytest is used for subprocesses.
Fixes 36 test errors on macOS environments.
2026-03-05 17:04:45 -05:00
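Note on 4d8e9b8086: the idea in that commit message can be sketched as below. This is a minimal illustration under stated assumptions, not the repository's actual test code. The helper name run_python_snippet is hypothetical, and the child process here just runs a one-line snippet rather than launching the API server.

```python
import subprocess
import sys


def run_python_snippet(code: str) -> subprocess.CompletedProcess:
    """Run `code` in a child process with the interpreter running this script.

    run_python_snippet is a hypothetical helper used only for illustration.
    """
    # sys.executable points at the current interpreter, so the child process
    # does not depend on a "python" command being available on PATH.
    return subprocess.run(
        [sys.executable, "-c", code],
        capture_output=True,
        text=True,
        check=True,  # raise CalledProcessError on a non-zero exit status
    )


result = run_python_snippet("print(1 + 1)")
print(result.stdout.strip())
```

A hardcoded ["python", "-c", code] would instead be resolved through PATH and can raise FileNotFoundError where only "python3" is installed. One caveat: sys.executable can be empty or None in unusual embedded setups, so a real test suite may want to guard against that.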
pre-commit-ci[bot]
b166c3a9d9
[pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
2026-03-05 20:48:17 +00:00
Jai Suphavadeeprasit
b91922082e
managed_Server pass through and centralize sem logic
2026-03-05 15:46:33 -05:00
dmahan93
f4875c5dc6
make preserve thinking optional
2026-03-04 15:44:12 -06:00
Jai Suphavadeeprasit
c85a3e5ee7
readme language
2026-03-03 23:44:29 -05:00
Jai Suphavadeeprasit
1a3d9ee664
testing
2026-03-03 23:38:04 -05:00
pre-commit-ci[bot]
efc90bfb1b
[pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
2026-03-04 04:18:12 +00:00
Jai Suphavadeeprasit
1eeb31065f
fixing comments
2026-03-03 23:16:05 -05:00
Jai Suphavadeeprasit
51088ac24d
add tests
2026-03-03 23:08:40 -05:00
pre-commit-ci[bot]
8f304d44fd
[pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
2026-03-04 03:08:19 +00:00
Jai Suphavadeeprasit
5aaf7a346c
prompt logprobs simplicity
2026-03-03 22:06:49 -05:00
Jai Suphavadeeprasit
f1c20591b6
prompt logprobs
2026-03-03 21:58:05 -05:00
Jai Suphavadeeprasit
439b9b129b
prompt logprobs
2026-03-03 21:58:05 -05:00
dmahan93
12d61d197f
add env using the tool api stuff
2026-03-03 19:51:30 -06:00
dmahan93
c8eb63f33d
readme updates for tool calling
2026-03-03 12:22:10 -06:00
pre-commit-ci[bot]
e98100e5f6
[pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
2026-03-03 17:21:00 +00:00
Jai Suphavadeeprasit
323a8a2601
readme updates
2026-03-03 12:19:55 -05:00
Jai Suphavadeeprasit
b9291aa29f
init commit
2026-03-03 11:32:09 -05:00
dmahan93
8f21bb57ed
add better warning message
2026-03-02 23:21:25 -06:00
dmahan93
add42a2afb
add tool call parsing based on vllm impl and an openai server endpoint
2026-03-02 23:17:13 -06:00
pre-commit-ci[bot]
fe2fd3d824
[pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
2026-03-02 11:18:52 -05:00
Jai Suphavadeeprasit
5640d7de25
error handling
2026-03-02 11:18:51 -05:00
J-SUPHA
b763b4e20d
Merge pull request #387 from NousResearch/opd-filtered
Opd filtered
2026-02-27 21:40:03 -05:00
pre-commit-ci[bot]
216c1f5899
[pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
2026-02-27 21:17:58 +00:00
Jai Suphavadeeprasit
35587cbdc0
logger changes
2026-02-27 16:17:03 -05:00
pre-commit-ci[bot]
64d3ee1bd6
[pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
2026-02-27 18:16:06 +00:00
Jai Suphavadeeprasit
f343b24a6a
narrow down scope
2026-02-27 11:14:42 -05:00
Ocheretovich Oksana
aec5552db6
fix: pass num_steps to register_to_api
Signed-off-by: Ocheretovich Oksana <ocheretovich@gmail.com>
2026-02-24 11:22:18 +02:00
Jai Suphavadeeprasit
e5297148f9
dynamic system prompt fixed
2026-02-20 14:50:43 -05:00
Jai Suphavadeeprasit
fc248dd65b
clean
2026-02-20 12:01:50 -05:00
Jai Suphavadeeprasit
55f7cbd091
dynamic system prompts
2026-02-20 03:14:05 -05:00
Jai Suphavadeeprasit
e615eb1f50
assertions
2026-02-20 02:16:49 -05:00
Jai Suphavadeeprasit
559d649a26
proper fallback
2026-02-20 01:45:41 -05:00
Jai Suphavadeeprasit
3910a58f9b
refactor base
2026-02-20 01:45:41 -05:00
Jai Suphavadeeprasit
1c90fc71b0
on policy clean up
2026-02-20 01:45:41 -05:00
Jai Suphavadeeprasit
79e392c446
post merge changes
2026-02-20 01:45:41 -05:00
Jai Suphavadeeprasit
c89854a350
debug changes
2026-02-20 01:45:41 -05:00
Jai Suphavadeeprasit
0510ca9b72
found bug
2026-02-20 01:45:41 -05:00
Jai Suphavadeeprasit
fb23014dcc
base env debugging
2026-02-20 01:45:41 -05:00
Jai Suphavadeeprasit
ea2b388435
base env debugging
2026-02-20 01:45:41 -05:00
Jai Suphavadeeprasit
e814007575
base env debugging
2026-02-20 01:45:41 -05:00
Jai Suphavadeeprasit
b492ac4fce
on policy changes
2026-02-20 01:45:41 -05:00
Jai Suphavadeeprasit
6bc962c746
initial commit
2026-02-20 01:45:41 -05:00
dmahan93
9ffd4de275
Merge pull request #362 from ansulx/fix/trl-vllm-completion-test
Add regression test for TRL vLLM completion wrapper
2026-02-09 21:06:12 -08:00
Dakota
7d6aeb9bbf
add tokenizer name config to set the vllm/sglang tokenizer to something different if needed
2026-02-09 15:26:29 -06:00
Alireza
6b92ee16ec
fix duplicate code + add safety checks
2026-02-09 10:58:49 +03:30
Ansul
3b9b67a3ad
Merge branch 'main' into fix/trl-vllm-completion-test
2026-02-06 02:13:29 +05:30
ansulx
d97f366ae0
Add regression test for TRL vLLM completion wrapper
Ensure the TRL vLLM completion wrapper returns a Completion with text so issue #183 stays covered.
2026-02-06 01:57:16 +05:30