pre-commit-ci[bot]
8f304d44fd
[pre-commit.ci] auto fixes from pre-commit.com hooks
...
for more information, see https://pre-commit.ci
2026-03-04 03:08:19 +00:00
Jai Suphavadeeprasit
5aaf7a346c
prompt logprobs simplicity
2026-03-03 22:06:49 -05:00
Jai Suphavadeeprasit
f1c20591b6
prompt logprobs
2026-03-03 21:58:05 -05:00
Jai Suphavadeeprasit
439b9b129b
prompt logprobs
2026-03-03 21:58:05 -05:00
pre-commit-ci[bot]
e98100e5f6
[pre-commit.ci] auto fixes from pre-commit.com hooks
...
for more information, see https://pre-commit.ci
2026-03-03 17:21:00 +00:00
Jai Suphavadeeprasit
323a8a2601
readme updates
2026-03-03 12:19:55 -05:00
Jai Suphavadeeprasit
b9291aa29f
init commit
2026-03-03 11:32:09 -05:00
pre-commit-ci[bot]
fe2fd3d824
[pre-commit.ci] auto fixes from pre-commit.com hooks
...
for more information, see https://pre-commit.ci
2026-03-02 11:18:52 -05:00
Jai Suphavadeeprasit
5640d7de25
error handling
2026-03-02 11:18:51 -05:00
J-SUPHA
b763b4e20d
Merge pull request #387 from NousResearch/opd-filtered
...
Opd filtered
2026-02-27 21:40:03 -05:00
pre-commit-ci[bot]
216c1f5899
[pre-commit.ci] auto fixes from pre-commit.com hooks
...
for more information, see https://pre-commit.ci
2026-02-27 21:17:58 +00:00
Jai Suphavadeeprasit
35587cbdc0
logger changes
2026-02-27 16:17:03 -05:00
pre-commit-ci[bot]
64d3ee1bd6
[pre-commit.ci] auto fixes from pre-commit.com hooks
...
for more information, see https://pre-commit.ci
2026-02-27 18:16:06 +00:00
Jai Suphavadeeprasit
f343b24a6a
narrow down scope
2026-02-27 11:14:42 -05:00
Ocheretovich Oksana
aec5552db6
fix: pass num_steps to register_to_api
...
Signed-off-by: Ocheretovich Oksana <ocheretovich@gmail.com>
2026-02-24 11:22:18 +02:00
Jai Suphavadeeprasit
e5297148f9
dynamic system prompt fixed
2026-02-20 14:50:43 -05:00
Jai Suphavadeeprasit
fc248dd65b
clean
2026-02-20 12:01:50 -05:00
Jai Suphavadeeprasit
55f7cbd091
dynamic system prompts
2026-02-20 03:14:05 -05:00
Jai Suphavadeeprasit
e615eb1f50
assertions
2026-02-20 02:16:49 -05:00
Jai Suphavadeeprasit
559d649a26
proper fallback
2026-02-20 01:45:41 -05:00
Jai Suphavadeeprasit
3910a58f9b
refactor base
2026-02-20 01:45:41 -05:00
Jai Suphavadeeprasit
1c90fc71b0
on policy clean up
2026-02-20 01:45:41 -05:00
Jai Suphavadeeprasit
79e392c446
post merge changes
2026-02-20 01:45:41 -05:00
Jai Suphavadeeprasit
c89854a350
debug changes
2026-02-20 01:45:41 -05:00
Jai Suphavadeeprasit
0510ca9b72
found bug
2026-02-20 01:45:41 -05:00
Jai Suphavadeeprasit
fb23014dcc
base env debugging
2026-02-20 01:45:41 -05:00
Jai Suphavadeeprasit
ea2b388435
base env debugging
2026-02-20 01:45:41 -05:00
Jai Suphavadeeprasit
e814007575
base env debugging
2026-02-20 01:45:41 -05:00
Jai Suphavadeeprasit
b492ac4fce
on policy changes
2026-02-20 01:45:41 -05:00
Jai Suphavadeeprasit
6bc962c746
initial commit
2026-02-20 01:45:41 -05:00
dmahan93
9ffd4de275
Merge pull request #362 from ansulx/fix/trl-vllm-completion-test
...
Add regression test for TRL vLLM completion wrapper
2026-02-09 21:06:12 -08:00
Dakota
7d6aeb9bbf
add tokenizer name config to set the vllm/sglang tokenizer to something different if needed
2026-02-09 15:26:29 -06:00
Alireza
6b92ee16ec
fix duplicate code + add safety checks
2026-02-09 10:58:49 +03:30
Ansul
3b9b67a3ad
Merge branch 'main' into fix/trl-vllm-completion-test
2026-02-06 02:13:29 +05:30
ansulx
d97f366ae0
Add regression test for TRL vLLM completion wrapper
...
Ensure the TRL vLLM completion wrapper returns a Completion with text so issue #183 stays covered.
2026-02-06 01:57:16 +05:30
Dakota
9ff24bf370
change to 128 tokens to support low length rejection
2026-02-04 16:23:30 -06:00
Dakota
10f651289c
Add dummy openai managed server
2026-02-04 15:16:36 -06:00
VolodymyrBg
1eb0d72099
Update FAQ.md
2026-01-29 10:43:47 +02:00
VolodymyrBg
e0744adf28
Update README.md
2026-01-29 10:23:53 +02:00
VolodymyrBg
dd02df0d76
Update base.py
2026-01-29 10:22:51 +02:00
VolodymyrBg
77a3505955
Update test_openai_api_workarounds.py
2026-01-29 10:13:50 +02:00
pre-commit-ci[bot]
2be7442dd5
[pre-commit.ci] auto fixes from pre-commit.com hooks
...
for more information, see https://pre-commit.ci
2026-01-26 16:41:26 +00:00
balyan.sid@gmail.com
5a20abdce7
switch eval to use managed server adapter impl. moved managed server
...
adapter
2026-01-23 23:26:29 +05:30
Siddharth Balyan
ecea823d5c
Merge branch 'main' into sid/verifiers
2026-01-19 12:58:32 +05:30
Teknium
84a8bbb9cb
Merge pull request #317 from Savage890/fix/issue-308-jsonl2html
...
fix: handle nested message format in jsonl2html.py
2026-01-16 06:47:44 -08:00
Siddharth Balyan
7f28c52994
Merge branch 'main' into sid/verifiers
2026-01-16 11:50:27 +05:30
teknium
31a8cdc7a7
update test to reflect the change in reasoning effort mapping
2026-01-15 07:48:52 +00:00
teknium
681616844d
linter....
2026-01-15 07:44:53 +00:00
teknium
45d47fbf56
Refactor reasoning configuration check in APIServer class
...
- Removed unnecessary commented-out code and simplified the logic for checking if reasoning is configured and active. This enhances code readability and maintainability.
2026-01-15 07:43:21 +00:00
pre-commit-ci[bot]
f3ea354f31
[pre-commit.ci] auto fixes from pre-commit.com hooks
...
for more information, see https://pre-commit.ci
2026-01-15 07:23:36 +00:00