Commit graph

252 commits

Author SHA1 Message Date
pre-commit-ci[bot]
60fb6cae11 [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
2026-02-20 04:58:47 +00:00
Jai Suphavadeeprasit
ccdd5a1ca6 linting 2026-02-19 23:57:47 -05:00
Jai Suphavadeeprasit
809b88bf30 gsm8k trial 2026-02-19 21:32:40 -05:00
Jai Suphavadeeprasit
bbbfaf1680 gsm8k trial 2026-02-19 21:17:49 -05:00
Jai Suphavadeeprasit
527433b5bc change OPD style 2026-02-19 17:08:27 -05:00
Jai Suphavadeeprasit
33f5696171 Merge branch 'pipelineRL' into OnPolicyDistillation 2026-02-19 16:39:21 -05:00
Jai Suphavadeeprasit
bc0f9ee625 debug changes 2026-02-17 08:15:07 -05:00
Jai Suphavadeeprasit
f52de7441c found bug 2026-02-16 21:26:44 -05:00
Jai Suphavadeeprasit
573221497d base env debugging 2026-02-16 21:23:54 -05:00
Jai Suphavadeeprasit
7a90f34d85 base env debugging 2026-02-16 21:20:33 -05:00
Jai Suphavadeeprasit
b0658f6327 base env debugging 2026-02-16 21:05:57 -05:00
Jai Suphavadeeprasit
0e81c62e90 on policy changes 2026-02-16 17:39:37 -05:00
Jai Suphavadeeprasit
cc9b891eba initial commit 2026-02-16 11:46:20 -05:00
pre-commit-ci[bot]
e1aca5ecf5 [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
2026-02-13 11:26:25 -05:00
Jai Suphavadeeprasit
80f67f979a error handling 2026-02-13 11:26:25 -05:00
dmahan93
9ffd4de275
Merge pull request #362 from ansulx/fix/trl-vllm-completion-test
Add regression test for TRL vLLM completion wrapper
2026-02-09 21:06:12 -08:00
Dakota
7d6aeb9bbf add tokenizer name config to set the vllm/sglang tokenizer to something different if needed 2026-02-09 15:26:29 -06:00
Alireza
6b92ee16ec fix duplicate code + add safety checks 2026-02-09 10:58:49 +03:30
Ansul
3b9b67a3ad
Merge branch 'main' into fix/trl-vllm-completion-test 2026-02-06 02:13:29 +05:30
ansulx
d97f366ae0 Add regression test for TRL vLLM completion wrapper
Ensure the TRL vLLM completion wrapper returns a Completion with text so issue #183 stays covered.
2026-02-06 01:57:16 +05:30
Dakota
9ff24bf370 change to 128 tokens to support low length rejection 2026-02-04 16:23:30 -06:00
Dakota
10f651289c Add dummy openai managed server 2026-02-04 15:16:36 -06:00
VolodymyrBg
1eb0d72099
Update FAQ.md 2026-01-29 10:43:47 +02:00
VolodymyrBg
e0744adf28
Update README.md 2026-01-29 10:23:53 +02:00
VolodymyrBg
dd02df0d76
Update base.py 2026-01-29 10:22:51 +02:00
VolodymyrBg
77a3505955
Update test_openai_api_workarounds.py 2026-01-29 10:13:50 +02:00
pre-commit-ci[bot]
2be7442dd5 [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
2026-01-26 16:41:26 +00:00
balyan.sid@gmail.com
5a20abdce7 switch eval to use managed server adapter impl. moved managed server
adapter
2026-01-23 23:26:29 +05:30
Siddharth Balyan
ecea823d5c
Merge branch 'main' into sid/verifiers 2026-01-19 12:58:32 +05:30
Teknium
84a8bbb9cb
Merge pull request #317 from Savage890/fix/issue-308-jsonl2html
fix: handle nested message format in jsonl2html.py
2026-01-16 06:47:44 -08:00
Siddharth Balyan
7f28c52994
Merge branch 'main' into sid/verifiers 2026-01-16 11:50:27 +05:30
teknium
31a8cdc7a7 update test to reflect the change in reasoning effort mapping 2026-01-15 07:48:52 +00:00
teknium
681616844d linter.... 2026-01-15 07:44:53 +00:00
teknium
45d47fbf56 Refactor reasoning configuration check in APIServer class
- Removed unnecessary commented-out code and simplified the logic for checking if reasoning is configured and active. This enhances code readability and maintainability.
2026-01-15 07:43:21 +00:00
pre-commit-ci[bot]
f3ea354f31 [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
2026-01-15 07:23:36 +00:00
teknium
c2e7b3700e Merge branch 'add_reasoning_handling_draft' of https://github.com/NousResearch/atropos into add_reasoning_handling_draft 2026-01-15 07:22:47 +00:00
teknium
b2d17a44d2 Add README for server handling module and refine ReasoningConfig logic
- Introduced a new README.md file detailing the server handling module, including support for reasoning models, provider differences, effort level mappings, and usage examples.
- Cleaned up the ReasoningConfig class by removing unnecessary comments and clarifying logic related to reasoning injection and provider-specific requirements.
2026-01-15 07:21:53 +00:00
teknium
0e187d7869 Update completion handler documentation to clarify that reasoning config is not injected for completions, as it is only supported in chat completions. 2026-01-15 06:44:55 +00:00
pre-commit-ci[bot]
b6e24266b2 [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
2026-01-15 06:27:19 +00:00
teknium
0316cac8d1 Rename is_active method to is_reasoning_kwargs_active in ReasoningConfig for clarity. Update references in the class and corresponding tests to reflect this change. 2026-01-15 06:26:31 +00:00
pre-commit-ci[bot]
39e9a233db [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
2026-01-14 21:36:05 +00:00
Savage890
39f05d18fa fix: handle nested message format in jsonl2html.py (#308) 2026-01-15 03:01:15 +05:30
balyan.sid@gmail.com
57fa229846 remove unused managed_server wrapper + tese 2026-01-14 17:09:57 +05:30
balyan.sid@gmail.com
6a27e88023 use managed server 2026-01-14 17:09:01 +05:30
Teknium
837fc237ee
Merge branch 'main' into add_reasoning_handling_draft 2026-01-12 09:45:38 -08:00
teknium
6aba5244b8 ReasoningConfig documentation to clarify dependency on effort and max_tokens settings. This update specifies that enabling either of these parameters requires reasoning in OpenRouter to be set to Enabled. 2026-01-12 17:45:19 +00:00
teknium
21504537fc revive _get_server_base_url 2026-01-12 16:49:38 +00:00
balyan.sid@gmail.com
294b980625 add tests for AtroposManagedClient 2026-01-12 10:34:05 +05:30
balyan.sid@gmail.com
cf636595d2 rework server and eval for rl rollout. add in asyncmanagedserver for
verifiers
2026-01-12 10:34:05 +05:30
pre-commit-ci[bot]
6cfcbdf4d5 [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
2026-01-05 23:20:47 +00:00