pre-commit-ci[bot]
60fb6cae11
[pre-commit.ci] auto fixes from pre-commit.com hooks
...
for more information, see https://pre-commit.ci
2026-02-20 04:58:47 +00:00
Jai Suphavadeeprasit
ccdd5a1ca6
linting
2026-02-19 23:57:47 -05:00
Jai Suphavadeeprasit
809b88bf30
gsm8k trial
2026-02-19 21:32:40 -05:00
Jai Suphavadeeprasit
bbbfaf1680
gsm8k trial
2026-02-19 21:17:49 -05:00
Jai Suphavadeeprasit
527433b5bc
change OPD style
2026-02-19 17:08:27 -05:00
Jai Suphavadeeprasit
33f5696171
Merge branch 'pipelineRL' into OnPolicyDistillation
2026-02-19 16:39:21 -05:00
Jai Suphavadeeprasit
bc0f9ee625
debug changes
2026-02-17 08:15:07 -05:00
Jai Suphavadeeprasit
f52de7441c
found bug
2026-02-16 21:26:44 -05:00
Jai Suphavadeeprasit
573221497d
base env debugging
2026-02-16 21:23:54 -05:00
Jai Suphavadeeprasit
7a90f34d85
base env debugging
2026-02-16 21:20:33 -05:00
Jai Suphavadeeprasit
b0658f6327
base env debugging
2026-02-16 21:05:57 -05:00
Jai Suphavadeeprasit
0e81c62e90
on policy changes
2026-02-16 17:39:37 -05:00
Jai Suphavadeeprasit
cc9b891eba
initial commit
2026-02-16 11:46:20 -05:00
pre-commit-ci[bot]
e1aca5ecf5
[pre-commit.ci] auto fixes from pre-commit.com hooks
...
for more information, see https://pre-commit.ci
2026-02-13 11:26:25 -05:00
Jai Suphavadeeprasit
80f67f979a
error handling
2026-02-13 11:26:25 -05:00
dmahan93
9ffd4de275
Merge pull request #362 from ansulx/fix/trl-vllm-completion-test
...
Add regression test for TRL vLLM completion wrapper
2026-02-09 21:06:12 -08:00
Dakota
7d6aeb9bbf
add tokenizer name config to set the vllm/sglang tokenizer to something different if needed
2026-02-09 15:26:29 -06:00
Alireza
6b92ee16ec
fix duplicate code + add safety checks
2026-02-09 10:58:49 +03:30
Ansul
3b9b67a3ad
Merge branch 'main' into fix/trl-vllm-completion-test
2026-02-06 02:13:29 +05:30
ansulx
d97f366ae0
Add regression test for TRL vLLM completion wrapper
...
Ensure the TRL vLLM completion wrapper returns a Completion with text so issue #183 stays covered.
2026-02-06 01:57:16 +05:30
Dakota
9ff24bf370
change to 128 tokens to support low length rejection
2026-02-04 16:23:30 -06:00
Dakota
10f651289c
Add dummy openai managed server
2026-02-04 15:16:36 -06:00
VolodymyrBg
1eb0d72099
Update FAQ.md
2026-01-29 10:43:47 +02:00
VolodymyrBg
e0744adf28
Update README.md
2026-01-29 10:23:53 +02:00
VolodymyrBg
dd02df0d76
Update base.py
2026-01-29 10:22:51 +02:00
VolodymyrBg
77a3505955
Update test_openai_api_workarounds.py
2026-01-29 10:13:50 +02:00
pre-commit-ci[bot]
2be7442dd5
[pre-commit.ci] auto fixes from pre-commit.com hooks
...
for more information, see https://pre-commit.ci
2026-01-26 16:41:26 +00:00
balyan.sid@gmail.com
5a20abdce7
switch eval to use managed server adapter impl. moved managed server
...
adapter
2026-01-23 23:26:29 +05:30
Siddharth Balyan
ecea823d5c
Merge branch 'main' into sid/verifiers
2026-01-19 12:58:32 +05:30
Teknium
84a8bbb9cb
Merge pull request #317 from Savage890/fix/issue-308-jsonl2html
...
fix: handle nested message format in jsonl2html.py
2026-01-16 06:47:44 -08:00
Siddharth Balyan
7f28c52994
Merge branch 'main' into sid/verifiers
2026-01-16 11:50:27 +05:30
teknium
31a8cdc7a7
update test to reflect the change in reasoning effort mapping
2026-01-15 07:48:52 +00:00
teknium
681616844d
linter....
2026-01-15 07:44:53 +00:00
teknium
45d47fbf56
Refactor reasoning configuration check in APIServer class
...
- Removed unnecessary commented-out code and simplified the logic for checking if reasoning is configured and active. This enhances code readability and maintainability.
2026-01-15 07:43:21 +00:00
pre-commit-ci[bot]
f3ea354f31
[pre-commit.ci] auto fixes from pre-commit.com hooks
...
for more information, see https://pre-commit.ci
2026-01-15 07:23:36 +00:00
teknium
c2e7b3700e
Merge branch 'add_reasoning_handling_draft' of https://github.com/NousResearch/atropos into add_reasoning_handling_draft
2026-01-15 07:22:47 +00:00
teknium
b2d17a44d2
Add README for server handling module and refine ReasoningConfig logic
...
- Introduced a new README.md file detailing the server handling module, including support for reasoning models, provider differences, effort level mappings, and usage examples.
- Cleaned up the ReasoningConfig class by removing unnecessary comments and clarifying logic related to reasoning injection and provider-specific requirements.
2026-01-15 07:21:53 +00:00
teknium
0e187d7869
Update completion handler documentation to clarify that reasoning config is not injected for completions, as it is only supported in chat completions.
2026-01-15 06:44:55 +00:00
pre-commit-ci[bot]
b6e24266b2
[pre-commit.ci] auto fixes from pre-commit.com hooks
...
for more information, see https://pre-commit.ci
2026-01-15 06:27:19 +00:00
teknium
0316cac8d1
Rename is_active method to is_reasoning_kwargs_active in ReasoningConfig for clarity. Update references in the class and corresponding tests to reflect this change.
2026-01-15 06:26:31 +00:00
pre-commit-ci[bot]
39e9a233db
[pre-commit.ci] auto fixes from pre-commit.com hooks
...
for more information, see https://pre-commit.ci
2026-01-14 21:36:05 +00:00
Savage890
39f05d18fa
fix: handle nested message format in jsonl2html.py ( #308 )
2026-01-15 03:01:15 +05:30
balyan.sid@gmail.com
57fa229846
remove unused managed_server wrapper + tese
2026-01-14 17:09:57 +05:30
balyan.sid@gmail.com
6a27e88023
use managed server
2026-01-14 17:09:01 +05:30
Teknium
837fc237ee
Merge branch 'main' into add_reasoning_handling_draft
2026-01-12 09:45:38 -08:00
teknium
6aba5244b8
ReasoningConfig documentation to clarify dependency on effort and max_tokens settings. This update specifies that enabling either of these parameters requires reasoning in OpenRouter to be set to Enabled.
2026-01-12 17:45:19 +00:00
teknium
21504537fc
revive _get_server_base_url
2026-01-12 16:49:38 +00:00
balyan.sid@gmail.com
294b980625
add tests for AtroposManagedClient
2026-01-12 10:34:05 +05:30
balyan.sid@gmail.com
cf636595d2
rework server and eval for rl rollout. add in asyncmanagedserver for
...
verifiers
2026-01-12 10:34:05 +05:30
pre-commit-ci[bot]
6cfcbdf4d5
[pre-commit.ci] auto fixes from pre-commit.com hooks
...
for more information, see https://pre-commit.ci
2026-01-05 23:20:47 +00:00