Commit graph

175 commits

Author SHA1 Message Date
Jai Suphavadeeprasit
3910a58f9b refactor base 2026-02-20 01:45:41 -05:00
Jai Suphavadeeprasit
1c90fc71b0 on policy clean up 2026-02-20 01:45:41 -05:00
Jai Suphavadeeprasit
79e392c446 post merge changes 2026-02-20 01:45:41 -05:00
Jai Suphavadeeprasit
c89854a350 debug changes 2026-02-20 01:45:41 -05:00
Jai Suphavadeeprasit
0510ca9b72 found bug 2026-02-20 01:45:41 -05:00
Jai Suphavadeeprasit
fb23014dcc base env debugging 2026-02-20 01:45:41 -05:00
Jai Suphavadeeprasit
ea2b388435 base env debugging 2026-02-20 01:45:41 -05:00
Jai Suphavadeeprasit
e814007575 base env debugging 2026-02-20 01:45:41 -05:00
Jai Suphavadeeprasit
b492ac4fce on policy changes 2026-02-20 01:45:41 -05:00
Jai Suphavadeeprasit
6bc962c746 initial commit 2026-02-20 01:45:41 -05:00
Dakota
7d6aeb9bbf add tokenizer name config to set the vllm/sglang tokenizer to something different if needed 2026-02-09 15:26:29 -06:00
Dakota
9ff24bf370 change to 128 tokens to support low length rejection 2026-02-04 16:23:30 -06:00
Dakota
10f651289c Add dummy openai managed server 2026-02-04 15:16:36 -06:00
VolodymyrBg
e0744adf28
Update README.md 2026-01-29 10:23:53 +02:00
VolodymyrBg
dd02df0d76
Update base.py 2026-01-29 10:22:51 +02:00
balyan.sid@gmail.com
5a20abdce7 switch eval to use managed server adapter impl. moved managed server
adapter
2026-01-23 23:26:29 +05:30
Siddharth Balyan
7f28c52994
Merge branch 'main' into sid/verifiers 2026-01-16 11:50:27 +05:30
teknium
681616844d linter.... 2026-01-15 07:44:53 +00:00
teknium
45d47fbf56 Refactor reasoning configuration check in APIServer class
- Removed unnecessary commented-out code and simplified the logic for checking if reasoning is configured and active. This enhances code readability and maintainability.
2026-01-15 07:43:21 +00:00
pre-commit-ci[bot]
f3ea354f31 [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
2026-01-15 07:23:36 +00:00
teknium
c2e7b3700e Merge branch 'add_reasoning_handling_draft' of https://github.com/NousResearch/atropos into add_reasoning_handling_draft 2026-01-15 07:22:47 +00:00
teknium
b2d17a44d2 Add README for server handling module and refine ReasoningConfig logic
- Introduced a new README.md file detailing the server handling module, including support for reasoning models, provider differences, effort level mappings, and usage examples.
- Cleaned up the ReasoningConfig class by removing unnecessary comments and clarifying logic related to reasoning injection and provider-specific requirements.
2026-01-15 07:21:53 +00:00
teknium
0e187d7869 Update completion handler documentation to clarify that reasoning config is not injected for completions, as it is only supported in chat completions. 2026-01-15 06:44:55 +00:00
pre-commit-ci[bot]
b6e24266b2 [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
2026-01-15 06:27:19 +00:00
teknium
0316cac8d1 Rename is_active method to is_reasoning_kwargs_active in ReasoningConfig for clarity. Update references in the class and corresponding tests to reflect this change. 2026-01-15 06:26:31 +00:00
balyan.sid@gmail.com
57fa229846 remove unused managed_server wrapper + tese 2026-01-14 17:09:57 +05:30
balyan.sid@gmail.com
6a27e88023 use managed server 2026-01-14 17:09:01 +05:30
Teknium
837fc237ee
Merge branch 'main' into add_reasoning_handling_draft 2026-01-12 09:45:38 -08:00
teknium
6aba5244b8 ReasoningConfig documentation to clarify dependency on effort and max_tokens settings. This update specifies that enabling either of these parameters requires reasoning in OpenRouter to be set to Enabled. 2026-01-12 17:45:19 +00:00
teknium
21504537fc revive _get_server_base_url 2026-01-12 16:49:38 +00:00
balyan.sid@gmail.com
cf636595d2 rework server and eval for rl rollout. add in asyncmanagedserver for
verifiers
2026-01-12 10:34:05 +05:30
pre-commit-ci[bot]
6cfcbdf4d5 [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
2026-01-05 23:20:47 +00:00
teknium
e1ece3e64e Add reasoning configuration support across server implementations
- Updated server classes (OpenAIServer, SGLangServer, TrlVllmServer, VLLMServer) to accept a ReasoningConfig parameter during initialization.
- Enhanced ReasoningConfig to allow flexible max_tokens without strict validation, accommodating varying provider limits.
- Implemented reasoning configuration injection in APIServer methods for chat and completion handling.
- Updated tests to reflect changes in max_tokens validation logic.

This commit integrates reasoning capabilities into the server handling architecture, improving compatibility with diverse reasoning models.
2026-01-05 23:20:01 +00:00
teknium
127a925471 Merge branch 'add_reasoning_handling_draft' of https://github.com/NousResearch/atropos into add_reasoning_handling_draft 2025-12-30 11:59:46 +00:00
teknium
747fbc9285 fix linting 2025-12-30 11:56:21 +00:00
pre-commit-ci[bot]
97047eee7b [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
2025-12-30 00:26:33 +00:00
teknium
62fa51240c Add support for reasoning models and their variety of providers/endpoints 2025-12-30 00:23:00 +00:00
Dakota
8ec5066998 add eval runner 2025-12-19 19:56:59 -06:00
Ksenchi
6c843d6677
Update MANAGED_SERVER.md 2025-11-12 07:22:40 +01:00
Ksenchi
edf26d68e1
Update MANAGED_SERVER.md 2025-11-12 07:22:06 +01:00
Dhyaneesh
39d5fb4452 feat: dump evaluate subcommand config to YAML in env save dir
Automatically save the final merged evaluate configuration to evaluate_config.yaml
in the data_dir_to_save_evals directory. This includes env config, OpenAI/server
configs, and server manager settings, enabling reproducibility and easier
debugging of evaluation runs.

The config is saved after all merging (CLI args > YAML > defaults) to capture
the exact configuration used for the evaluation.
2025-11-08 23:46:13 +05:30
dmahan93
b4080a4f37
Merge pull request #273 from NousResearch/add-vllm-manager-fn
add managed vllm server
2025-11-07 14:22:07 -08:00
Dakota
e6ac3abdcb add managed vllm server 2025-11-07 13:06:49 -06:00
dmahan93
c96b8a1255
Merge pull request #267 from bobtajson/main
fix: correct typo and improve code quality
2025-11-06 18:36:37 -08:00
dmahan93
b1e164eef5
Merge pull request #264 from NousResearch/add-logprob-server-manager-fn
add sglang specific token level logprob handling and server manager/b…
2025-10-29 13:53:39 -07:00
Dakota
578175a709 fix pre-commit 2025-10-29 14:47:50 -05:00
Dakota
5d6d6bb0dc add docs :) 2025-10-29 11:26:43 -05:00
Dakota
c3a118f50d fix tests 2025-10-29 10:55:10 -05:00
Dakota
d5400460e8 made masked logprobs coherently decided on 2025-10-29 10:52:38 -05:00
Dakota
e57c396f86 ran pre-commit 2025-10-29 10:45:27 -05:00