Jai Suphavadeeprasit
3910a58f9b
refactor base
2026-02-20 01:45:41 -05:00
Jai Suphavadeeprasit
1c90fc71b0
on policy clean up
2026-02-20 01:45:41 -05:00
Jai Suphavadeeprasit
79e392c446
post merge changes
2026-02-20 01:45:41 -05:00
Jai Suphavadeeprasit
c89854a350
debug changes
2026-02-20 01:45:41 -05:00
Jai Suphavadeeprasit
0510ca9b72
found bug
2026-02-20 01:45:41 -05:00
Jai Suphavadeeprasit
fb23014dcc
base env debugging
2026-02-20 01:45:41 -05:00
Jai Suphavadeeprasit
ea2b388435
base env debugging
2026-02-20 01:45:41 -05:00
Jai Suphavadeeprasit
e814007575
base env debugging
2026-02-20 01:45:41 -05:00
Jai Suphavadeeprasit
b492ac4fce
on policy changes
2026-02-20 01:45:41 -05:00
Jai Suphavadeeprasit
6bc962c746
initial commit
2026-02-20 01:45:41 -05:00
Dakota
7d6aeb9bbf
add tokenizer name config to set the vllm/sglang tokenizer to something different if needed
2026-02-09 15:26:29 -06:00
Dakota
9ff24bf370
change to 128 tokens to support low length rejection
2026-02-04 16:23:30 -06:00
Dakota
10f651289c
Add dummy openai managed server
2026-02-04 15:16:36 -06:00
VolodymyrBg
e0744adf28
Update README.md
2026-01-29 10:23:53 +02:00
VolodymyrBg
dd02df0d76
Update base.py
2026-01-29 10:22:51 +02:00
balyan.sid@gmail.com
5a20abdce7
switch eval to use managed server adapter impl. moved managed server adapter
2026-01-23 23:26:29 +05:30
Siddharth Balyan
7f28c52994
Merge branch 'main' into sid/verifiers
2026-01-16 11:50:27 +05:30
teknium
681616844d
linter....
2026-01-15 07:44:53 +00:00
teknium
45d47fbf56
Refactor reasoning configuration check in APIServer class
- Removed unnecessary commented-out code and simplified the logic for checking if reasoning is configured and active. This enhances code readability and maintainability.
2026-01-15 07:43:21 +00:00
pre-commit-ci[bot]
f3ea354f31
[pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
2026-01-15 07:23:36 +00:00
teknium
c2e7b3700e
Merge branch 'add_reasoning_handling_draft' of https://github.com/NousResearch/atropos into add_reasoning_handling_draft
2026-01-15 07:22:47 +00:00
teknium
b2d17a44d2
Add README for server handling module and refine ReasoningConfig logic
- Introduced a new README.md file detailing the server handling module, including support for reasoning models, provider differences, effort level mappings, and usage examples.
- Cleaned up the ReasoningConfig class by removing unnecessary comments and clarifying logic related to reasoning injection and provider-specific requirements.
2026-01-15 07:21:53 +00:00
teknium
0e187d7869
Update completion handler documentation to clarify that reasoning config is not injected for completions, as it is only supported in chat completions.
2026-01-15 06:44:55 +00:00
pre-commit-ci[bot]
b6e24266b2
[pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
2026-01-15 06:27:19 +00:00
teknium
0316cac8d1
Rename is_active method to is_reasoning_kwargs_active in ReasoningConfig for clarity. Update references in the class and corresponding tests to reflect this change.
2026-01-15 06:26:31 +00:00
balyan.sid@gmail.com
57fa229846
remove unused managed_server wrapper + tests
2026-01-14 17:09:57 +05:30
balyan.sid@gmail.com
6a27e88023
use managed server
2026-01-14 17:09:01 +05:30
Teknium
837fc237ee
Merge branch 'main' into add_reasoning_handling_draft
2026-01-12 09:45:38 -08:00
teknium
6aba5244b8
Update ReasoningConfig documentation to clarify dependency on effort and max_tokens settings. This update specifies that enabling either of these parameters requires reasoning in OpenRouter to be set to Enabled.
2026-01-12 17:45:19 +00:00
teknium
21504537fc
revive _get_server_base_url
2026-01-12 16:49:38 +00:00
balyan.sid@gmail.com
cf636595d2
rework server and eval for rl rollout. add in asyncmanagedserver for verifiers
2026-01-12 10:34:05 +05:30
pre-commit-ci[bot]
6cfcbdf4d5
[pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
2026-01-05 23:20:47 +00:00
teknium
e1ece3e64e
Add reasoning configuration support across server implementations
- Updated server classes (OpenAIServer, SGLangServer, TrlVllmServer, VLLMServer) to accept a ReasoningConfig parameter during initialization.
- Enhanced ReasoningConfig to allow flexible max_tokens without strict validation, accommodating varying provider limits.
- Implemented reasoning configuration injection in APIServer methods for chat and completion handling.
- Updated tests to reflect changes in max_tokens validation logic.
This commit integrates reasoning capabilities into the server handling architecture, improving compatibility with diverse reasoning models.
2026-01-05 23:20:01 +00:00
teknium
127a925471
Merge branch 'add_reasoning_handling_draft' of https://github.com/NousResearch/atropos into add_reasoning_handling_draft
2025-12-30 11:59:46 +00:00
teknium
747fbc9285
fix linting
2025-12-30 11:56:21 +00:00
pre-commit-ci[bot]
97047eee7b
[pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
2025-12-30 00:26:33 +00:00
teknium
62fa51240c
Add support for reasoning models and their variety of providers/endpoints
2025-12-30 00:23:00 +00:00
Dakota
8ec5066998
add eval runner
2025-12-19 19:56:59 -06:00
Ksenchi
6c843d6677
Update MANAGED_SERVER.md
2025-11-12 07:22:40 +01:00
Ksenchi
edf26d68e1
Update MANAGED_SERVER.md
2025-11-12 07:22:06 +01:00
Dhyaneesh
39d5fb4452
feat: dump evaluate subcommand config to YAML in env save dir
Automatically save the final merged evaluate configuration to evaluate_config.yaml
in the data_dir_to_save_evals directory. This includes env config, OpenAI/server
configs, and server manager settings, enabling reproducibility and easier
debugging of evaluation runs.
The config is saved after all merging (CLI args > YAML > defaults) to capture
the exact configuration used for the evaluation.
2025-11-08 23:46:13 +05:30
dmahan93
b4080a4f37
Merge pull request #273 from NousResearch/add-vllm-manager-fn
add managed vllm server
2025-11-07 14:22:07 -08:00
Dakota
e6ac3abdcb
add managed vllm server
2025-11-07 13:06:49 -06:00
dmahan93
c96b8a1255
Merge pull request #267 from bobtajson/main
fix: correct typo and improve code quality
2025-11-06 18:36:37 -08:00
dmahan93
b1e164eef5
Merge pull request #264 from NousResearch/add-logprob-server-manager-fn
add sglang specific token level logprob handling and server manager/b…
2025-10-29 13:53:39 -07:00
Dakota
578175a709
fix pre-commit
2025-10-29 14:47:50 -05:00
Dakota
5d6d6bb0dc
add docs :)
2025-10-29 11:26:43 -05:00
Dakota
c3a118f50d
fix tests
2025-10-29 10:55:10 -05:00
Dakota
d5400460e8
made masked logprobs coherently decided on
2025-10-29 10:52:38 -05:00
Dakota
e57c396f86
ran pre-commit
2025-10-29 10:45:27 -05:00