teknium
b2d17a44d2
Add README for server handling module and refine ReasoningConfig logic
...
- Introduced a new README.md file detailing the server handling module, including support for reasoning models, provider differences, effort level mappings, and usage examples.
- Cleaned up the ReasoningConfig class by removing unnecessary comments and clarifying logic related to reasoning injection and provider-specific requirements.
2026-01-15 07:21:53 +00:00
teknium
0e187d7869
Update completion handler documentation to clarify that reasoning config is not injected for completions, as it is only supported in chat completions.
2026-01-15 06:44:55 +00:00
teknium
0316cac8d1
Rename is_active method to is_reasoning_kwargs_active in ReasoningConfig for clarity. Update references in the class and corresponding tests to reflect this change.
2026-01-15 06:26:31 +00:00
Teknium
837fc237ee
Merge branch 'main' into add_reasoning_handling_draft
2026-01-12 09:45:38 -08:00
teknium
6aba5244b8
ReasoningConfig documentation to clarify dependency on effort and max_tokens settings. This update specifies that enabling either of these parameters requires reasoning in OpenRouter to be set to Enabled.
2026-01-12 17:45:19 +00:00
teknium
21504537fc
revive _get_server_base_url
2026-01-12 16:49:38 +00:00
pre-commit-ci[bot]
6cfcbdf4d5
[pre-commit.ci] auto fixes from pre-commit.com hooks
...
for more information, see https://pre-commit.ci
2026-01-05 23:20:47 +00:00
teknium
e1ece3e64e
Add reasoning configuration support across server implementations
...
- Updated server classes (OpenAIServer, SGLangServer, TrlVllmServer, VLLMServer) to accept a ReasoningConfig parameter during initialization.
- Enhanced ReasoningConfig to allow flexible max_tokens without strict validation, accommodating varying provider limits.
- Implemented reasoning configuration injection in APIServer methods for chat and completion handling.
- Updated tests to reflect changes in max_tokens validation logic.
This commit integrates reasoning capabilities into the server handling architecture, improving compatibility with diverse reasoning models.
2026-01-05 23:20:01 +00:00
teknium
127a925471
Merge branch 'add_reasoning_handling_draft' of https://github.com/NousResearch/atropos into add_reasoning_handling_draft
2025-12-30 11:59:46 +00:00
teknium
747fbc9285
fix linting
2025-12-30 11:56:21 +00:00
pre-commit-ci[bot]
97047eee7b
[pre-commit.ci] auto fixes from pre-commit.com hooks
...
for more information, see https://pre-commit.ci
2025-12-30 00:26:33 +00:00
teknium
62fa51240c
Add support for reasoning models and their variety of providers/endpoints
2025-12-30 00:23:00 +00:00
Dakota
8ec5066998
add eval runner
2025-12-19 19:56:59 -06:00
Ksenchi
6c843d6677
Update MANAGED_SERVER.md
2025-11-12 07:22:40 +01:00
Ksenchi
edf26d68e1
Update MANAGED_SERVER.md
2025-11-12 07:22:06 +01:00
Dhyaneesh
39d5fb4452
feat: dump evaluate subcommand config to YAML in env save dir
...
Automatically save the final merged evaluate configuration to evaluate_config.yaml
in the data_dir_to_save_evals directory. This includes env config, OpenAI/server
configs, and server manager settings, enabling reproducibility and easier
debugging of evaluation runs.
The config is saved after all merging (CLI args > YAML > defaults) to capture
the exact configuration used for the evaluation.
2025-11-08 23:46:13 +05:30
dmahan93
b4080a4f37
Merge pull request #273 from NousResearch/add-vllm-manager-fn
...
add managed vllm server
2025-11-07 14:22:07 -08:00
Dakota
e6ac3abdcb
add managed vllm server
2025-11-07 13:06:49 -06:00
dmahan93
c96b8a1255
Merge pull request #267 from bobtajson/main
...
fix: correct typo and improve code quality
2025-11-06 18:36:37 -08:00
dmahan93
b1e164eef5
Merge pull request #264 from NousResearch/add-logprob-server-manager-fn
...
add sglang specific token level logprob handling and server manager/b…
2025-10-29 13:53:39 -07:00
Dakota
578175a709
fix pre-commit
2025-10-29 14:47:50 -05:00
Dakota
5d6d6bb0dc
add docs :)
2025-10-29 11:26:43 -05:00
Dakota
c3a118f50d
fix tests
2025-10-29 10:55:10 -05:00
Dakota
d5400460e8
made masked logprobs coherently decided on
2025-10-29 10:52:38 -05:00
Dakota
e57c396f86
ran pre-commit
2025-10-29 10:45:27 -05:00
Dakota
17bb7bdf15
revert base.py
2025-10-29 10:11:05 -05:00
dmahan93
c483840f59
set prompt logprobs to a masked value
2025-10-26 11:58:55 -07:00
dmahan93
c22f8ca81b
Merge remote-tracking branch 'origin/add-logprob-server-manager-fn' into add-logprob-server-manager-fn
2025-10-24 23:18:37 -07:00
dmahan93
5d662bf1aa
add chat example and fix bug in managed_server
2025-10-24 23:15:56 -07:00
pre-commit-ci[bot]
0d80da5146
[pre-commit.ci] auto fixes from pre-commit.com hooks
...
for more information, see https://pre-commit.ci
2025-10-24 20:10:29 +00:00
dmahan93
7bf4cfbf80
add managed server to make grabbing logprobs easier w/ tokenized items
2025-10-24 13:09:46 -07:00
bobtajson
6e2d36bd2a
Update base.py
2025-10-23 10:27:23 +02:00
pre-commit-ci[bot]
312f8859e3
[pre-commit.ci] auto fixes from pre-commit.com hooks
...
for more information, see https://pre-commit.ci
2025-10-16 18:47:07 +00:00
Dakota
d240dbb3b7
Merge remote-tracking branch 'origin/add-logprob-server-manager-fn' into add-logprob-server-manager-fn
2025-10-16 13:46:03 -05:00
Dakota
134cbc09d0
update openai/trl_vllm server with new fn
2025-10-16 13:45:55 -05:00
pre-commit-ci[bot]
1e6a745491
[pre-commit.ci] auto fixes from pre-commit.com hooks
...
for more information, see https://pre-commit.ci
2025-10-16 17:39:04 +00:00
Dakota
c36ec29656
add sglang specific token level logprob handling and server manager/baseline logprob/token fn
2025-10-16 12:38:03 -05:00
ropresearch
e5b8fb8654
clean up
2025-10-10 11:50:39 -04:00
ropresearch
baf4b2d8a8
gzip compression for atropos api
2025-10-10 01:26:52 -04:00
ropresearch
c3fc68879c
group temps, sample temps, and logprob api params
2025-09-25 16:41:58 -04:00
shannonsands
1a808e2038
Revert "Fix multiple scored data groups ( #223 )"
...
This reverts commit 67b3144113 .
2025-08-29 17:55:45 +10:00
shannonsands
67b3144113
Fix multiple scored data groups ( #223 )
...
* removed changes to other files
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* fail on scores empty
---------
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2025-08-29 15:47:32 +10:00
Dakota
11f1303da0
add error logging to collect_trajectories so they don't fail silently
2025-08-15 16:34:21 -05:00
pre-commit-ci[bot]
3d2d9e67fa
[pre-commit.ci] auto fixes from pre-commit.com hooks
...
for more information, see https://pre-commit.ci
2025-07-15 11:42:46 +00:00
Alexey Gorbatovski
53984580c8
Bug fix
2025-07-15 14:37:55 +03:00
hjc-puro
72210cf4ad
rename fn
2025-07-11 04:04:55 +00:00
hjc-puro
d133ba3867
comment
2025-07-11 03:54:03 +00:00
hjc-puro
ccb8eaf230
move table to util
2025-07-11 03:52:24 +00:00
hjc-puro
5e61331360
simplify schema
2025-07-11 03:49:49 +00:00
hjc-puro
290e087fc5
remove some imports
2025-07-11 03:25:10 +00:00