Commit graph

26 commits

Author SHA1 Message Date
Dakota
7d6aeb9bbf add tokenizer name config to set the vllm/sglang tokenizer to something different if needed 2026-02-09 15:26:29 -06:00
teknium
681616844d linter.... 2026-01-15 07:44:53 +00:00
teknium
45d47fbf56 Refactor reasoning configuration check in APIServer class
- Removed unnecessary commented-out code and simplified the logic for checking if reasoning is configured and active. This enhances code readability and maintainability.
2026-01-15 07:43:21 +00:00
teknium
c2e7b3700e Merge branch 'add_reasoning_handling_draft' of https://github.com/NousResearch/atropos into add_reasoning_handling_draft 2026-01-15 07:22:47 +00:00
teknium
b2d17a44d2 Add README for server handling module and refine ReasoningConfig logic
- Introduced a new README.md file detailing the server handling module, including support for reasoning models, provider differences, effort level mappings, and usage examples.
- Cleaned up the ReasoningConfig class by removing unnecessary comments and clarifying logic related to reasoning injection and provider-specific requirements.
2026-01-15 07:21:53 +00:00
teknium
0e187d7869 Update completion handler documentation to clarify that reasoning config is not injected for completions, as it is only supported in chat completions. 2026-01-15 06:44:55 +00:00
pre-commit-ci[bot]
b6e24266b2 [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
2026-01-15 06:27:19 +00:00
teknium
0316cac8d1 Rename is_active method to is_reasoning_kwargs_active in ReasoningConfig for clarity. Update references in the class and corresponding tests to reflect this change. 2026-01-15 06:26:31 +00:00
teknium
6aba5244b8 Update ReasoningConfig documentation to clarify dependency on effort and max_tokens settings. This update specifies that enabling either of these parameters requires reasoning in OpenRouter to be set to Enabled. 2026-01-12 17:45:19 +00:00
pre-commit-ci[bot]
6cfcbdf4d5 [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
2026-01-05 23:20:47 +00:00
teknium
e1ece3e64e Add reasoning configuration support across server implementations
- Updated server classes (OpenAIServer, SGLangServer, TrlVllmServer, VLLMServer) to accept a ReasoningConfig parameter during initialization.
- Enhanced ReasoningConfig to allow flexible max_tokens without strict validation, accommodating varying provider limits.
- Implemented reasoning configuration injection in APIServer methods for chat and completion handling.
- Updated tests to reflect changes in max_tokens validation logic.

This commit integrates reasoning capabilities into the server handling architecture, improving compatibility with diverse reasoning models.
2026-01-05 23:20:01 +00:00
teknium
747fbc9285 fix linting 2025-12-30 11:56:21 +00:00
teknium
62fa51240c Add support for reasoning models and their variety of providers/endpoints 2025-12-30 00:23:00 +00:00
Dakota
e6ac3abdcb add managed vllm server 2025-11-07 13:06:49 -06:00
Dakota
c36ec29656 add sglang specific token level logprob handling and server manager/baseline logprob/token fn 2025-10-16 12:38:03 -05:00
Shannon Sands
76736ec4c2 removed unused 2025-05-17 13:37:55 -07:00
Shannon Sands
c05d9f7f53 added health check flag to skip entirely 2025-05-17 13:36:26 -07:00
Shannon Sands
f84934363c linting 2025-05-17 13:06:16 -07:00
Shannon Sands
59b08a1aa9 Added flag for chat completion, switch in OpenAI client 2025-05-17 12:59:20 -07:00
dmahan93
553d45cc7e start servers as healthy so they can get a chance to check... 2025-05-13 11:17:23 -05:00
dmahan93
8b864e9786 move server_type to serverbaseline instead so it can be used as well for server instantiation 2025-05-13 10:21:58 -05:00
dmahan93
bb658631e9 fix bad import from restructuring... 2025-05-13 09:58:16 -05:00
dmahan93
df62979b90 refactor to not mess up process... 2025-05-13 09:22:07 -05:00
dmahan93
0aaf59fc9a add trl server
add gsm8k example for axolotl checking
2025-05-12 19:04:46 -05:00
dmahan93
96be544228 Merge commit '71e7a5ca27' into add-support-for-custom-api-servers 2025-05-12 18:40:35 -05:00
dmahan93
70cf61c210 add custom server support 2025-05-08 12:01:49 -05:00