Jai Suphavadeeprasit
b91922082e
managed_Server pass through and centralize sem logic
2026-03-05 15:46:33 -05:00
Jai Suphavadeeprasit
5aaf7a346c
prompt logprobs simplicity
2026-03-03 22:06:49 -05:00
Jai Suphavadeeprasit
439b9b129b
prompt logprobs
2026-03-03 21:58:05 -05:00
Jai Suphavadeeprasit
b9291aa29f
init commit
2026-03-03 11:32:09 -05:00
Dakota
7d6aeb9bbf
add tokenizer name config to set the vllm/sglang tokenizer to something different if needed
2026-02-09 15:26:29 -06:00
teknium
681616844d
linter....
2026-01-15 07:44:53 +00:00
teknium
45d47fbf56
Refactor reasoning configuration check in APIServer class
...
- Removed unnecessary commented-out code and simplified the logic for checking if reasoning is configured and active. This enhances code readability and maintainability.
2026-01-15 07:43:21 +00:00
teknium
c2e7b3700e
Merge branch 'add_reasoning_handling_draft' of https://github.com/NousResearch/atropos into add_reasoning_handling_draft
2026-01-15 07:22:47 +00:00
teknium
b2d17a44d2
Add README for server handling module and refine ReasoningConfig logic
...
- Introduced a new README.md file detailing the server handling module, including support for reasoning models, provider differences, effort level mappings, and usage examples.
- Cleaned up the ReasoningConfig class by removing unnecessary comments and clarifying logic related to reasoning injection and provider-specific requirements.
2026-01-15 07:21:53 +00:00
teknium
0e187d7869
Update completion handler documentation to clarify that reasoning config is not injected for completions, as it is only supported in chat completions.
2026-01-15 06:44:55 +00:00
pre-commit-ci[bot]
b6e24266b2
[pre-commit.ci] auto fixes from pre-commit.com hooks
...
for more information, see https://pre-commit.ci
2026-01-15 06:27:19 +00:00
teknium
0316cac8d1
Rename is_active method to is_reasoning_kwargs_active in ReasoningConfig for clarity. Update references in the class and corresponding tests to reflect this change.
2026-01-15 06:26:31 +00:00
teknium
6aba5244b8
ReasoningConfig documentation to clarify dependency on effort and max_tokens settings. This update specifies that enabling either of these parameters requires reasoning in OpenRouter to be set to Enabled.
2026-01-12 17:45:19 +00:00
pre-commit-ci[bot]
6cfcbdf4d5
[pre-commit.ci] auto fixes from pre-commit.com hooks
...
for more information, see https://pre-commit.ci
2026-01-05 23:20:47 +00:00
teknium
e1ece3e64e
Add reasoning configuration support across server implementations
...
- Updated server classes (OpenAIServer, SGLangServer, TrlVllmServer, VLLMServer) to accept a ReasoningConfig parameter during initialization.
- Enhanced ReasoningConfig to allow flexible max_tokens without strict validation, accommodating varying provider limits.
- Implemented reasoning configuration injection in APIServer methods for chat and completion handling.
- Updated tests to reflect changes in max_tokens validation logic.
This commit integrates reasoning capabilities into the server handling architecture, improving compatibility with diverse reasoning models.
2026-01-05 23:20:01 +00:00
teknium
747fbc9285
fix linting
2025-12-30 11:56:21 +00:00
teknium
62fa51240c
Add support for reasoning models and their variety of providers/endpoints
2025-12-30 00:23:00 +00:00
Dakota
e6ac3abdcb
add managed vllm server
2025-11-07 13:06:49 -06:00
Dakota
c36ec29656
add sglang specific token level logprob handling and server manager/baseline logprob/token fn
2025-10-16 12:38:03 -05:00
Shannon Sands
76736ec4c2
removed unused
2025-05-17 13:37:55 -07:00
Shannon Sands
c05d9f7f53
added health check flag to skip entirely
2025-05-17 13:36:26 -07:00
Shannon Sands
f84934363c
linting
2025-05-17 13:06:16 -07:00
Shannon Sands
59b08a1aa9
Added flag for chat completion, switch in OpenAI client
2025-05-17 12:59:20 -07:00
dmahan93
553d45cc7e
start servers as healthy so they can get a chance to check...
2025-05-13 11:17:23 -05:00
dmahan93
8b864e9786
move server_type to serverbaseline instead so it can be used as well for server instantiation
2025-05-13 10:21:58 -05:00
dmahan93
bb658631e9
fix bad import from restructuring...
2025-05-13 09:58:16 -05:00
dmahan93
df62979b90
refactor to not mess up process...
2025-05-13 09:22:07 -05:00
dmahan93
0aaf59fc9a
add trl server
...
add gsm8k example for axolotl checking
2025-05-12 19:04:46 -05:00
dmahan93
96be544228
Merge commit ' 71e7a5ca27' into add-support-for-custom-api-servers
2025-05-12 18:40:35 -05:00
dmahan93
70cf61c210
add custom server support
2025-05-08 12:01:49 -05:00