mirror of
https://github.com/NousResearch/atropos.git
synced 2026-04-22 16:48:57 +00:00
- Introduced max_n_completions configuration to limit the number of completions requested per server call. - Updated chat_completion and completion methods to split requests exceeding max_n_completions into multiple calls, merging results accordingly. - Enhanced documentation for max_n_completions in ServerManagerConfig. |
||
|---|---|---|
| .. | ||
| openai_server.py | ||
| server_baseline.py | ||
| server_harness.py | ||
| server_manager.py | ||
| trl_vllm_server.py | ||