Commit graph

14 commits

Author SHA1 Message Date
pre-commit-ci[bot]
1e6a745491 [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
2025-10-16 17:39:04 +00:00
Dakota
c36ec29656 add sglang specific token level logprob handling and server manager/baseline logprob/token fn 2025-10-16 12:38:03 -05:00
Alexey Gorbatovski
35c542328a Fix infinite loop in wait_for_sem by updating semaphore values inside loop 2025-07-06 00:27:45 +03:00
dmahan93
44b96c7b6c Add max_n_completions parameter to ServerManager for handling multiple completions
- Introduced max_n_completions configuration to limit the number of completions requested per server call.
- Updated chat_completion and completion methods to split requests exceeding max_n_completions into multiple calls, merging results accordingly.
- Enhanced documentation for max_n_completions in ServerManagerConfig.
2025-06-02 11:11:55 -05:00
dmahan93
f7552c9c6f make default not slurm 2025-05-13 13:11:04 -05:00
dmahan93
8b864e9786 move server_type to serverbaseline instead so it can be used as well for server instantiation 2025-05-13 10:21:58 -05:00
dmahan93
6fc356e76e fix type checking to substantiate an appropriate class instead of the abstract server class 2025-05-13 10:09:36 -05:00
dmahan93
df62979b90 refactor to not mess up process... 2025-05-13 09:22:07 -05:00
dmahan93
6e9405ba95 Fix bad merge 2025-05-12 20:02:54 -05:00
dmahan93
0aaf59fc9a add trl server
add gsm8k example for axolotl checking
2025-05-12 19:04:46 -05:00
dmahan93
96be544228 Merge commit '71e7a5ca27' into add-support-for-custom-api-servers 2025-05-12 18:40:35 -05:00
dmahan93
8ff48065a3 Update server_manager.py to not continue to API config stuff if serverbaseline is set 2025-05-08 20:18:15 -05:00
dmahan93
70cf61c210 add custom server support 2025-05-08 12:01:49 -05:00
Dakota Nous
621d00dd80 first commit 2025-04-29 12:10:10 -07:00
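
Note on 44b96c7b6c: the commit message describes splitting any request that exceeds max_n_completions into multiple calls and merging the results. The sketch below illustrates that idea only; it is not the code from the commit. The names split_n, complete_in_chunks, and fake_call are hypothetical, and issuing the calls concurrently is an assumption, since the message does not say how the calls are scheduled.

```python
import asyncio
from typing import Awaitable, Callable, List


def split_n(n: int, max_n: int) -> List[int]:
    """Split a requested completion count into per-call counts of at most max_n."""
    if n < 0 or max_n < 1:
        raise ValueError("n must be >= 0 and max_n must be >= 1")
    full, rest = divmod(n, max_n)
    return [max_n] * full + ([rest] if rest else [])


async def complete_in_chunks(
    call: Callable[[int], Awaitable[List[str]]],  # hypothetical server call
    n: int,
    max_n: int,
) -> List[str]:
    """Issue one call per chunk and merge the results in chunk order."""
    # Assumption: calls run concurrently; a sequential loop would merge the same way.
    chunks = await asyncio.gather(*(call(k) for k in split_n(n, max_n)))
    return [choice for chunk in chunks for choice in chunk]


async def fake_call(k: int) -> List[str]:
    # Stand-in for a real server call; returns k placeholder completions.
    return [f"completion-{i}" for i in range(k)]


if __name__ == "__main__":
    merged = asyncio.run(complete_in_chunks(fake_call, n=10, max_n=4))
    print(len(merged))
```

With n=10 and max_n=4 the request is split into calls of 4, 4, and 2 completions, and the merged list holds all 10 results.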