Commit graph

89 commits

Author SHA1 Message Date
Krishnan Srinivasan
bd35adabb4 vibe-coded xitter, v0 2025-05-18 15:04:37 -04:00
hjc-puro
76bdace9c2
Merge pull request #56 from NousResearch/check_server_status_task_404
changed health check to chat completions since all oai models are com…
2025-05-18 22:49:03 +08:00
Shannon Sands
76736ec4c2 removed unused 2025-05-17 13:37:55 -07:00
Shannon Sands
c05d9f7f53 added health check flag to skip entirely 2025-05-17 13:36:26 -07:00
Shannon Sands
f84934363c linting 2025-05-17 13:06:16 -07:00
Shannon Sands
90138376f9 thanks cursor 2025-05-17 13:00:35 -07:00
Shannon Sands
59b08a1aa9 Added flag for chat completion, switch in OpenAI client 2025-05-17 12:59:20 -07:00
Shannon Sands
81534f7c8a message 2025-05-17 12:50:35 -07:00
Shannon Sands
7dc3938038 changed health check to chat completions since all oai models are compatible with that, and it should work with compatible servers 2025-05-17 12:46:50 -07:00
Shannon Sands
edf2beaa32 linting 2025-05-16 20:40:15 -07:00
Shannon Sands
78fc747230 wording 2025-05-16 17:49:22 -07:00
Shannon Sands
263127ec63 Added FAQ.md, linked to it from README 2025-05-16 17:44:20 -07:00
Shannon Sands
00dd120067 Merge branch 'main' into blackjack2-env 2025-05-14 17:27:44 -07:00
Shannon Sands
67cfd961c5 linting 2025-05-14 14:01:31 -07:00
Shannon Sands
d6f9d58606 new env runs locally 2025-05-14 11:57:45 -07:00
Shannon Sands
21cc528b85 move best-of-n selection to util 2025-05-14 10:35:12 -07:00
Shannon Sands
4c00e2b209 move message history out to utils 2025-05-14 10:13:56 -07:00
dmahan93
df3fa0af9f move argparse to main() 2025-05-14 09:57:08 -05:00
dmahan93
96a3476bec Refactor API server initialization to accept command-line arguments, and default reload to False 2025-05-14 09:12:14 -05:00
dmahan93
f7552c9c6f
make default not slurm 2025-05-13 13:11:04 -05:00
dmahan93
553d45cc7e start servers as healthy so they can get a chance to check... 2025-05-13 11:17:23 -05:00
dmahan93
42c47ebc49 add docs :) 2025-05-13 10:28:17 -05:00
dmahan93
8b864e9786 move server_type to serverbaseline instead so it can be used as well for server instantiation 2025-05-13 10:21:58 -05:00
dmahan93
6fc356e76e fix type checking to substantiate an appropriate class instead of the abstract server class 2025-05-13 10:09:36 -05:00
dmahan93
e2128b817e restructure config_init... 2025-05-13 10:00:45 -05:00
dmahan93
bb658631e9 fix bad import from restructuring... 2025-05-13 09:58:16 -05:00
dmahan93
df62979b90 refactor to not mess up process... 2025-05-13 09:22:07 -05:00
dmahan93
6e9405ba95 Fix bad merge 2025-05-12 20:02:54 -05:00
dmahan93
0aaf59fc9a add trl server
add gsm8k example for axolotl checking
2025-05-12 19:04:46 -05:00
dmahan93
96be544228 Merge commit '71e7a5ca27' into add-support-for-custom-api-servers 2025-05-12 18:40:35 -05:00
dmahan93
71e7a5ca27
Merge pull request #41 from NousResearch/workaround-provider-ignoring-n-kwarg-openai-api
Add n kwarg being ignored workaround
2025-05-12 18:19:47 -05:00
Shannon Sands
36f6822d71 Merge branch 'main' into blackjack2-env 2025-05-13 07:54:04 +10:00
Shannon Sands
d980acfaf9 linting 2025-05-13 07:52:03 +10:00
Shannon Sands
e480c30b8b removed new fn 2025-05-13 07:49:28 +10:00
dmahan93
1aa72d7e7e Add n kwarg being ignored workaround 2025-05-12 12:06:03 -05:00
dmahan93
727c7ba640 Remove dependency on torch for default installation 2025-05-12 10:17:41 -05:00
dmahan93
706097db21
Merge pull request #36 from NousResearch/add-gym-frozen-lake-example
add gym taxi env
2025-05-12 08:49:11 -05:00
Shannon Sands
101cbdd803 Merge branch 'main' into blackjack2-env 2025-05-12 07:22:24 +10:00
Shannon Sands
f69b16357b removed unused fn 2025-05-10 21:29:08 +10:00
hjc-puro
e68df555ba use parse_http_rseponse 2025-05-10 05:12:08 -04:00
hjc-puro
a659217afe
Merge branch 'main' into 2025-05-03-http-error-logging 2025-05-10 17:09:22 +08:00
dmahan93
1fe7deae47 Merge commit 'b386960d78' into add-gym-frozen-lake-example 2025-05-09 19:23:16 -05:00
dmahan93
37f040a883 fix pre-commit 2025-05-09 19:14:45 -05:00
dmahan93
92428fec8f add gym taxi env 2025-05-09 19:05:01 -05:00
Shannon Sands
7fe1a40368 readd multistep masking 2025-05-10 09:24:55 +10:00
Shannon Sands
4d0f919fd1 linting 2025-05-10 09:10:31 +10:00
Shannon Sands
6c6a1c5d06 update handle_send_to_api 2025-05-10 09:07:54 +10:00
Shannon Sands
c506bb147e simplified config and reward 2025-05-10 08:04:39 +10:00
Shannon Sands
a7dfd377da moving env to clean branch 2025-05-10 07:44:29 +10:00
Shannon Sands
4f6a0014bc precommit 2025-05-10 07:30:57 +10:00