Commit graph

27 commits

Author SHA1 Message Date
teknium1
287bbcd356 some cleanup for final merge 2025-05-16 19:24:50 -07:00
teknium1
daa6f0ff18 add stricter enforcement of think tags 2025-05-16 13:18:20 -07:00
teknium1
6ae0703ad6 fix some regex and show special tokens for completions table 2025-05-15 22:29:42 -07:00
teknium1
24c571654e match num_max_requests with groupsize 2025-05-15 15:57:39 -07:00
hjc-puro
dcda88d79b fix validation errors 2025-05-15 04:30:59 -07:00
teknium1
1a9fa016b5 add dependencies to the env readme 2025-05-14 19:44:13 -07:00
teknium1
90e235a3e9 update environments readme 2025-05-14 19:40:32 -07:00
teknium1
2ab8905d4f fix score 2025-05-14 19:35:43 -07:00
teknium1
8a0e107806 change eval set size since this is a small dataset we need mo data for trainn 2025-05-14 19:18:01 -07:00
teknium1
bcc38567ca update some dataset stuff to use allenai's 2025-05-14 18:39:31 -07:00
teknium1
881af55f9a add instruction following algo env 2025-05-14 18:31:05 -07:00
dmahan93
6e9405ba95 Fix bad merge 2025-05-12 20:02:54 -05:00
dmahan93
0aaf59fc9a add trl server
add gsm8k example for axolotl checking
2025-05-12 19:04:46 -05:00
dmahan93
96be544228 Merge commit '71e7a5ca27' into add-support-for-custom-api-servers 2025-05-12 18:40:35 -05:00
dmahan93
92428fec8f add gym taxi env 2025-05-09 19:05:01 -05:00
dmahan93
40b12dae60 run pre-commit on all files 2025-05-09 09:54:20 -05:00
dmahan93
b959c30ebf Merge pull request #31 from NousResearch/fix-math-evals-due-to-updated-dataset
fix olympiadbench due to upstream changes
2025-05-09 09:42:06 -05:00
dmahan93
e09ae8d3d3 fix olympiadbench due to upstream changes 2025-05-09 09:41:10 -05:00
hjc-puro
629d8c1731 Merge pull request #14 from NousResearch/2025-05-02-server-cli 2025-05-09 13:37:54 +08:00
dmahan93
70cf61c210 add custom server support 2025-05-08 12:01:49 -05:00
Artem Yatsenko
0f15be68a2 fix multimodal envs. add view_run_multimodal 2025-05-07 21:53:01 +00:00
edmund
2cb1ff0087 Removed mentions of NousResearch/DeepHermes-3-Llama-3-1B-Preview and swapped it for NousResearch/DeepHermes-3-Llama-3-3B-Preview
I don't think there is a NousResearch/DeepHermes-3-Llama-3-1B-Preview
2025-05-07 18:03:17 +01:00
teknium1
d2dbab7d22 Add additional completions table info: metric, magnitude, and direction for ground truth 2025-05-04 03:30:50 -07:00
teknium1
c3b80832e9 lowering the defaults for fundamental finance env 2025-05-04 03:05:25 -07:00
hjc-puro
4348dd2ec1 hide complicated openai config override behavior somewhere else 2025-05-03 14:18:50 -07:00
teknium1
a2e36227aa add metric logging 2025-05-02 02:34:17 -07:00
Dakota Nous
621d00dd80 first commit 2025-04-29 12:10:10 -07:00