teknium1
|
287bbcd356
|
some cleanup for final merge
|
2025-05-16 19:24:50 -07:00 |
|
teknium1
|
daa6f0ff18
|
add stricter enforcement of think tags
|
2025-05-16 13:18:20 -07:00 |
|
teknium1
|
6ae0703ad6
|
fix some regex and show special tokens for completions table
|
2025-05-15 22:29:42 -07:00 |
|
teknium1
|
24c571654e
|
match num_max_requests with groupsize
|
2025-05-15 15:57:39 -07:00 |
|
hjc-puro
|
dcda88d79b
|
fix validation errors
|
2025-05-15 04:30:59 -07:00 |
|
teknium1
|
1a9fa016b5
|
add dependencies to the env readme
|
2025-05-14 19:44:13 -07:00 |
|
teknium1
|
90e235a3e9
|
update environments readme
|
2025-05-14 19:40:32 -07:00 |
|
teknium1
|
2ab8905d4f
|
fix score
|
2025-05-14 19:35:43 -07:00 |
|
teknium1
|
8a0e107806
|
change eval set size since this is a small dataset we need mo data for trainn
|
2025-05-14 19:18:01 -07:00 |
|
teknium1
|
bcc38567ca
|
update some dataset stuff to use allenai's
|
2025-05-14 18:39:31 -07:00 |
|
teknium1
|
881af55f9a
|
add instruction following algo env
|
2025-05-14 18:31:05 -07:00 |
|
dmahan93
|
6e9405ba95
|
Fix bad merge
|
2025-05-12 20:02:54 -05:00 |
|
dmahan93
|
0aaf59fc9a
|
add trl server
add gsm8k example for axolotl checking
|
2025-05-12 19:04:46 -05:00 |
|
dmahan93
|
96be544228
|
Merge commit '71e7a5ca27' into add-support-for-custom-api-servers
|
2025-05-12 18:40:35 -05:00 |
|
dmahan93
|
92428fec8f
|
add gym taxi env
|
2025-05-09 19:05:01 -05:00 |
|
dmahan93
|
40b12dae60
|
run pre-commit on all files
|
2025-05-09 09:54:20 -05:00 |
|
dmahan93
|
b959c30ebf
|
Merge pull request #31 from NousResearch/fix-math-evals-due-to-updated-dataset
fix olympiadbench due to upstream changes
|
2025-05-09 09:42:06 -05:00 |
|
dmahan93
|
e09ae8d3d3
|
fix olympiadbench due to upstream changes
|
2025-05-09 09:41:10 -05:00 |
|
hjc-puro
|
629d8c1731
|
Merge pull request #14 from NousResearch/2025-05-02-server-cli
|
2025-05-09 13:37:54 +08:00 |
|
dmahan93
|
70cf61c210
|
add custom server support
|
2025-05-08 12:01:49 -05:00 |
|
Artem Yatsenko
|
0f15be68a2
|
fix multimodal envs. add view_run_multimodal
|
2025-05-07 21:53:01 +00:00 |
|
edmund
|
2cb1ff0087
|
Removed mentions of NousResearch/DeepHermes-3-Llama-3-1B-Preview and swapped it for NousResearch/DeepHermes-3-Llama-3-3B-Preview
I don't think there is a NousResearch/DeepHermes-3-Llama-3-1B-Preview
|
2025-05-07 18:03:17 +01:00 |
|
teknium1
|
d2dbab7d22
|
Add additional completions table info: metric, magnitude, and direction for ground truth
|
2025-05-04 03:30:50 -07:00 |
|
teknium1
|
c3b80832e9
|
lowering the defaults for fundamental finance env
|
2025-05-04 03:05:25 -07:00 |
|
hjc-puro
|
4348dd2ec1
|
hide complicated openai config override behavior somewhere else
|
2025-05-03 14:18:50 -07:00 |
|
teknium1
|
a2e36227aa
|
add metric logging
|
2025-05-02 02:34:17 -07:00 |
|
Dakota Nous
|
621d00dd80
|
first commit
|
2025-04-29 12:10:10 -07:00 |
|