Teknium
|
bdb15e5d85
|
Merge pull request #21 from NousResearch/add-sft-data-env
add an SFT data loading env
|
2025-05-17 11:30:07 -07:00 |
|
shannonsands
|
b1c9b23956
|
Merge pull request #50 from NousResearch/readme-updates
Added new env info
|
2025-05-17 10:22:20 -07:00 |
|
artem
|
651b40968d
|
Merge pull request #51 from NousResearch/kernelbench_env
Kernelbench env with parallel compilation
|
2025-05-16 21:38:57 -07:00 |
|
Artem Yatsenko
|
4b1c07232c
|
readme updates
|
2025-05-17 04:37:09 +00:00 |
|
Artem Yatsenko
|
5396975bab
|
move to its own folder, add readme, citation.
|
2025-05-17 04:34:49 +00:00 |
|
Shannon Sands
|
edf2beaa32
|
linting
|
2025-05-16 20:40:15 -07:00 |
|
Artem Yatsenko
|
9875f5dc06
|
pre-commit. add KB path as variable
|
2025-05-17 03:17:14 +00:00 |
|
Artem Yatsenko
|
fbfe06771b
|
add parallel compile. add instructions
|
2025-05-16 19:53:46 -07:00 |
|
Shannon Sands
|
41caa05a1a
|
remvoed merge error
|
2025-05-16 19:49:37 -07:00 |
|
Shannon Sands
|
9753d5a122
|
resolved conflict
|
2025-05-16 19:48:15 -07:00 |
|
Teknium
|
c9b6534eae
|
Merge pull request #44 from NousResearch/instruction-following-algo-environment
Instruction following algo environment
|
2025-05-16 19:37:00 -07:00 |
|
teknium1
|
20d263a495
|
add citation to allenai
|
2025-05-16 19:34:51 -07:00 |
|
teknium1
|
287bbcd356
|
some cleanup for final merge
|
2025-05-16 19:24:50 -07:00 |
|
Shannon Sands
|
fd63c76a5c
|
Added new env info
|
2025-05-16 16:44:33 -07:00 |
|
teknium1
|
daa6f0ff18
|
add stricter enforcement of think tags
|
2025-05-16 13:18:20 -07:00 |
|
Artem Yatsenko
|
45aece515f
|
first commit for kernelbench env
|
2025-05-16 11:29:10 -07:00 |
|
teknium1
|
6ae0703ad6
|
fix some regex and show special tokens for completions table
|
2025-05-15 22:29:42 -07:00 |
|
teknium1
|
24c571654e
|
match num_max_requests with groupsize
|
2025-05-15 15:57:39 -07:00 |
|
Shannon Sands
|
bfb822f1e0
|
updated APIServerConfig and added requirements.txt and install instructions to README
|
2025-05-15 12:22:00 -07:00 |
|
hjc-puro
|
dcda88d79b
|
fix validation errors
|
2025-05-15 04:30:59 -07:00 |
|
teknium1
|
1a9fa016b5
|
add dependencies to the env readme
|
2025-05-14 19:44:13 -07:00 |
|
teknium1
|
90e235a3e9
|
update environments readme
|
2025-05-14 19:40:32 -07:00 |
|
teknium1
|
2ab8905d4f
|
fix score
|
2025-05-14 19:35:43 -07:00 |
|
teknium1
|
8a0e107806
|
change eval set size since this is a small dataset we need mo data for trainn
|
2025-05-14 19:18:01 -07:00 |
|
teknium1
|
bcc38567ca
|
update some dataset stuff to use allenai's
|
2025-05-14 18:39:31 -07:00 |
|
teknium1
|
881af55f9a
|
add instruction following algo env
|
2025-05-14 18:31:05 -07:00 |
|
Shannon Sands
|
c72a27d376
|
fixed linting in latest main
|
2025-05-14 17:29:57 -07:00 |
|
Shannon Sands
|
00dd120067
|
Merge branch 'main' into blackjack2-env
|
2025-05-14 17:27:44 -07:00 |
|
Shannon Sands
|
8fad665f6a
|
moved folder location
|
2025-05-14 17:22:30 -07:00 |
|
Shannon Sands
|
c2bf3f5acd
|
moved folder location
|
2025-05-14 17:22:18 -07:00 |
|
Joe Li
|
c1ae25c202
|
Merge pull request #26 from NousResearch/coding_server
add code execution environment
|
2025-05-14 15:08:10 -07:00 |
|
Shannon Sands
|
3fba8e3527
|
linting
|
2025-05-14 14:22:25 -07:00 |
|
Shannon Sands
|
d8ab1a6758
|
linting
|
2025-05-14 14:20:54 -07:00 |
|
Shannon Sands
|
1a7c0294fa
|
refactoring for more clarity
|
2025-05-14 14:18:43 -07:00 |
|
Shannon Sands
|
bb6c205efe
|
Linting
|
2025-05-14 14:05:52 -07:00 |
|
Shannon Sands
|
67cfd961c5
|
linting
|
2025-05-14 14:01:31 -07:00 |
|
Shannon Sands
|
826de9e283
|
Updated README
|
2025-05-14 13:57:20 -07:00 |
|
Shannon Sands
|
f5172b45a8
|
Added README
|
2025-05-14 13:35:15 -07:00 |
|
Shannon Sands
|
85f462df5e
|
Updated test scripts
|
2025-05-14 12:05:59 -07:00 |
|
Shannon Sands
|
d6f9d58606
|
new env runs locally
|
2025-05-14 11:57:45 -07:00 |
|
Shannon Sands
|
54ae40840d
|
no-thinking env added
|
2025-05-14 11:28:39 -07:00 |
|
Shannon Sands
|
21cc528b85
|
move best-of-n selection to util
|
2025-05-14 10:35:12 -07:00 |
|
Shannon Sands
|
4c00e2b209
|
move message history out to utils
|
2025-05-14 10:13:56 -07:00 |
|
dmahan93
|
6e9405ba95
|
Fix bad merge
|
2025-05-12 20:02:54 -05:00 |
|
dmahan93
|
0aaf59fc9a
|
add trl server
add gsm8k example for axolotl checking
|
2025-05-12 19:04:46 -05:00 |
|
dmahan93
|
96be544228
|
Merge commit '71e7a5ca27' into add-support-for-custom-api-servers
|
2025-05-12 18:40:35 -05:00 |
|
Shannon Sands
|
8cd9e4d776
|
made private collect_trajectory re changes
|
2025-05-13 07:58:48 +10:00 |
|
Shannon Sands
|
36f6822d71
|
Merge branch 'main' into blackjack2-env
|
2025-05-13 07:54:04 +10:00 |
|
Shannon Sands
|
e480c30b8b
|
removed new fn
|
2025-05-13 07:49:28 +10:00 |
|
Shannon Sands
|
220b92be47
|
Linting and cleanup
|
2025-05-10 21:15:00 +10:00 |
|