pre-commit-ci[bot]
|
77e14199ce
|
[pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
|
2025-11-14 09:53:34 +00:00 |
|
teknium
|
46f05673aa
|
convert instruct following env to use managedserver
|
2025-11-14 09:52:02 +00:00 |
|
teknium
|
6d6a02eb38
|
convert instruction following env to use managed server
|
2025-11-14 09:49:04 +00:00 |
|
teknium1
|
81631b9c59
|
Merge branch 'updates-to-instructfollowing-env' of https://github.com/NousResearch/atropos into updates-to-instructfollowing-env
|
2025-06-14 12:32:31 -07:00 |
|
teknium1
|
bf78ad44e3
|
Add optional solve flagging strategy
|
2025-06-14 12:32:27 -07:00 |
|
pre-commit-ci[bot]
|
7fa9980b5c
|
[pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
|
2025-06-14 14:47:51 +00:00 |
|
teknium1
|
ad1bdf7f80
|
Add cycling curriculum, difficulty threshold, update datadumps
|
2025-06-14 07:44:47 -07:00 |
|
teknium1
|
287bbcd356
|
some cleanup for final merge
|
2025-05-16 19:24:50 -07:00 |
|
teknium1
|
daa6f0ff18
|
add stricter enforcement of think tags
|
2025-05-16 13:18:20 -07:00 |
|
teknium1
|
6ae0703ad6
|
fix some regex and show special tokens for completions table
|
2025-05-15 22:29:42 -07:00 |
|
teknium1
|
24c571654e
|
match num_max_requests with groupsize
|
2025-05-15 15:57:39 -07:00 |
|
hjc-puro
|
dcda88d79b
|
fix validation errors
|
2025-05-15 04:30:59 -07:00 |
|
teknium1
|
2ab8905d4f
|
fix score
|
2025-05-14 19:35:43 -07:00 |
|
teknium1
|
8a0e107806
|
change eval set size since this is a small dataset we need mo data for trainn
|
2025-05-14 19:18:01 -07:00 |
|
teknium1
|
bcc38567ca
|
update some dataset stuff to use allenai's
|
2025-05-14 18:39:31 -07:00 |
|
teknium1
|
881af55f9a
|
add instruction following algo env
|
2025-05-14 18:31:05 -07:00 |
|