Commit graph

41 commits

Author SHA1 Message Date
teknium
bcfbd647e3 fix some bugs 2025-12-28 04:09:34 +00:00
teknium
830a129655 add phybench eval 2025-12-28 01:44:20 +00:00
pre-commit-ci[bot]
d04f8c0ae7 [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
2025-12-25 09:54:56 +00:00
teknium
8435371d80 linty 2025-12-25 09:54:11 +00:00
pre-commit-ci[bot]
269fb71713 [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
2025-12-25 09:49:33 +00:00
teknium
9e9f1cd88e Merge branch 'port_many_evals' of https://github.com/NousResearch/atropos into port_many_evals 2025-12-25 09:48:50 +00:00
teknium
c871f6a56a fix eval ctx len 2025-12-25 09:48:47 +00:00
pre-commit-ci[bot]
6bb6a5976d [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
2025-12-24 23:37:21 +00:00
teknium
85296c519e hopefully final linter fixes lol 2025-12-24 23:36:36 +00:00
pre-commit-ci[bot]
67869c3a79 [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
2025-12-24 11:05:18 +00:00
teknium
148333a23b Merge branch 'port_many_evals' of https://github.com/NousResearch/atropos into port_many_evals 2025-12-24 11:04:35 +00:00
teknium
abdda3978a more linter nonsense 2025-12-24 11:04:33 +00:00
pre-commit-ci[bot]
fbf1a26559 [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
2025-12-24 10:54:58 +00:00
teknium
f18d46549d fix linter errors 2025-12-24 10:53:45 +00:00
pre-commit-ci[bot]
afab28dfa9 [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
2025-12-24 10:48:24 +00:00
teknium
ef9c0c3699 Port many benchmarks into atropos 2025-12-24 10:23:16 +00:00
pre-commit-ci[bot]
127b5736a5 [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
2025-08-28 18:08:26 +00:00
Jai Suphavadeeprasit
7462f45447 sampling params 2025-08-28 14:07:29 -04:00
Jai Suphavadeeprasit
3944e7ef9b linting 2025-08-28 12:54:08 -04:00
Jai Suphavadeeprasit
1bfe294414 Other major changes 2025-08-28 12:24:08 -04:00
Jai Suphavadeeprasit
ec09a1caee Other major changes 2025-08-28 12:04:42 -04:00
Jai Suphavadeeprasit
b56d03b25c changes linting 2025-08-28 03:53:12 -04:00
Jai Suphavadeeprasit
f6f3c04313 organized 2025-08-28 03:35:41 -04:00
Jai Suphavadeeprasit
0bcc406b02 race conditions 2025-08-28 03:35:41 -04:00
Jai Suphavadeeprasit
53710e95ec min@ 2025-08-28 03:35:41 -04:00
pre-commit-ci[bot]
dec92b2a6e [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
2025-08-19 16:30:37 +00:00
Jai Suphavadeeprasit
6266748027 Other linting 2025-08-19 12:20:33 -04:00
Jai Suphavadeeprasit
4d404c0be6 os 2025-08-19 12:05:04 -04:00
Jai Suphavadeeprasit
aac9f5a926 linting 2025-08-19 12:03:13 -04:00
pre-commit-ci[bot]
c1d97b85a3 [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
2025-08-19 12:03:13 -04:00
Jai Suphavadeeprasit
8b55815e2f Linting fixes 2025-08-19 12:03:13 -04:00
pre-commit-ci[bot]
750489493f [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
2025-08-19 12:03:13 -04:00
Jai Suphavadeeprasit
f76f9d1596 cleanup 2025-08-19 12:03:13 -04:00
pre-commit-ci[bot]
62b72589c6 [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
2025-08-19 12:03:13 -04:00
Jai Suphavadeeprasit
e55a7a0100 add_danger 2025-08-19 12:03:13 -04:00
teknium
bed7ddcb95 add more default categories 2025-08-19 12:03:13 -04:00
teknium
39f0103313 fix dataset 2025-08-19 12:03:13 -04:00
teknium
ff7a2569dc update default max_toks 2025-08-19 12:03:13 -04:00
teknium
69135320b4 initial refusalbenchv2 2025-08-19 12:03:13 -04:00
pre-commit-ci[bot]
65aea8bb21 [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
2025-07-30 15:10:36 +00:00
teknium
75f1cf6d2a move eval envs to eval_environments and update readmes 2025-07-30 15:09:34 +00:00