Commit graph

963 commits

Author SHA1 Message Date
Allan Niemerg
1a2551c812 fixed formatting for HTML inclusion 2025-09-08 11:22:08 -05:00
Allan Niemerg
265e4cd69f working HTML writing 2025-09-08 11:22:08 -05:00
Allan Niemerg
8997a1d750 working environment 2025-09-08 11:22:08 -05:00
Allan Niemerg
374f63acc0 remove unneeded dataset utils 2025-09-08 11:22:08 -05:00
Allan Niemerg
86473f9551 currently making complete rollouts 2025-09-08 11:22:08 -05:00
Allan Niemerg
64a82c4b4f Fix BLEUBERI environment server integration 2025-09-08 11:22:08 -05:00
Allan Niemerg
3109fe349b Update BLEUBERI README with OpenAI API instructions and remove redundant reward functions 2025-09-08 11:22:08 -05:00
Allan Niemerg
a520f5f663 Integrate BLEUBERI as a submodule with direct import of reference-based reward functions. 2025-09-08 11:22:08 -05:00
Allan Niemerg
5bb5bd2c3d Add BLEUBERI environment for reference-based RL 2025-09-08 11:21:27 -05:00
Teknium
3f6015e622 Update README.md
Remove redundant information
2025-09-08 02:57:23 -07:00
dmahan93
6ac597fbe7 Merge pull request #245 from prestoalvarez/main
fix typo in variable name
2025-09-04 11:10:55 -05:00
dmahan93
de2da64aea Merge pull request #246 from NousResearch/pre-commit-ci-update-config
[pre-commit.ci] pre-commit autoupdate
2025-09-04 09:52:12 -05:00
pre-commit-ci[bot]
168b3eb5e1 [pre-commit.ci] pre-commit autoupdate
updates:
- [github.com/astral-sh/ruff-pre-commit: v0.12.10 → v0.12.11](https://github.com/astral-sh/ruff-pre-commit/compare/v0.12.10...v0.12.11)
2025-09-01 16:39:22 +00:00
Alvarez
bad4fb84df Update plot.py 2025-08-30 19:22:57 +02:00
Teknium
ff64d0cf48 Merge pull request #243 from NousResearch/revert-223-fix-multiple-scored-data-groups
Revert "Fix multiple scored data groups"
2025-08-29 01:40:44 -07:00
shannonsands
1a808e2038 Revert "Fix multiple scored data groups (#223)"
This reverts commit 67b3144113.
2025-08-29 17:55:45 +10:00
shannonsands
67b3144113 Fix multiple scored data groups (#223)
* removed changes to other files
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* fail on scores empty
---------
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2025-08-29 15:47:32 +10:00
J-SUPHA
c6dfb26064 Merge pull request #242 from NousResearch/refusalbench-v2
Refusalbench v2
2025-08-28 15:03:17 -04:00
pre-commit-ci[bot]
127b5736a5 [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
2025-08-28 18:08:26 +00:00
Jai Suphavadeeprasit
7462f45447 sampling params 2025-08-28 14:07:29 -04:00
Jai Suphavadeeprasit
3944e7ef9b linting 2025-08-28 12:54:08 -04:00
Jai Suphavadeeprasit
1bfe294414 Other major changes 2025-08-28 12:24:08 -04:00
Jai Suphavadeeprasit
ec09a1caee Other major changes 2025-08-28 12:04:42 -04:00
Jai Suphavadeeprasit
b56d03b25c changes linting 2025-08-28 03:53:12 -04:00
Jai Suphavadeeprasit
f6f3c04313 organized 2025-08-28 03:35:41 -04:00
Jai Suphavadeeprasit
0bcc406b02 race conditions 2025-08-28 03:35:41 -04:00
Jai Suphavadeeprasit
53710e95ec min@ 2025-08-28 03:35:41 -04:00
dmahan93
1bb9235d46 Merge pull request #241 from NousResearch/pre-commit-ci-update-config
[pre-commit.ci] pre-commit autoupdate
2025-08-25 13:07:15 -05:00
pre-commit-ci[bot]
6ca3d2ea71 [pre-commit.ci] pre-commit autoupdate
updates:
- [github.com/astral-sh/ruff-pre-commit: v0.12.9 → v0.12.10](https://github.com/astral-sh/ruff-pre-commit/compare/v0.12.9...v0.12.10)
2025-08-25 16:38:28 +00:00
Teknium
ee8094d697 Merge pull request #239 from NousResearch/refusalbench-v2
Refusalbench v2
2025-08-20 00:27:23 -07:00
pre-commit-ci[bot]
dec92b2a6e [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
2025-08-19 16:30:37 +00:00
Jai Suphavadeeprasit
6266748027 Other linting 2025-08-19 12:20:33 -04:00
Jai Suphavadeeprasit
4d404c0be6 os 2025-08-19 12:05:04 -04:00
Jai Suphavadeeprasit
aac9f5a926 linting 2025-08-19 12:03:13 -04:00
pre-commit-ci[bot]
c1d97b85a3 [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
2025-08-19 12:03:13 -04:00
Jai Suphavadeeprasit
8b55815e2f Linting fixes 2025-08-19 12:03:13 -04:00
pre-commit-ci[bot]
750489493f [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
2025-08-19 12:03:13 -04:00
Jai Suphavadeeprasit
f76f9d1596 cleanup 2025-08-19 12:03:13 -04:00
pre-commit-ci[bot]
62b72589c6 [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
2025-08-19 12:03:13 -04:00
Jai Suphavadeeprasit
e55a7a0100 add_danger 2025-08-19 12:03:13 -04:00
teknium
bed7ddcb95 add more default categories 2025-08-19 12:03:13 -04:00
teknium
39f0103313 fix dataset 2025-08-19 12:03:13 -04:00
teknium
ff7a2569dc update default max_toks 2025-08-19 12:03:13 -04:00
teknium
69135320b4 initial refusalbenchv2 2025-08-19 12:03:13 -04:00
hjc-puro
8c3ea257cd Merge pull request #235 from NousResearch/bibtex
Update bibtex
2025-08-18 13:39:55 -04:00
dmahan93
83003d0988 Merge pull request #238 from NousResearch/pre-commit-ci-update-config
[pre-commit.ci] pre-commit autoupdate
2025-08-18 12:39:04 -05:00
pre-commit-ci[bot]
5b1fb70132 [pre-commit.ci] pre-commit autoupdate
updates:
- [github.com/astral-sh/ruff-pre-commit: v0.12.8 → v0.12.9](https://github.com/astral-sh/ruff-pre-commit/compare/v0.12.8...v0.12.9)
2025-08-18 16:39:43 +00:00
dmahan93
4e3ad29fac Merge pull request #237 from NousResearch/log-errors-from-collect-trajectories
add error logging to collect_trajectories so they don't fail silently
2025-08-15 16:56:37 -05:00
Dakota
11f1303da0 add error logging to collect_trajectories so they don't fail silently 2025-08-15 16:34:21 -05:00
dmahan93
628bd3d2ad Merge pull request #236 from brawncode/patch-1
fix: division-by-zero in gradient calculation
2025-08-14 13:18:39 -05:00