ropresearch
c3fc68879c
group temps, sample temps, and logprob api params
2025-09-25 16:41:58 -04:00
dmahan93
50d99de500
Merge pull request #250 from DeVikingMark/devikingmark
...
refactor(api): improve attribute checking and remove hardcoded values
2025-09-25 11:14:55 -05:00
dmahan93
efc6b55f0a
Merge pull request #251 from NousResearch/pre-commit-ci-update-config
...
[pre-commit.ci] pre-commit autoupdate
2025-09-25 11:00:05 -05:00
pre-commit-ci[bot]
94d597fc5f
[pre-commit.ci] pre-commit autoupdate
...
updates:
- [github.com/psf/black: 25.1.0 → 25.9.0](https://github.com/psf/black/compare/25.1.0...25.9.0 )
- [github.com/astral-sh/ruff-pre-commit: v0.13.0 → v0.13.1](https://github.com/astral-sh/ruff-pre-commit/compare/v0.13.0...v0.13.1 )
2025-09-22 16:41:25 +00:00
pre-commit-ci[bot]
e02d2c373e
[pre-commit.ci] auto fixes from pre-commit.com hooks
...
for more information, see https://pre-commit.ci
2025-09-21 22:33:59 +00:00
Ragnar
60addb9a7d
Update server.py
2025-09-22 00:32:39 +02:00
dmahan93
4380dc41d2
Merge pull request #249 from NousResearch/pre-commit-ci-update-config
...
[pre-commit.ci] pre-commit autoupdate
2025-09-17 22:47:52 -05:00
pre-commit-ci[bot]
34cabbb30f
[pre-commit.ci] auto fixes from pre-commit.com hooks
...
for more information, see https://pre-commit.ci
2025-09-15 16:41:26 +00:00
pre-commit-ci[bot]
030118dd31
[pre-commit.ci] pre-commit autoupdate
...
updates:
- [github.com/astral-sh/ruff-pre-commit: v0.12.12 → v0.13.0](https://github.com/astral-sh/ruff-pre-commit/compare/v0.12.12...v0.13.0 )
2025-09-15 16:40:11 +00:00
dmahan93
5738941e21
Merge pull request #175 from aniemerg/environments/bleuberi
...
WIP: Environments/bleuberi
2025-09-12 12:09:13 -05:00
dmahan93
89b59d489f
Merge branch 'main' into environments/bleuberi
2025-09-12 12:06:18 -05:00
dmahan93
02e2dcd49a
Merge pull request #160 from interstellarninja/feat/multiturn_tool_use_env
...
Multi-Turn Tool-Use RL Environment
2025-09-10 19:43:42 -05:00
dmahan93
37e8720433
Merge pull request #248 from NousResearch/pre-commit-ci-update-config
...
[pre-commit.ci] pre-commit autoupdate
2025-09-10 19:43:19 -05:00
pre-commit-ci[bot]
9d7c2772af
[pre-commit.ci] auto fixes from pre-commit.com hooks
...
for more information, see https://pre-commit.ci
2025-09-08 19:45:00 +00:00
Allan Niemerg
0f6c06bb56
Move BLEUBERI environment to community folder
...
- Moved environments/bleuberi to environments/community/bleuberi
- Updated .gitmodules to reflect new submodule path
- Fixed pre-commit formatting issues
- Cleaned up test output files
2025-09-08 14:38:43 -05:00
pre-commit-ci[bot]
90a870af17
[pre-commit.ci] pre-commit autoupdate
...
updates:
- [github.com/astral-sh/ruff-pre-commit: v0.12.11 → v0.12.12](https://github.com/astral-sh/ruff-pre-commit/compare/v0.12.11...v0.12.12 )
2025-09-08 16:39:45 +00:00
Allan Niemerg
532024d01e
remove unnecessary code, change log level
2025-09-08 11:22:08 -05:00
Allan Niemerg
1a2551c812
fixed formatting for HTML inclusion
2025-09-08 11:22:08 -05:00
Allan Niemerg
265e4cd69f
working HTML writing
2025-09-08 11:22:08 -05:00
Allan Niemerg
8997a1d750
working environment
2025-09-08 11:22:08 -05:00
Allan Niemerg
374f63acc0
remove unneeded dataset utils
2025-09-08 11:22:08 -05:00
Allan Niemerg
86473f9551
currently making complete rollouts
2025-09-08 11:22:08 -05:00
Allan Niemerg
64a82c4b4f
Fix BLEUBERI environment server integration
2025-09-08 11:22:08 -05:00
Allan Niemerg
3109fe349b
Update BLEUBERI README with OpenAI API instructions and remove redundant reward functions
2025-09-08 11:22:08 -05:00
Allan Niemerg
a520f5f663
Integrate BLEUBERI as a submodule with direct import of reference-based reward functions.
2025-09-08 11:22:08 -05:00
Allan Niemerg
5bb5bd2c3d
Add BLEUBERI environment for reference-based RL
2025-09-08 11:21:27 -05:00
Teknium
3f6015e622
Update README.md
...
Remove redundant information
2025-09-08 02:57:23 -07:00
dmahan93
6ac597fbe7
Merge pull request #245 from prestoalvarez/main
...
fix typo in variable name
2025-09-04 11:10:55 -05:00
dmahan93
de2da64aea
Merge pull request #246 from NousResearch/pre-commit-ci-update-config
...
[pre-commit.ci] pre-commit autoupdate
2025-09-04 09:52:12 -05:00
pre-commit-ci[bot]
168b3eb5e1
[pre-commit.ci] pre-commit autoupdate
...
updates:
- [github.com/astral-sh/ruff-pre-commit: v0.12.10 → v0.12.11](https://github.com/astral-sh/ruff-pre-commit/compare/v0.12.10...v0.12.11 )
2025-09-01 16:39:22 +00:00
Alvarez
bad4fb84df
Update plot.py
2025-08-30 19:22:57 +02:00
Teknium
ff64d0cf48
Merge pull request #243 from NousResearch/revert-223-fix-multiple-scored-data-groups
...
Revert "Fix multiple scored data groups"
2025-08-29 01:40:44 -07:00
shannonsands
1a808e2038
Revert "Fix multiple scored data groups ( #223 )"
...
This reverts commit 67b3144113 .
2025-08-29 17:55:45 +10:00
shannonsands
67b3144113
Fix multiple scored data groups ( #223 )
...
* removed changes to other files
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* fail on scores empty
---------
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2025-08-29 15:47:32 +10:00
J-SUPHA
c6dfb26064
Merge pull request #242 from NousResearch/refusalbench-v2
...
Refusalbench v2
2025-08-28 15:03:17 -04:00
pre-commit-ci[bot]
127b5736a5
[pre-commit.ci] auto fixes from pre-commit.com hooks
...
for more information, see https://pre-commit.ci
2025-08-28 18:08:26 +00:00
Jai Suphavadeeprasit
7462f45447
sampling params
2025-08-28 14:07:29 -04:00
Jai Suphavadeeprasit
3944e7ef9b
linting
2025-08-28 12:54:08 -04:00
Jai Suphavadeeprasit
1bfe294414
Other major changes
2025-08-28 12:24:08 -04:00
Jai Suphavadeeprasit
ec09a1caee
Other major changes
2025-08-28 12:04:42 -04:00
Jai Suphavadeeprasit
b56d03b25c
changes linting
2025-08-28 03:53:12 -04:00
Jai Suphavadeeprasit
f6f3c04313
organized
2025-08-28 03:35:41 -04:00
Jai Suphavadeeprasit
0bcc406b02
race conditions
2025-08-28 03:35:41 -04:00
Jai Suphavadeeprasit
53710e95ec
min@
2025-08-28 03:35:41 -04:00
dmahan93
1bb9235d46
Merge pull request #241 from NousResearch/pre-commit-ci-update-config
...
[pre-commit.ci] pre-commit autoupdate
2025-08-25 13:07:15 -05:00
pre-commit-ci[bot]
6ca3d2ea71
[pre-commit.ci] pre-commit autoupdate
...
updates:
- [github.com/astral-sh/ruff-pre-commit: v0.12.9 → v0.12.10](https://github.com/astral-sh/ruff-pre-commit/compare/v0.12.9...v0.12.10 )
2025-08-25 16:38:28 +00:00
Teknium
ee8094d697
Merge pull request #239 from NousResearch/refusalbench-v2
...
Refusalbench v2
2025-08-20 00:27:23 -07:00
pre-commit-ci[bot]
dec92b2a6e
[pre-commit.ci] auto fixes from pre-commit.com hooks
...
for more information, see https://pre-commit.ci
2025-08-19 16:30:37 +00:00
Jai Suphavadeeprasit
6266748027
Other linting
2025-08-19 12:20:33 -04:00
Jai Suphavadeeprasit
4d404c0be6
os
2025-08-19 12:05:04 -04:00