hjc-puro
3e1eba6e92
rename method
2025-07-11 00:39:28 +00:00
hjc-puro
468b599ddb
Merge branch '2025-07-08-evals' of github.com:NousResearch/atropos into 2025-07-08-evals
2025-07-11 00:34:35 +00:00
hjc-puro
da0d64ae89
linting errors
2025-07-11 00:29:57 +00:00
pre-commit-ci[bot]
5aacb1ea9b
[pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
2025-07-11 00:26:48 +00:00
hjc-puro
e601251893
gsm8k eval example
2025-07-11 00:22:36 +00:00
hjc-puro
afe4fc318f
sample level logging
2025-07-10 02:03:07 +00:00
hjc-puro
eb926dc58b
working evals
2025-07-10 01:45:21 +00:00
hjc-puro
f4de3ad6f5
add printing
2025-07-09 23:35:26 +00:00
hjc-puro
a11af27298
add eval saving cli args
2025-07-09 03:12:13 +00:00
hjc-puro
72e75c2b13
Merge pull request #207 from NousResearch/2025-07-07-evaluate
Add evaluate subcommand to cli
2025-07-08 06:53:37 +08:00
pre-commit-ci[bot]
9340791a6c
[pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
2025-07-07 21:53:41 +00:00
hjc-puro
e2bcf27398
evaluate
2025-07-07 17:46:35 -04:00
hjc-puro
e56257311e
describe evaluate subcommand
2025-07-07 17:44:52 -04:00
hjc-puro
5519f190d2
add evaluate subcommand to cli
2025-07-07 17:39:33 -04:00
dmahan93
ecc5eebeca
Merge pull request #167 from paulsengh/feat/pay-to-play-env
feat: add pay-to-play environment with mixture of judges and micropay…
2025-07-07 09:02:46 -05:00
dmahan93
58446dbcb1
Merge pull request #204 from NousResearch/multienv-enforce-mins
Multienv with enforced minimum samples in a batch
2025-07-07 08:53:43 -05:00
Dakota
08e14cc745
feat: add minimum batch allocation support for environments
- Add min_batch_allocation parameter to ensure environments contribute minimum proportion to each batch
- Implement grab_batch_with_minimum_allocations function with proper scaling when allocations exceed 100%
- Add mixed-size group buffering to handle variable-sized data submissions
- Update server to use minimum allocation logic when any env has min_batch_allocation set
- Add comprehensive tests for minimum allocation scenarios
- Update documentation in API README and CONFIG.md
- Update example environments to demonstrate the feature
This feature allows critical environments to guarantee they contribute at least a specified proportion (0.0-1.0) to each training batch, ensuring important data sources are always represented during training.
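The scaling rule described above can be illustrated with a minimal sketch. This is not the repository's `grab_batch_with_minimum_allocations`; the function name `plan_batch_slots`, its arguments, and the floor-rounding choice are assumptions made for illustration only.

```python
from typing import Dict, Optional, Tuple


def plan_batch_slots(
    batch_size: int, min_allocations: Dict[str, Optional[float]]
) -> Tuple[Dict[str, int], int]:
    """Hypothetical helper: reserve batch slots from per-env minimum proportions.

    min_allocations maps an environment name to a proportion in 0.0-1.0,
    or to None when the environment has no guaranteed minimum.
    """
    # Keep only environments that actually request a minimum share.
    mins = {name: p for name, p in min_allocations.items() if p}
    total = sum(mins.values())

    # If the requested minimums exceed 100% of the batch, scale them down
    # proportionally so that together they fill the batch and no more.
    scale = 1.0 / total if total > 1.0 else 1.0

    # Floor each reservation so the reserved total never exceeds batch_size.
    reserved = {name: int(batch_size * p * scale) for name, p in mins.items()}

    # Whatever is left over can be filled by any environment.
    free = batch_size - sum(reserved.values())
    return reserved, free
```

Under these assumptions, `plan_batch_slots(100, {"a": 0.5, "b": 0.25, "c": None})` reserves 50 slots for `a` and 25 for `b`, leaving 25 unreserved; two environments that each request 1.0 are scaled down to 50 slots apiece.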
🤖 Generated with [Claude Code](https://claude.ai/code)
Co-Authored-By: Claude <noreply@anthropic.com>
2025-07-07 08:50:28 -05:00
dmahan93
3b8d8a6f09
Merge pull request #202 from Myashka/main
Include run name in wandb initialization in BaseEnv
2025-07-07 08:05:47 -05:00
dmahan93
4dfc397845
Merge pull request #203 from Myashka/wait_for_sem-fix
Fix infinite loop in wait_for_sem by updating semaphore values inside loop
2025-07-07 07:56:24 -05:00
Alexey Gorbatovski
35c542328a
Fix infinite loop in wait_for_sem by updating semaphore values inside loop
2025-07-06 00:27:45 +03:00
pre-commit-ci[bot]
ee5257522a
[pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
2025-07-04 14:34:37 +00:00
Alexey Gorbatovski
14c70c0e68
Include run name in wandb initialization in BaseEnv
2025-07-04 17:13:34 +03:00
dmahan93
2c8340bece
Merge pull request #201 from eeemmmmmm/main
polishing docs: error fixes for clarity
2025-07-02 06:08:53 -05:00
emmmm
10f1b466b6
Update curriculum.py
2025-07-02 10:22:33 +02:00
dmahan93
38b963817b
Merge pull request #200 from NousResearch/allow-ignore-value-in-max-trainer-len
allow inf (<= 0 max_token_len) generations if trainer requests it
2025-07-01 09:54:19 -05:00
Dakota
683559afd2
allow inf (<= 0 max_token_len) generations if trainer requests it, but raise a warning so that users can check their logs and get info if their trainers are doing something weird
2025-07-01 09:52:10 -05:00
dmahan93
0ff966c276
Merge pull request #198 from NousResearch/pre-commit-ci-update-config
[pre-commit.ci] pre-commit autoupdate
2025-06-30 20:08:45 -05:00
pre-commit-ci[bot]
4d6fbf4fa7
[pre-commit.ci] pre-commit autoupdate
updates:
- [github.com/astral-sh/ruff-pre-commit: v0.12.0 → v0.12.1](https://github.com/astral-sh/ruff-pre-commit/compare/v0.12.0...v0.12.1)
2025-06-30 16:36:06 +00:00
dmahan93
36a362858b
Merge pull request #197 from reallesee/main
Refined wording across the codebase
2025-06-27 15:24:24 -05:00
Micke
af57208da2
fix error in function inference_node_wandb_watcher.py
2025-06-27 22:13:37 +02:00
dmahan93
4769eeb4a6
Merge pull request #196 from reject-i/main
fix dead link README.md
2025-06-25 22:02:35 -05:00
Alex Pikme
ba992757e1
fix dead link README.md
2025-06-25 10:51:21 +02:00
Dakota
838eb94413
fix weird folder
2025-06-23 20:25:36 -05:00
dmahan93
d81da03a11
Merge pull request #193 from NousResearch/pre-commit-ci-update-config
[pre-commit.ci] pre-commit autoupdate
2025-06-23 20:18:08 -05:00
pre-commit-ci[bot]
3c96108521
[pre-commit.ci] pre-commit autoupdate
updates:
- [github.com/astral-sh/ruff-pre-commit: v0.11.13 → v0.12.0](https://github.com/astral-sh/ruff-pre-commit/compare/v0.11.13...v0.12.0)
- [github.com/pycqa/flake8: 7.2.0 → 7.3.0](https://github.com/pycqa/flake8/compare/7.2.0...7.3.0)
2025-06-23 16:37:27 +00:00
dmahan93
5b2b5e9947
Merge pull request #192 from rnkrtt/main
Fix typo in author name Gurning -> Gurung in community README
2025-06-23 10:15:58 -05:00
Merkel Tranjes
af1c98d7a8
Update README.md
2025-06-23 16:23:02 +02:00
Teknium
f30453514b
Merge pull request #191 from zeroprooff/main
fix duplicate plot.py
2025-06-23 07:16:09 -07:00
Tomass
14917440db
fix duplicate plot.py
2025-06-23 15:32:54 +02:00
Teknium
8bf0312b8a
Merge pull request #190 from crStiv/a
fix: multiple typos of different importance
2025-06-22 13:15:39 -07:00
Jeremy Melvin
3bed7c64b9
Ethereum Virtual Machine Text to Transaction Environment ( #187 )
* EVM-text_to_transaction
* update structure
* Update README
---------
Co-authored-by: Jeremy Melvin <jeremy@openblocklabs.com>
2025-06-20 09:16:00 +10:00
crStiv
b65b614132
Update hpo.py
2025-06-19 22:59:42 +02:00
crStiv
e934094173
Update helpers.py
2025-06-19 22:52:43 +02:00
crStiv
e9a547ce32
Update base.py
2025-06-19 22:52:26 +02:00
dmahan93
d0a253e1b5
Merge pull request #186 from NousResearch/pre-commit-ci-update-config
[pre-commit.ci] pre-commit autoupdate
2025-06-19 10:15:03 -05:00
dmahan93
5a8af66398
Merge pull request #188 from leopardracer/main
Fix Typos in Documentation and Configuration Comments
2025-06-19 10:14:41 -05:00
leopardracer
0d6297ad35
Update default.yaml
2025-06-18 22:23:15 +03:00
leopardracer
117783f5d5
Update README.md
2025-06-18 22:22:38 +03:00
pre-commit-ci[bot]
f23097c024
[pre-commit.ci] pre-commit autoupdate
updates:
- [github.com/astral-sh/ruff-pre-commit: v0.11.12 → v0.11.13](https://github.com/astral-sh/ruff-pre-commit/compare/v0.11.12...v0.11.13)
2025-06-16 16:35:35 +00:00
Teknium
202ecff996
Merge pull request #170 from NousResearch/add-format-following-environment
Add format following environment
2025-06-16 06:50:17 -07:00