1320 commits

Author SHA1 Message Date
Dakota
522e049d27 Remove unused config_handler.py and its import
- Deleted config_handler.py which had unused torch import
- Cleaned up utils/__init__.py to remove ConfigHandler import
2025-06-04 10:21:46 -05:00
Dakota
371d86d9a4 Remove torch from main dependencies and uv workspace config
- Move torch to optional dependencies only (under rewardfns)
- Remove uv.workspace configuration
- Reduces install size and CI time significantly
2025-06-04 10:13:15 -05:00
Dakota
8833c6e04d Fix end-of-file newline for pre-commit 2025-06-04 10:08:33 -05:00
Dakota
bcef1eac81 Add GitHub Actions workflow for pytest on Python 3.10 and 3.12
🤖 Generated with [Claude Code](https://claude.ai/code)

Co-Authored-By: Claude <noreply@anthropic.com>
2025-06-04 10:00:24 -05:00
Teknium
eaf1e8e667
Merge pull request #157 from NousResearch/add-struct-output-pydantic-validator-environment
Add Pydantic Schema to Structured Output Environment
2025-06-04 02:10:44 -07:00
hjc-puro
599a5ec824
Merge pull request #156 from NousResearch/process-defaults
Remove process defaults and respect config_init
2025-06-04 07:51:55 +08:00
teknium1
49adab3f4b Cleanup 2025-06-03 16:10:59 -07:00
teknium1
aa3d01e5ea Add YAML and TOML support, experimental XML support, add multi-format randomizer 2025-06-03 15:06:21 -07:00
hjc-puro
b5e7746c99 remove process defaults, respect config init 2025-06-02 21:19:45 -04:00
teknium1
e359a72b06 add pydantic schema environment 2025-06-02 16:58:12 -07:00
dmahan93
0f8b60c119
Merge pull request #155 from NousResearch/fix-api-spec-messages-advantage-param
Align ScoredData model between API and base.py
2025-06-02 17:39:38 -05:00
dmahan93
8f3b603ac0
Merge pull request #153 from kilavvy/main
Fix Typos in Comments and Documentation
2025-06-02 17:32:53 -05:00
dmahan93
4a21ed0891 Enhance ScoredData model and API documentation
- Added optional fields: advantages, messages, and images to the ScoredData model.
- Updated API responses to include these new fields when no data is available.
- Revised README.md to reflect changes in the API structure and response format.
2025-06-02 17:28:25 -05:00
dmahan93
46a43a89bf
Merge pull request #154 from NousResearch/split-large-n-in-server-manager
Add max_n_completions parameter to ServerManager for load balancing
2025-06-02 13:03:33 -05:00
dmahan93
8eab09a98b update config md 2025-06-02 11:16:10 -05:00
dmahan93
44b96c7b6c Add max_n_completions parameter to ServerManager for handling multiple completions
- Introduced max_n_completions configuration to limit the number of completions requested per server call.
- Updated chat_completion and completion methods to split requests exceeding max_n_completions into multiple calls, merging results accordingly.
- Enhanced documentation for max_n_completions in ServerManagerConfig.
2025-06-02 11:11:55 -05:00
kilavvy
39261643dc
Update open_robot_env.py 2025-06-02 13:56:48 +02:00
kilavvy
33b8e39599
Update README.md 2025-06-02 13:55:02 +02:00
Teknium
134a9713ce
Merge pull request #147 from zeevick10/main
Fix Typos in MCP Tool Calling Environment Documentation
2025-06-01 14:24:54 -07:00
Teknium
6d1f029499
Merge pull request #151 from emmanuel-ferdman/main
Fix contribution guide source
2025-06-01 14:24:26 -07:00
Emmanuel Ferdman
2bc9111139
Fix contribution guide source
Signed-off-by: Emmanuel Ferdman <emmanuelferdman@gmail.com>
2025-05-31 13:18:17 -07:00
shannonsands
283877dd88
Intern bootcamp env (#146)
* Created registry and started off the env

* Local testing works

* process working but error in gen

* removed old code

* adding debug, it's still not progressing to collect trajectories

* linting

* removed redundant settings
2025-05-31 11:22:59 +10:00
FT
20a90764ac
Update README.md 2025-05-28 14:33:48 +02:00
shannonsands
ea304892ee
Integrate chinguun101 goofy math (#145)
* Add GoofyMath environment for fun, engaging math learning

* linting, moved to community folder

* linting

---------

Co-authored-by: chinguun101 <chinguun@uni.minerva.edu>
2025-05-28 12:11:02 +10:00
shannonsands
1a79132809
Integrate michaelwaves options iv (#144)
* options iv agent

* bug fix

* outputs

* linted and moved to community folder

* linting

---------

Co-authored-by: michaelwaves <michaelyu713705@gmail.com>
2025-05-28 10:57:24 +10:00
fuder.eth
1862b193ee
Update README.md (#118) 2025-05-28 10:24:12 +10:00
shannonsands
f21154ff49
Integrate aniemerg wikipedia (#143)
* initial commit

* initial draft of wikipedia article creation environment

* add openai for rollouts, update requirements, create script to run, etc.

* add configuration, add debugging, fix tool calls, prevent wikipedia access

* now creates html file

* fix output for html page

* check in Claude plan

* fixed formatting and other issues

* add zip file

* update README

* linting, moved to community folder

* linting

* linting

* linting

* linting

---------

Co-authored-by: Allan Niemerg <niemerg@gmail.com>
2025-05-28 10:22:11 +10:00
shannonsands
b774e97215
Integrate subrahmanyam cybersecurity (#142)
* cybersecurity env for offline RL trajectories

* output file addition

* jsonl outputs

* code cleanup

* pulled out outputs and fixing .gitignore

* removed zip file

* gitignore typo fix

* Integrate cybersecurity Sigma rule generation environment

---------

Co-authored-by: Subrahmanyam Arunachalam <subrahmanyam.arunachalam@FVFGK0VTQ05P.local>
2025-05-28 08:41:51 +10:00
Allan Niemerg
bf4d84839b update README 2025-05-27 11:58:54 -05:00
Allan Niemerg
0d54b3e83e add automatic dataset load 2025-05-27 11:57:17 -05:00
Allan Niemerg
013090579d fix imports and style issues 2025-05-27 11:00:35 -05:00
shannonsands
b33070f56b
Merge pull request #140 from NousResearch/integrate-fahrenheitresearch-meteorology
Integrate fahrenheitresearch meteorology
2025-05-26 23:37:14 -07:00
Shannon Sands
d2fb30c4d0 linting 2025-05-27 16:36:14 +10:00
Shannon Sands
6acb4cc91a linting 2025-05-27 16:34:52 +10:00
Shannon Sands
e37e02a94c linting, moved env to community folder 2025-05-27 16:30:11 +10:00
Shannon Sands
a403b16ec4 commiting 2025-05-27 16:18:26 +10:00
shannonsands
9527aa05df
Merge pull request #139 from NousResearch/integrate-kirilligum-consumer-journey
Integrate kirilligum consumer journey
2025-05-26 23:08:39 -07:00
Shannon Sands
a6ac7a3e42 linting and moved to community 2025-05-27 16:04:50 +10:00
Shannon Sands
8df34efc56 Resolve merge conflict in .gitignore 2025-05-27 15:56:22 +10:00
shannonsands
d232b0fd17
Merge pull request #58 from leehanchung/patch-1
docs: update README.md in atroposlib/env/README.md
2025-05-26 22:48:39 -07:00
shannonsands
ed986fe43a
Merge pull request #138 from NousResearch/integrate-basedlsg-padres-spatial
Integrate basedlsg padres spatial
2025-05-26 22:47:41 -07:00
Shannon Sands
8f08a398c0 linting 2025-05-27 15:46:10 +10:00
Shannon Sands
a353bb6eb1 linting 2025-05-27 15:45:07 +10:00
Shannon Sands
e7e747a396 linting 2025-05-27 15:43:12 +10:00
Shannon Sands
2efb690a24 linting, moved to community 2025-05-27 15:36:24 +10:00
shannonsands
e3e94d1dd1
Merge pull request #137 from NousResearch/integrate-caradmico-starmap-compression
Integrate caradmico starmap compression
2025-05-26 22:13:17 -07:00
Shannon Sands
33d5e4a25e linting 2025-05-27 15:12:14 +10:00
Shannon Sands
eba7aac72b linting, moving files into community subdirectory 2025-05-27 15:08:30 +10:00
shannonsands
dab8bb87c4
Merge pull request #136 from NousResearch/integrate-rahulschand-openvla
Integrate rahulschand openvla
2025-05-26 21:07:32 -07:00
Shannon Sands
d3803f62f2 Fix trailing whitespace in community README 2025-05-27 13:58:40 +10:00