Commit graph

423 commits

Author SHA1 Message Date
Shannon Sands
3d15f0482c linting 2025-05-27 08:59:03 +10:00
Shannon Sands
8b09ace467 Linting, move env to community 2025-05-27 08:53:06 +10:00
Shannon Sands
67e057b13c Merge remote-tracking branch 'ecsbeats/main' into merge-ecsbeats-contributions
# Conflicts:
#	uv.lock
2025-05-27 08:41:14 +10:00
hjc-puro
4b9e2106b0
Delete environments/.DS_Store 2025-05-26 18:29:10 -04:00
Teknium
3e6247b26e
Merge pull request #106 from NousResearch/feat/swe-rl-environment-v2
Feat/swe rl environment v2
2025-05-26 12:47:11 -07:00
teknium1
2ddc7f39cd few small defaults changes 2025-05-26 12:43:57 -07:00
teknium1
b2bb61f8cd make eval stuff clearer 2025-05-26 01:56:45 -07:00
teknium1
e2ea82b29b Fix up dataset and data dumps 2025-05-26 01:50:22 -07:00
Shannon Sands
de3bf7c505 linting 2025-05-26 16:59:39 +10:00
Shannon Sands
98d9ef87a2 Merge remote-tracking branch 'slyracoon23/main' into merge-slyracoon23-contributions
# Conflicts:
#	.gitignore
2025-05-26 16:23:41 +10:00
Shannon Sands
cd77eab79e linting 2025-05-26 16:12:08 +10:00
Shannon Sands
38ffb0ebc4 moved to community 2025-05-26 16:11:20 +10:00
Shannon Sands
a70a8d7086 Merge remote-tracking branch 'tsadpbb/main' into merge-tsadpbb-contributions 2025-05-26 16:01:43 +10:00
Shannon Sands
bf12e7df15 linting, moved env, updated contrib credit 2025-05-26 14:35:16 +10:00
Shannon Sands
81d1ebeaef Merge remote-tracking branch 'arihanv/dev' into merge-arihanv-contributions 2025-05-26 14:18:44 +10:00
Shannon Sands
84e4654795 Fix trailing whitespace and formatting issues in quantum environment documentation - Remove trailing whitespace from code blocks and documentation - Fix end-of-file formatting in README.md - Ensure all pre-commit checks pass for workflow compatibility 2025-05-26 14:14:36 +10:00
Shannon Sands
b845c635d4 linted, moved to community folder 2025-05-26 14:10:26 +10:00
Shannon Sands
20c6e9d8d7 Merge remote-tracking branch 'jeannemtl/hack/env_quant' into merge-jeannemtl-contributions 2025-05-26 13:52:36 +10:00
Shannon Sands
aff033443f linting 2025-05-26 13:42:14 +10:00
Shannon Sands
7cfd3af149 Integrate Caput Mundi poker environment from yoniebans - Add Six-Seat No-Limit Hold'em poker training environment - Features expert hand history training with dual reward system - Includes action matching and bet sizing evaluation components - Supports multi-stage game analysis (preflop/flop/turn/river) - Integrates with HuggingFace datasets and WandB monitoring - Comprehensive documentation added to community README (#17) - All code quality checks passing (black, isort, flake8) Environment moved from hack0/poker to environments/community/poker_holdem/ Resolves PR #84 from yoniebans/atropos 2025-05-26 13:38:49 +10:00
Shannon Sands
04c06c3e20 Merge remote-tracking branch 'yoniebans/main' into merge-yoniebans-contributions 2025-05-26 13:33:36 +10:00
Shannon Sands
0f61c9dbde moved to community folder 2025-05-26 13:27:43 +10:00
Shannon Sands
a17dbdfedc Merge remote-tracking branch 'metonym/deepsacrifice' into merge-metonym-contributions 2025-05-26 13:22:22 +10:00
Shannon Sands
3707ac939f linting 2025-05-26 13:03:23 +10:00
Shannon Sands
bc1f85619f Fixed linting issues 2025-05-26 12:59:55 +10:00
Shannon Sands
f17c07c823 Merge remote-tracking branch 'justin5764/LeanRL' into merge-justin5764-contributions 2025-05-26 12:47:52 +10:00
Shannon Sands
5551580170 resolved conflicts 2025-05-26 12:42:37 +10:00
Shannon Sands
5d22d360e2 Add Solitaire Winning Probability Environment - Mathematical probability analysis environment for training LLMs - Combines theoretical formula derivation with Monte Carlo simulation - Supports various solitaire-style card games - Includes sophisticated reward system with relative error calculation - All API keys removed for security - Comprehensive documentation added to community README - Author: davidedipeppe, PR: #88 2025-05-26 12:36:24 +10:00
Shannon Sands
d789128f20 Fix final code quality issues in Conversational Style DPO environment 2025-05-26 10:48:11 +10:00
Shannon Sands
441fd1036d Merge Karthik-Ragunath conversational style DPO environment contribution 2025-05-26 10:25:08 +10:00
Shannon Sands
c2c4928882 Fix final line length violations in Pokemon Showdown environment 2025-05-26 10:15:32 +10:00
Shannon Sands
0038a710d0 merging 2025-05-26 09:58:38 +10:00
Shannon Sands
c360ee20e7 linting 2025-05-26 09:39:51 +10:00
Shannon Sands
65108d12b2 Linting done 2025-05-26 09:28:23 +10:00
Shannon Sands
a58562447f Merge branch 'joshuajerin-selcube' into merge-joshuajerin-contributions 2025-05-26 09:07:25 +10:00
Shannon Sands
abab64e8dc full commit 2025-05-26 08:48:33 +10:00
Shannon Sands
129b310593 Integrate JakeBoggs punchline VR-CLI environment - Add Punchline VR-CLI environment for training humor understanding using VR-CLI methodology - Moved from environments/hack0/punchlines to environments/community/punchline_vrcli - Updated community README with comprehensive environment description - Fixed linting issues and formatted code per project standards - Credit: @JakeBoggs 2025-05-26 08:45:45 +10:00
Shannon Sands
c3e2046a20 Merge branch 'JakeBoggs-punchline' into merge-jakeboggs-contributions 2025-05-26 08:33:37 +10:00
Teknium
4b532da35e
Merge pull request #114 from leopardracer/main
Improve API Server Documentation and Update UFC Prediction Output Format
2025-05-25 01:34:48 -07:00
Shannon Sands
0c4c3e1e6c linting 2025-05-24 14:43:24 +10:00
Shannon Sands
47c42bdc72 linting 2025-05-24 14:37:24 +10:00
Shannon Sands
160abf8574 Integrate krishpop's Cat Behavior Communication Environment - Merged cat behavior environment from krishpop:main - Moved cat files from environments/ to environments/community/cat_behavior_env/ - Fixed file paths for cat_behaviors.json and cat_scenarios.json - Removed unused imports and fixed all linting issues - Updated community README with comprehensive cat environment description - Credited author @krishpop with GitHub link 2025-05-24 14:21:58 +10:00
Shannon Sands
f399e3513f Merge remote-tracking branch 'krishpop/main' into merge-krishpop-contributions 2025-05-24 13:54:43 +10:00
Shannon Sands
95bec5e7a8 Integrate RoshanSanjeev's ExamCraft environment - Merged ExamCraft environment from RoshanSanjeev PR #95 - Moved from environments/hack0/ to environments/community/ - Removed demo_artifacts.tar.gz file to avoid repo clutter - Updated community README with comprehensive ExamCraft description - Fixed all linting issues (flake8, black, isort) - Credited author @RoshanSanjeev with GitHub link 2025-05-24 13:48:52 +10:00
Shannon Sands
455fbd053c Merge branch 'RoshanSanjeev-examcraft' into merge-roshansanjeev-contributions 2025-05-24 13:39:03 +10:00
Shannon Sands
32cf5e3d42 Integrate joshgarza's accessibility environment - Merged accessibility environment from joshgarza:main - Moved from environments/hack0/ to environments/community/ - Updated community README with detailed description of accessibility auto-fixer - Added note about missing dataset file - Credited author @joshgarza with GitHub link 2025-05-24 13:31:50 +10:00
Shannon Sands
30ddc8a36d Merge remote-tracking branch 'joshgarza/main' into merge-joshgarza-contributions 2025-05-24 13:29:23 +10:00
teknium1
ae0340bb9f prevent token explosion issue by reducing max_token to 15k instead of 16k 2025-05-23 18:09:36 -07:00
teknium1
1fa798a69e Making saving data optional in config, add scores to saved data 2025-05-23 14:11:11 -07:00
teknium1
a20886d720 fix many many things jules didnt do right 2025-05-23 12:50:38 -07:00