Teknium
134a9713ce
Merge pull request #147 from zeevick10/main
...
Fix Typos in MCP Tool Calling Environment Documentation
2025-06-01 14:24:54 -07:00
Emmanuel Ferdman
2bc9111139
Fix contribution guide source
...
Signed-off-by: Emmanuel Ferdman <emmanuelferdman@gmail.com>
2025-05-31 13:18:17 -07:00
FT
20a90764ac
Update README.md
2025-05-28 14:33:48 +02:00
shannonsands
ea304892ee
Integrate chinguun101 goofy math ( #145 )
...
* Add GoofyMath environment for fun, engaging math learning
* linting, moved to community folder
* linting
---------
Co-authored-by: chinguun101 <chinguun@uni.minerva.edu>
2025-05-28 12:11:02 +10:00
shannonsands
1a79132809
Integrate michaelwaves options iv ( #144 )
...
* options iv agent
* bug fix
* outputs
* linted and moved to community folder
* linting
---------
Co-authored-by: michaelwaves <michaelyu713705@gmail.com>
2025-05-28 10:57:24 +10:00
shannonsands
f21154ff49
Integrate aniemerg wikipedia ( #143 )
...
* initial commit
* initial draft of wikipedia article creation environment
* add openai for rollouts, update requirements, create script to run, etc.
* add configuration, add debugging, fix tool calls, prevent wikipedia access
* now creates html file
* fix output for html page
* check in Claude plan
* fixed formatting and other issues
* add zip file
* update README
* linting, moved to community folder
* linting
* linting
* linting
* linting
---------
Co-authored-by: Allan Niemerg <niemerg@gmail.com>
2025-05-28 10:22:11 +10:00
shannonsands
b774e97215
Integrate subrahmanyam cybersecurity ( #142 )
...
* cybersecurity env for offline RL trajectories
* output file addition
* jsonl outputs
* code cleanup
* pulled out outputs and fixing .gitignore
* removed zip file
* gitignore typo fix
* Integrate cybersecurity Sigma rule generation environment
---------
Co-authored-by: Subrahmanyam Arunachalam <subrahmanyam.arunachalam@FVFGK0VTQ05P.local>
2025-05-28 08:41:51 +10:00
Shannon Sands
d2fb30c4d0
linting
2025-05-27 16:36:14 +10:00
Shannon Sands
6acb4cc91a
linting
2025-05-27 16:34:52 +10:00
Shannon Sands
e37e02a94c
linting, moved env to community folder
2025-05-27 16:30:11 +10:00
Shannon Sands
a6ac7a3e42
linting and moved to community
2025-05-27 16:04:50 +10:00
Shannon Sands
8f08a398c0
linting
2025-05-27 15:46:10 +10:00
Shannon Sands
a353bb6eb1
linting
2025-05-27 15:45:07 +10:00
Shannon Sands
e7e747a396
linting
2025-05-27 15:43:12 +10:00
Shannon Sands
2efb690a24
linting, moved to community
2025-05-27 15:36:24 +10:00
Shannon Sands
33d5e4a25e
linting
2025-05-27 15:12:14 +10:00
Shannon Sands
eba7aac72b
linting, moving files into community subdirectory
2025-05-27 15:08:30 +10:00
Shannon Sands
d3803f62f2
Fix trailing whitespace in community README
2025-05-27 13:58:40 +10:00
Shannon Sands
f8912ae41d
linting, moved to community folder
2025-05-27 13:50:43 +10:00
Shannon Sands
c6a0439ec6
Integrate Sanskrit Poetry Environment from KhoomeiK - Add ChandasMeterReward to reward function registry - Move sanskrit_poetry_env.py to environments/community/sanskrit_poetry/ - Add comprehensive documentation as entry #25 in community README - Environment supports traditional Sanskrit meter validation using chandas classifier - Includes IAST to SLP1 transliteration for accurate meter analysis - Fixed code formatting with pre-commit hooks
2025-05-27 13:29:45 +10:00
Shannon Sands
89b38a233b
more linting
2025-05-27 13:09:07 +10:00
Shannon Sands
bfdf862829
more linting
2025-05-27 13:06:34 +10:00
Shannon Sands
46892c7bdc
linting & moved to community
2025-05-27 12:52:37 +10:00
Shannon Sands
ec2b6f093d
linting
2025-05-27 12:29:10 +10:00
Shannon Sands
54967ecae9
linting
2025-05-27 12:15:15 +10:00
Shannon Sands
3d15f0482c
linting
2025-05-27 08:59:03 +10:00
Shannon Sands
8b09ace467
Linting, move env to community
2025-05-27 08:53:06 +10:00
Shannon Sands
de3bf7c505
linting
2025-05-26 16:59:39 +10:00
Shannon Sands
38ffb0ebc4
moved to community
2025-05-26 16:11:20 +10:00
Shannon Sands
bf12e7df15
linting, moved env, updated contrib credit
2025-05-26 14:35:16 +10:00
Shannon Sands
84e4654795
Fix trailing whitespace and formatting issues in quantum environment documentation - Remove trailing whitespace from code blocks and documentation - Fix end-of-file formatting in README.md - Ensure all pre-commit checks pass for workflow compatibility
2025-05-26 14:14:36 +10:00
Shannon Sands
b845c635d4
linted, moved to community folder
2025-05-26 14:10:26 +10:00
Shannon Sands
7cfd3af149
Integrate Caput Mundi poker environment from yoniebans - Add Six-Seat No-Limit Hold'em poker training environment - Features expert hand history training with dual reward system - Includes action matching and bet sizing evaluation components - Supports multi-stage game analysis (preflop/flop/turn/river) - Integrates with HuggingFace datasets and WandB monitoring - Comprehensive documentation added to community README ( #17 ) - All code quality checks passing (black, isort, flake8) Environment moved from hack0/poker to environments/community/poker_holdem/ Resolves PR #84 from yoniebans/atropos
2025-05-26 13:38:49 +10:00
Shannon Sands
0f61c9dbde
moved to community folder
2025-05-26 13:27:43 +10:00
Shannon Sands
3707ac939f
linting
2025-05-26 13:03:23 +10:00
Shannon Sands
bc1f85619f
Fixed linting issues
2025-05-26 12:59:55 +10:00
Shannon Sands
5d22d360e2
Add Solitaire Winning Probability Environment - Mathematical probability analysis environment for training LLMs - Combines theoretical formula derivation with Monte Carlo simulation - Supports various solitaire-style card games - Includes sophisticated reward system with relative error calculation - All API keys removed for security - Comprehensive documentation added to community README - Author: davidedipeppe, PR: #88
2025-05-26 12:36:24 +10:00
Shannon Sands
d789128f20
Fix final code quality issues in Conversational Style DPO environment
2025-05-26 10:48:11 +10:00
Shannon Sands
c2c4928882
Fix final line length violations in Pokemon Showdown environment
2025-05-26 10:15:32 +10:00
Shannon Sands
c360ee20e7
linting
2025-05-26 09:39:51 +10:00
Shannon Sands
65108d12b2
Linting done
2025-05-26 09:28:23 +10:00
Shannon Sands
abab64e8dc
full commit
2025-05-26 08:48:33 +10:00
Shannon Sands
129b310593
Integrate JakeBoggs punchline VR-CLI environment - Add Punchline VR-CLI environment for training humor understanding using VR-CLI methodology - Moved from environments/hack0/punchlines to environments/community/punchline_vrcli - Updated community README with comprehensive environment description - Fixed linting issues and formatted code per project standards - Credit: @JakeBoggs
2025-05-26 08:45:45 +10:00
Teknium
4b532da35e
Merge pull request #114 from leopardracer/main
...
Improve API Server Documentation and Update UFC Prediction Output Format
2025-05-25 01:34:48 -07:00
Shannon Sands
0c4c3e1e6c
linting
2025-05-24 14:43:24 +10:00
Shannon Sands
47c42bdc72
linting
2025-05-24 14:37:24 +10:00
Shannon Sands
160abf8574
Integrate krishpop's Cat Behavior Communication Environment - Merged cat behavior environment from krishpop:main - Moved cat files from environments/ to environments/community/cat_behavior_env/ - Fixed file paths for cat_behaviors.json and cat_scenarios.json - Removed unused imports and fixed all linting issues - Updated community README with comprehensive cat environment description - Credited author @krishpop with GitHub link
2025-05-24 14:21:58 +10:00
Shannon Sands
95bec5e7a8
Integrate RoshanSanjeev's ExamCraft environment - Merged ExamCraft environment from RoshanSanjeev PR #95 - Moved from environments/hack0/ to environments/community/ - Removed demo_artifacts.tar.gz file to avoid repo clutter - Updated community README with comprehensive ExamCraft description - Fixed all linting issues (flake8, black, isort) - Credited author @RoshanSanjeev with GitHub link
2025-05-24 13:48:52 +10:00
Shannon Sands
32cf5e3d42
Integrate joshgarza's accessibility environment - Merged accessibility environment from joshgarza:main - Moved from environments/hack0/ to environments/community/ - Updated community README with detailed description of accessibility auto-fixer - Added note about missing dataset file - Credited author @joshgarza with GitHub link
2025-05-24 13:31:50 +10:00
leopardracer
c15149c23b
Update ufc_image_env.py
2025-05-23 19:43:04 +03:00