Shannon Sands
|
a353bb6eb1
|
linting
|
2025-05-27 15:45:07 +10:00 |
|
Shannon Sands
|
e7e747a396
|
linting
|
2025-05-27 15:43:12 +10:00 |
|
Shannon Sands
|
2efb690a24
|
linting, moved to community
|
2025-05-27 15:36:24 +10:00 |
|
Shannon Sands
|
33d5e4a25e
|
linting
|
2025-05-27 15:12:14 +10:00 |
|
Shannon Sands
|
eba7aac72b
|
linting, moving files into community subdirectory
|
2025-05-27 15:08:30 +10:00 |
|
Shannon Sands
|
d3803f62f2
|
Fix trailing whitespace in community README
|
2025-05-27 13:58:40 +10:00 |
|
Shannon Sands
|
f8912ae41d
|
linting, moved to community folder
|
2025-05-27 13:50:43 +10:00 |
|
Shannon Sands
|
c6a0439ec6
|
Integrate Sanskrit Poetry Environment from KhoomeiK - Add ChandasMeterReward to reward function registry - Move sanskrit_poetry_env.py to environments/community/sanskrit_poetry/ - Add comprehensive documentation as entry #25 in community README - Environment supports traditional Sanskrit meter validation using chandas classifier - Includes IAST to SLP1 transliteration for accurate meter analysis - Fixed code formatting with pre-commit hooks
|
2025-05-27 13:29:45 +10:00 |
|
Shannon Sands
|
f789e01347
|
more linting
|
2025-05-27 13:15:26 +10:00 |
|
Shannon Sands
|
b82e23f11d
|
more linting
|
2025-05-27 13:14:01 +10:00 |
|
Shannon Sands
|
54e574d350
|
more linting
|
2025-05-27 13:11:36 +10:00 |
|
Shannon Sands
|
89b38a233b
|
more linting
|
2025-05-27 13:09:07 +10:00 |
|
Shannon Sands
|
bfdf862829
|
more linting
|
2025-05-27 13:06:34 +10:00 |
|
Shannon Sands
|
46892c7bdc
|
linting & moved to community
|
2025-05-27 12:52:37 +10:00 |
|
Shannon Sands
|
7b194642b3
|
Remove uv.lock file - blocked in gitignore
|
2025-05-27 12:46:33 +10:00 |
|
Shannon Sands
|
ec2b6f093d
|
linting
|
2025-05-27 12:29:10 +10:00 |
|
Shannon Sands
|
54967ecae9
|
linting
|
2025-05-27 12:15:15 +10:00 |
|
Shannon Sands
|
13a70e09ab
|
Merge remote-tracking branch 'hallerite/protein_env' into merge-hallerite-contributions
|
2025-05-27 09:05:15 +10:00 |
|
Shannon Sands
|
3d15f0482c
|
linting
|
2025-05-27 08:59:03 +10:00 |
|
Shannon Sands
|
8b09ace467
|
Linting, move env to community
|
2025-05-27 08:53:06 +10:00 |
|
Shannon Sands
|
67e057b13c
|
Merge remote-tracking branch 'ecsbeats/main' into merge-ecsbeats-contributions
# Conflicts:
# uv.lock
|
2025-05-27 08:41:14 +10:00 |
|
hjc-puro
|
4b9e2106b0
|
Delete environments/.DS_Store
|
2025-05-26 18:29:10 -04:00 |
|
Teknium
|
3e6247b26e
|
Merge pull request #106 from NousResearch/feat/swe-rl-environment-v2
Feat/swe rl environment v2
|
2025-05-26 12:47:11 -07:00 |
|
teknium1
|
2ddc7f39cd
|
few small defaults changes
|
2025-05-26 12:43:57 -07:00 |
|
teknium1
|
b2bb61f8cd
|
make eval stuff clearer
|
2025-05-26 01:56:45 -07:00 |
|
teknium1
|
e2ea82b29b
|
Fix up dataset and data dumps
|
2025-05-26 01:50:22 -07:00 |
|
Shannon Sands
|
de3bf7c505
|
linting
|
2025-05-26 16:59:39 +10:00 |
|
Shannon Sands
|
98d9ef87a2
|
Merge remote-tracking branch 'slyracoon23/main' into merge-slyracoon23-contributions
# Conflicts:
# .gitignore
|
2025-05-26 16:23:41 +10:00 |
|
Shannon Sands
|
cd77eab79e
|
linting
|
2025-05-26 16:12:08 +10:00 |
|
Shannon Sands
|
38ffb0ebc4
|
moved to community
|
2025-05-26 16:11:20 +10:00 |
|
Shannon Sands
|
a70a8d7086
|
Merge remote-tracking branch 'tsadpbb/main' into merge-tsadpbb-contributions
|
2025-05-26 16:01:43 +10:00 |
|
Shannon Sands
|
bf12e7df15
|
linting, moved env, updated contrib credit
|
2025-05-26 14:35:16 +10:00 |
|
Shannon Sands
|
81d1ebeaef
|
Merge remote-tracking branch 'arihanv/dev' into merge-arihanv-contributions
|
2025-05-26 14:18:44 +10:00 |
|
Shannon Sands
|
84e4654795
|
Fix trailing whitespace and formatting issues in quantum environment documentation - Remove trailing whitespace from code blocks and documentation - Fix end-of-file formatting in README.md - Ensure all pre-commit checks pass for workflow compatibility
|
2025-05-26 14:14:36 +10:00 |
|
Shannon Sands
|
b845c635d4
|
linted, moved to community folder
|
2025-05-26 14:10:26 +10:00 |
|
Shannon Sands
|
20c6e9d8d7
|
Merge remote-tracking branch 'jeannemtl/hack/env_quant' into merge-jeannemtl-contributions
|
2025-05-26 13:52:36 +10:00 |
|
Shannon Sands
|
aff033443f
|
linting
|
2025-05-26 13:42:14 +10:00 |
|
Shannon Sands
|
7cfd3af149
|
Integrate Caput Mundi poker environment from yoniebans - Add Six-Seat No-Limit Hold'em poker training environment - Features expert hand history training with dual reward system - Includes action matching and bet sizing evaluation components - Supports multi-stage game analysis (preflop/flop/turn/river) - Integrates with HuggingFace datasets and WandB monitoring - Comprehensive documentation added to community README (#17) - All code quality checks passing (black, isort, flake8) Environment moved from hack0/poker to environments/community/poker_holdem/ Resolves PR #84 from yoniebans/atropos
|
2025-05-26 13:38:49 +10:00 |
|
Shannon Sands
|
04c06c3e20
|
Merge remote-tracking branch 'yoniebans/main' into merge-yoniebans-contributions
|
2025-05-26 13:33:36 +10:00 |
|
Shannon Sands
|
0f61c9dbde
|
moved to community folder
|
2025-05-26 13:27:43 +10:00 |
|
Shannon Sands
|
a17dbdfedc
|
Merge remote-tracking branch 'metonym/deepsacrifice' into merge-metonym-contributions
|
2025-05-26 13:22:22 +10:00 |
|
Shannon Sands
|
3707ac939f
|
linting
|
2025-05-26 13:03:23 +10:00 |
|
Shannon Sands
|
bc1f85619f
|
Fixed linting issues
|
2025-05-26 12:59:55 +10:00 |
|
Shannon Sands
|
f17c07c823
|
Merge remote-tracking branch 'justin5764/LeanRL' into merge-justin5764-contributions
|
2025-05-26 12:47:52 +10:00 |
|
Shannon Sands
|
5551580170
|
resolved conflicts
|
2025-05-26 12:42:37 +10:00 |
|
Shannon Sands
|
5d22d360e2
|
Add Solitaire Winning Probability Environment - Mathematical probability analysis environment for training LLMs - Combines theoretical formula derivation with Monte Carlo simulation - Supports various solitaire-style card games - Includes sophisticated reward system with relative error calculation - All API keys removed for security - Comprehensive documentation added to community README - Author: davidedipeppe, PR: #88
|
2025-05-26 12:36:24 +10:00 |
|
Shannon Sands
|
d789128f20
|
Fix final code quality issues in Conversational Style DPO environment
|
2025-05-26 10:48:11 +10:00 |
|
Shannon Sands
|
441fd1036d
|
Merge Karthik-Ragunath conversational style DPO environment contribution
|
2025-05-26 10:25:08 +10:00 |
|
Shannon Sands
|
c2c4928882
|
Fix final line length violations in Pokemon Showdown environment
|
2025-05-26 10:15:32 +10:00 |
|
Shannon Sands
|
0038a710d0
|
merging
|
2025-05-26 09:58:38 +10:00 |
|