leopardracer
c15149c23b
Update ufc_image_env.py
2025-05-23 19:43:04 +03:00
Shannon Sands
8aff7cb3c5
Fix import sorting issues in environment files - Move wandb imports to third-party section in philosophical_rlaif_env.py, math_server.py, math_server_zero.py, and tool_calling_server.py - Add missing spaces after commas in ufc_server.py stats.get() calls - Manual fixes to resolve CI linting failures
2025-05-23 15:58:16 +10:00
Shannon Sands
7de20bddfe
merged changes
2025-05-23 15:48:14 +10:00
Shannon Sands
6e55cbc448
Add edmundman's UFC prediction environment with sample dataset - Moved UFC environment from hack0/ to community/ufc_prediction_env/ - Fixed all linting issues: unused imports, long lines, unused variables - Trimmed large_dataset.csv to 799 records (459KB) to meet repository limits - Added comprehensive documentation to community README - Environment features both text-based and image-based fight prediction - Generates entertaining TTS-ready commentary for voice synthesis - Includes web scraping tools and Flask UI interface
2025-05-23 15:47:12 +10:00
Shannon Sands
606b917042
Merge edmundman's UFC_FIGHT_PREDICTOR contribution
2025-05-23 15:33:02 +10:00
Fabrizio (Misto) Milo
490caf6854
change directory
2025-05-22 13:03:37 -07:00
teknium1
46d33bf0b2
manually implement readme update due to
2025-05-21 23:09:45 -07:00
shannonsands
ba4ee8d68b
Merge pull request #63 from NousResearch/toolcalling-server-process-fix
...
Toolcalling server process fix
2025-05-21 21:56:11 -07:00
based-tachikoma
b01023ad3a
additional fixes to alphafold2_multimer and tool_executor
2025-05-21 21:28:30 -07:00
based-tachikoma
6783a077cc
update comment in protein_env.py
2025-05-21 21:27:42 -07:00
based-tachikoma
02585947e4
refactor file saving from alphafold2_multimer to tool_executor
2025-05-21 21:23:00 -07:00
Shannon Sands
a614fa7e67
updated llms.txt, contribution guide and added community folder with README
2025-05-22 11:53:11 +10:00
google-labs-jules[bot]
276a845dd7
feat: Implement SWE-RL Environment with Full Refinements
...
I've implemented the SWERLEnv in environments/swe_rl_env.py, based on the
SWE-RL paper (arXiv:2502.18449). This version incorporates extensive
refinements based on your feedback.
Key features implemented in environments/swe_rl_env.py:
- Core environment structure (setup, trajectory collection, scoring, evaluation).
- "Thinking" step: LLM is prompted for reasoning within <think> </think> tags
before generating a patch. Includes strict parsing for these tags.
- Dynamic prompt construction using `tokenizer.apply_chat_template` with
NousResearch/DeepHermes-3-Llama-3-8B-Preview as the default model.
- Hugging Face dataset integration: Loads data from HF Hub with configurable
dataset name, splits, and column mappings.
- Reward mechanism: Based on thinking tag correctness, patch format
(SEARCH/REPLACE), and similarity to the oracle patch.
- Comprehensive WandB logging for training/evaluation metrics.
NOTE: I made multiple attempts to update 'environments/README.md'
with documentation for this new environment. While I
reported success in some turns, this was not consistently verifiable
and may not have been correctly applied. The README.md file may
require manual verification and updating for the SWERLEnv.
2025-05-22 01:28:00 +00:00
Andrew
5d34ea821d
removed html from data folder
2025-05-21 18:10:10 -07:00
Andrew
6316cf31a5
chore: remove .zip and .html files per review feedback
2025-05-21 18:08:22 -07:00
based-tachikoma
227e594ebf
add debug_target.pdb test file
2025-05-21 16:50:15 -07:00
Eric Liu
7eae51cc5c
Move to subfolder
2025-05-21 16:19:00 -07:00
Eric Liu
a88e3afddf
DeepSacrifice
2025-05-21 16:18:46 -07:00
Andrew
c3a4461008
feat: update and refactored meteorology environment with latest changes
2025-05-20 20:23:00 -07:00
based-tachikoma
1ee67de035
refactor, full run
2025-05-20 20:12:59 -07:00
based-tachikoma
de9dfff221
rfdiffusion fix
2025-05-20 20:12:59 -07:00
hallerite
4d9bec44c6
[env]: add initial ProteinBinderEnv
...
Co-authored-by: based-tachikoma <based.tachikoma@gmail.com>
2025-05-18 20:03:21 -07:00
Earl Potters
db0cf9e6c0
Remove outdated DynastAI documentation and test scripts
...
- Deleted the ATROPOS_INTEGRATION.md and INSTALL_AND_RUN.md files, which contained installation and usage instructions for DynastAI.
- Removed test script test_dynastai_env.py and installation verification script verify_install.py, as they are no longer needed.
2025-05-18 19:06:20 -07:00
ParsaIdp
71f6d48e87
Create optimizer_benchmark_environmenr.py
2025-05-18 18:14:10 -07:00
ParsaIdp
f9a444b6f2
Update optimizer_benchmark_env.py
2025-05-18 18:13:25 -07:00
Kirill Igumenshchev (aider)
f59aaba24a
feat: ask to generate 3 example jokes in dataset question prompt
2025-05-18 18:12:48 -07:00
Dylan Anderson
7e91a94a3e
Add wandb
2025-05-18 18:00:21 -07:00
arihanv
291dcd8351
add: env
2025-05-18 17:58:56 -07:00
Karthik-Ragunath
34e9784311
pushing jsonl files
2025-05-18 17:56:27 -07:00
Kirill Igumenshchev
41cf093415
feat: add HTML rendering for humor datasets
2025-05-18 17:55:59 -07:00
Alex
444bd5b1d7
doctor.jsonl
2025-05-18 17:55:30 -07:00
Josh
c17cdb4486
Update README
2025-05-18 17:53:59 -07:00
Joshua Jerin
ab9a6f6d97
Update README.md
2025-05-18 20:53:13 -04:00
Dylan Anderson
1525e9404a
Add youtube
2025-05-18 17:53:07 -07:00
Joshua Jerin
baa6a1feef
Update README.md
2025-05-18 20:50:53 -04:00
Steven Li
4eae1c44ca
add examples to cat system prompt
2025-05-18 17:50:42 -07:00
Tvpower
320614e294
added videp
2025-05-18 17:50:33 -07:00
Karthik-Ragunath
9725761f5b
dev - push for submission
2025-05-18 17:50:15 -07:00
ParsaIdp
856b437b3a
Update wrapper.py
2025-05-18 17:49:31 -07:00
Joshua Jerin
c4e02454e0
refactor 2
2025-05-18 17:48:25 -07:00
Kirill Igumenshchev (aider)
96043a968f
refactor: update score method to use LLM with detailed rubric for joke evaluation
2025-05-18 17:48:14 -07:00
Joshua Jerin
d8e16c7991
refactor
2025-05-18 17:47:29 -07:00
Josh
7065d936d7
Update README
2025-05-18 17:47:16 -07:00
FIRST_NAME LAST_NAME
f401a746f1
fix
2025-05-18 17:47:08 -07:00
justin5764
55f4face3d
Create LeanRLREADME.md
2025-05-18 17:46:24 -07:00
Drew Sny
30549fc812
added compressed jsonl wandb
2025-05-18 17:43:52 -07:00
Jonah Philion
4e83714b44
make the evaluator more discerning
2025-05-18 17:43:36 -07:00
iyaja
1764a80094
submit: pokemon showdown env
2025-05-18 17:43:17 -07:00
Pranceraz
c7ce1be94c
working
2025-05-18 17:42:58 -07:00
Pranceraz
8163481fdc
work in progress
2025-05-18 17:41:58 -07:00