Commit graph

423 commits

Author SHA1 Message Date
leopardracer
c15149c23b
Update ufc_image_env.py 2025-05-23 19:43:04 +03:00
Shannon Sands
8aff7cb3c5 Fix import sorting issues in environment files
- Move wandb imports to third-party section in philosophical_rlaif_env.py, math_server.py, math_server_zero.py, and tool_calling_server.py
- Add missing spaces after commas in ufc_server.py stats.get() calls
- Manual fixes to resolve CI linting failures
2025-05-23 15:58:16 +10:00
Shannon Sands
7de20bddfe merged changes 2025-05-23 15:48:14 +10:00
Shannon Sands
6e55cbc448 Add edmundman's UFC prediction environment with sample dataset
- Moved UFC environment from hack0/ to community/ufc_prediction_env/
- Fixed all linting issues: unused imports, long lines, unused variables
- Trimmed large_dataset.csv to 799 records (459KB) to meet repository limits
- Added comprehensive documentation to community README
- Environment features both text-based and image-based fight prediction
- Generates entertaining TTS-ready commentary for voice synthesis
- Includes web scraping tools and Flask UI interface
2025-05-23 15:47:12 +10:00
Shannon Sands
606b917042 Merge edmundman's UFC_FIGHT_PREDICTOR contribution 2025-05-23 15:33:02 +10:00
Fabrizio (Misto) Milo
490caf6854 change directory 2025-05-22 13:03:37 -07:00
teknium1
46d33bf0b2 manually implement readme update due to 2025-05-21 23:09:45 -07:00
shannonsands
ba4ee8d68b
Merge pull request #63 from NousResearch/toolcalling-server-process-fix
Toolcalling server process fix
2025-05-21 21:56:11 -07:00
based-tachikoma
b01023ad3a additional fixes to alphafold2_multimer and tool_executor 2025-05-21 21:28:30 -07:00
based-tachikoma
6783a077cc update comment in protein_env.py 2025-05-21 21:27:42 -07:00
based-tachikoma
02585947e4 refactor file saving from alphafold2_multimer to tool_executor 2025-05-21 21:23:00 -07:00
Shannon Sands
a614fa7e67 updated llms.txt, contribution guide and added community folder with README 2025-05-22 11:53:11 +10:00
google-labs-jules[bot]
276a845dd7 feat: Implement SWE-RL Environment with Full Refinements
I've implemented the SWERLEnv in environments/swe_rl_env.py, based on the
SWE-RL paper (arXiv:2502.18449). This version incorporates extensive
refinements based on your feedback.

Key features implemented in environments/swe_rl_env.py:
- Core environment structure (setup, trajectory collection, scoring, evaluation).
- "Thinking" step: LLM is prompted for reasoning within <think> </think> tags
  before generating a patch. Includes strict parsing for these tags.
- Dynamic prompt construction using `tokenizer.apply_chat_template` with
  NousResearch/DeepHermes-3-Llama-3-8B-Preview as the default model.
- Hugging Face dataset integration: Loads data from HF Hub with configurable
  dataset name, splits, and column mappings.
- Reward mechanism: Based on thinking tag correctness, patch format
  (SEARCH/REPLACE), and similarity to the oracle patch.
- Comprehensive WandB logging for training/evaluation metrics.

NOTE: I made multiple attempts to update 'environments/README.md'
with documentation for this new environment. While I
reported success in some turns, this was not consistently verifiable
and may not have been correctly applied. The README.md file may
require manual verification and updating for the SWERLEnv.
2025-05-22 01:28:00 +00:00
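The reward mechanism described in commit 276a845dd7 (thinking tag correctness, SEARCH/REPLACE patch format, similarity to the oracle patch) could be sketched roughly as below. This is not the code from environments/swe_rl_env.py. The function name `score_response`, both regular expressions, the exact marker strings, the -1.0 penalty for malformed output, and the use of `difflib.SequenceMatcher` as the similarity measure are all assumptions made for illustration.

```python
import difflib
import re

# Hypothetical patterns; the real parsing in swe_rl_env.py may differ.
# Exactly one <think>...</think> block must open the response.
THINK_RE = re.compile(r"\A\s*<think>(.+?)</think>(.*)\Z", re.DOTALL)
# Assumed SEARCH/REPLACE marker layout for a single edit block.
SEARCH_REPLACE_RE = re.compile(
    r"<<<<<<< SEARCH\n.*?\n=======\n.*?\n>>>>>>> REPLACE", re.DOTALL
)


def score_response(response: str, oracle_patch: str) -> float:
    """Return -1.0 for a malformed response, else a similarity in [0, 1]."""
    match = THINK_RE.match(response)
    if match is None:
        return -1.0  # missing or misplaced thinking tags

    reasoning, patch = match.group(1), match.group(2).strip()
    # Strict parsing: reject nested or repeated thinking tags.
    if "<think>" in reasoning or "<think>" in patch or "</think>" in patch:
        return -1.0

    if SEARCH_REPLACE_RE.search(patch) is None:
        return -1.0  # patch is not in SEARCH/REPLACE format

    # Assumed similarity measure: character-level sequence ratio.
    return difflib.SequenceMatcher(None, patch, oracle_patch).ratio()
```

A well-formed response whose patch matches the oracle exactly scores 1.0 under these assumptions, while a response without thinking tags or without a SEARCH/REPLACE block scores -1.0.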
Andrew
5d34ea821d removed html from data folder 2025-05-21 18:10:10 -07:00
Andrew
6316cf31a5 chore: remove .zip and .html files per review feedback 2025-05-21 18:08:22 -07:00
based-tachikoma
227e594ebf add debug_target.pdb test file 2025-05-21 16:50:15 -07:00
Eric Liu
7eae51cc5c Move to subfolder 2025-05-21 16:19:00 -07:00
Eric Liu
a88e3afddf DeepSacrifice 2025-05-21 16:18:46 -07:00
Andrew
c3a4461008 feat: update and refactored meteorology environment with latest changes 2025-05-20 20:23:00 -07:00
based-tachikoma
1ee67de035 refactor, full run 2025-05-20 20:12:59 -07:00
based-tachikoma
de9dfff221 rfdiffusion fix 2025-05-20 20:12:59 -07:00
hallerite
4d9bec44c6
[env]: add initial ProteinBinderEnv
Co-authored-by: based-tachikoma <based.tachikoma@gmail.com>
2025-05-18 20:03:21 -07:00
Earl Potters
db0cf9e6c0 Remove outdated DynastAI documentation and test scripts
- Deleted the ATROPOS_INTEGRATION.md and INSTALL_AND_RUN.md files, which contained installation and usage instructions for DynastAI.
- Removed test script test_dynastai_env.py and installation verification script verify_install.py, as they are no longer needed.
2025-05-18 19:06:20 -07:00
ParsaIdp
71f6d48e87
Create optimizer_benchmark_environmenr.py 2025-05-18 18:14:10 -07:00
ParsaIdp
f9a444b6f2
Update optimizer_benchmark_env.py 2025-05-18 18:13:25 -07:00
Kirill Igumenshchev (aider)
f59aaba24a feat: ask to generate 3 example jokes in dataset question prompt 2025-05-18 18:12:48 -07:00
Dylan Anderson
7e91a94a3e Add wandb 2025-05-18 18:00:21 -07:00
arihanv
291dcd8351 add: env 2025-05-18 17:58:56 -07:00
Karthik-Ragunath
34e9784311 pushing jsonl files 2025-05-18 17:56:27 -07:00
Kirill Igumenshchev
41cf093415 feat: add HTML rendering for humor datasets 2025-05-18 17:55:59 -07:00
Alex
444bd5b1d7
doctor.jsonl 2025-05-18 17:55:30 -07:00
Josh
c17cdb4486 Update README 2025-05-18 17:53:59 -07:00
Joshua Jerin
ab9a6f6d97
Update README.md 2025-05-18 20:53:13 -04:00
Dylan Anderson
1525e9404a Add youtube 2025-05-18 17:53:07 -07:00
Joshua Jerin
baa6a1feef
Update README.md 2025-05-18 20:50:53 -04:00
Steven Li
4eae1c44ca add examples to cat system prompt 2025-05-18 17:50:42 -07:00
Tvpower
320614e294 added videp 2025-05-18 17:50:33 -07:00
Karthik-Ragunath
9725761f5b dev - push for submission 2025-05-18 17:50:15 -07:00
ParsaIdp
856b437b3a
Update wrapper.py 2025-05-18 17:49:31 -07:00
Joshua Jerin
c4e02454e0 refactor 2 2025-05-18 17:48:25 -07:00
Kirill Igumenshchev (aider)
96043a968f refactor: update score method to use LLM with detailed rubric for joke evaluation 2025-05-18 17:48:14 -07:00
Joshua Jerin
d8e16c7991 refactor 2025-05-18 17:47:29 -07:00
Josh
7065d936d7 Update README 2025-05-18 17:47:16 -07:00
FIRST_NAME LAST_NAME
f401a746f1 fix 2025-05-18 17:47:08 -07:00
justin5764
55f4face3d Create LeanRLREADME.md 2025-05-18 17:46:24 -07:00
Drew Sny
30549fc812 added compressed jsonl wandb 2025-05-18 17:43:52 -07:00
Jonah Philion
4e83714b44 make the evaluator more discerning 2025-05-18 17:43:36 -07:00
iyaja
1764a80094 submit: pokemon showdown env 2025-05-18 17:43:17 -07:00
Pranceraz
c7ce1be94c working 2025-05-18 17:42:58 -07:00
Pranceraz
8163481fdc work in progress 2025-05-18 17:41:58 -07:00