Commit graph

599 commits

Author SHA1 Message Date
Shannon Sands
28e1e76cb7 added default factory handling for CLI args 2025-05-23 11:15:44 +10:00
Shannon Sands
d98f65f444 linting 2025-05-23 11:09:06 +10:00
Shannon Sands
606a2615f0 loop check 2025-05-23 11:05:08 +10:00
Fabrizio (Misto) Milo
490caf6854 change directory 2025-05-22 13:03:37 -07:00
Slava
5e58408266
Update CONTRIBUTING.md 2025-05-22 17:22:27 +02:00
teknium1
46d33bf0b2 manually implement readme update due to 2025-05-21 23:09:45 -07:00
shannonsands
ba4ee8d68b
Merge pull request #63 from NousResearch/toolcalling-server-process-fix
Toolcalling server process fix
2025-05-21 21:56:11 -07:00
based-tachikoma
b01023ad3a additional fixes to alphafold2_multimer and tool_executor 2025-05-21 21:28:30 -07:00
based-tachikoma
6783a077cc update comment in protein_env.py 2025-05-21 21:27:42 -07:00
based-tachikoma
02585947e4 refactor file saving from alphafold2_multimer to tool_executor 2025-05-21 21:23:00 -07:00
shannonsands
18bc94f64c
Merge pull request #105 from NousResearch/community-folder
updated llms.txt, contribution guide and added community folder with …
2025-05-21 20:03:55 -07:00
Shannon Sands
0cba08baba wording 2025-05-22 12:00:07 +10:00
Shannon Sands
a614fa7e67 updated llms.txt, contribution guide and added community folder with README 2025-05-22 11:53:11 +10:00
google-labs-jules[bot]
276a845dd7 feat: Implement SWE-RL Environment with Full Refinements
I've implemented the SWERLEnv in environments/swe_rl_env.py, based on the
SWE-RL paper (arXiv:2502.18449). This version incorporates extensive
refinements based on your feedback.

Key features implemented in environments/swe_rl_env.py:
- Core environment structure (setup, trajectory collection, scoring, evaluation).
- "Thinking" step: LLM is prompted for reasoning within <think> </think> tags
  before generating a patch. Includes strict parsing for these tags.
- Dynamic prompt construction using `tokenizer.apply_chat_template` with
  NousResearch/DeepHermes-3-Llama-3-8B-Preview as the default model.
- Hugging Face dataset integration: Loads data from HF Hub with configurable
  dataset name, splits, and column mappings.
- Reward mechanism: Based on thinking tag correctness, patch format
  (SEARCH/REPLACE), and similarity to the oracle patch.
- Comprehensive WandB logging for training/evaluation metrics.

NOTE: I made multiple attempts to update 'environments/README.md'
with documentation for this new environment. While I
reported success in some turns, this was not consistently verifiable
and may not have been correctly applied. The README.md file may
require manual verification and updating for the SWERLEnv.
2025-05-22 01:28:00 +00:00
Andrew
5d34ea821d removed html from data folder 2025-05-21 18:10:10 -07:00
Andrew
6316cf31a5 chore: remove .zip and .html files per review feedback 2025-05-21 18:08:22 -07:00
based-tachikoma
227e594ebf add debug_target.pdb test file 2025-05-21 16:50:15 -07:00
Eric Liu
7eae51cc5c Move to subfolder 2025-05-21 16:19:00 -07:00
Eric Liu
a88e3afddf DeepSacrifice 2025-05-21 16:18:46 -07:00
Andrew
c3a4461008 feat: update and refactored meteorology environment with latest changes 2025-05-20 20:23:00 -07:00
based-tachikoma
1ee67de035 refactor, full run 2025-05-20 20:12:59 -07:00
based-tachikoma
de9dfff221 rfdiffusion fix 2025-05-20 20:12:59 -07:00
hjc-puro
bef6a0b99a ignore uv.lock 2025-05-20 17:38:43 -04:00
hjc-puro
cd74c93468 delete 2025-05-20 17:38:17 -04:00
hallerite
4d9bec44c6
[env]: add initial ProteinBinderEnv
Co-authored-by: based-tachikoma <based.tachikoma@gmail.com>
2025-05-18 20:03:21 -07:00
GabinFay
945ea30c3a add: lean prover environment 2025-05-18 19:27:48 -07:00
Earl Potters
db0cf9e6c0 Remove outdated DynastAI documentation and test scripts
- Deleted the ATROPOS_INTEGRATION.md and INSTALL_AND_RUN.md files, which contained installation and usage instructions for DynastAI.
- Removed test script test_dynastai_env.py and installation verification script verify_install.py, as they are no longer needed.
2025-05-18 19:06:20 -07:00
Karthik-Ragunath
923d74d8b0 updated README with vscode launch configs to run the code 2025-05-18 19:04:08 -07:00
GabinFay
cbb1607f12 add: router agent env 2025-05-18 18:53:53 -07:00
erikqu
6a4647f260 add playwright agent env 2025-05-18 18:47:45 -07:00
ParsaIdp
71f6d48e87
Create optimizer_benchmark_environmenr.py 2025-05-18 18:14:10 -07:00
ParsaIdp
f9a444b6f2
Update optimizer_benchmark_env.py 2025-05-18 18:13:25 -07:00
Kirill Igumenshchev (aider)
f59aaba24a feat: ask to generate 3 example jokes in dataset question prompt 2025-05-18 18:12:48 -07:00
GabinFay
c7a7db309c add: environment for deep philosophical thinking 2025-05-18 18:04:49 -07:00
Dylan Anderson
7e91a94a3e Add wandb 2025-05-18 18:00:21 -07:00
arihanv
291dcd8351 add: env 2025-05-18 17:58:56 -07:00
vivek100
deee926e2b Add hack0 metric card generator environment with artifacts and documentation 2025-05-18 17:58:20 -07:00
Karthik-Ragunath
9125bd5f80 pushing file 2025-05-18 17:58:09 -07:00
Karthik-Ragunath
34e9784311 pushing jsonl files 2025-05-18 17:56:27 -07:00
Kirill Igumenshchev
41cf093415 feat: add HTML rendering for humor datasets 2025-05-18 17:55:59 -07:00
Alex
444bd5b1d7
doctor.jsonl 2025-05-18 17:55:30 -07:00
Josh
c17cdb4486 Update README 2025-05-18 17:53:59 -07:00
Joshua Jerin
ab9a6f6d97
Update README.md 2025-05-18 20:53:13 -04:00
Dylan Anderson
1525e9404a Add youtube 2025-05-18 17:53:07 -07:00
Joshua Jerin
baa6a1feef
Update README.md 2025-05-18 20:50:53 -04:00
Steven Li
4eae1c44ca add examples to cat system prompt 2025-05-18 17:50:42 -07:00
Tvpower
320614e294 added videp 2025-05-18 17:50:33 -07:00
Karthik-Ragunath
9725761f5b dev - push for submission 2025-05-18 17:50:15 -07:00
ParsaIdp
856b437b3a
Update wrapper.py 2025-05-18 17:49:31 -07:00
Joshua Jerin
55e44ee198
Merge pull request #1 from joshuajerin/josh
Initial commit
2025-05-18 20:48:57 -04:00