Commit graph

84 commits

Author SHA1 Message Date
pre-commit-ci[bot]
47c68f06f2 [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
2025-09-30 14:03:43 +00:00
hjc-puro
dddfb30c5b Fix smolagents ChatMessage compatibility and improve documentation
This commit fixes compatibility issues with smolagents 1.22.0 ChatMessage
objects and improves the documentation for easier setup.

Changes:
- Fix smolagents_model.py to handle ChatMessage objects (not just dicts)
  in _extract_user_message() and _format_chat_messages()
- Fix smolagents_env.py to handle ChatMessage objects in trajectory
  scoring and data group creation
- Update README.md with clearer installation instructions, Quick Start
  section, and automatic GAIA dataset download documentation
- Add test_run.sh script for easy testing with OpenAI models
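
For illustration, the dict-versus-ChatMessage handling described in the changes above might look roughly like the following. This is a minimal sketch and not the code from this commit: `FakeChatMessage` is a hypothetical stand-in for an object-style message with `role` and `content` attributes, and `get_field` and `extract_user_message` are illustrative names, not the actual helpers in smolagents_model.py.

```python
from dataclasses import dataclass
from typing import Any


@dataclass
class FakeChatMessage:
    """Hypothetical stand-in for an object-style chat message (assumption)."""
    role: str
    content: Any


def get_field(message: Any, name: str, default: Any = None) -> Any:
    """Read a field from either a plain dict or an attribute-style object."""
    if isinstance(message, dict):
        return message.get(name, default)
    return getattr(message, name, default)


def extract_user_message(messages: list) -> str:
    """Return the text of the most recent user message, or '' if none exists."""
    for message in reversed(messages):
        if get_field(message, "role") == "user":
            content = get_field(message, "content", "")
            if isinstance(content, list):
                # Assumed shape: a list of parts such as {"type": "text", "text": "..."}
                return "".join(
                    part.get("text", "") for part in content if isinstance(part, dict)
                )
            return content or ""
    return ""
```

Routing every field access through one accessor keeps the dict and object cases in a single place, so callers do not need to branch on the message type.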

Tested with:
- smolagents 1.22.0
- gpt-4o-mini via OpenAI API
- Tavily web search tools
- Automatic GAIA dataset download

🤖 Generated with [Claude Code](https://claude.com/claude-code)

Co-Authored-By: Claude <noreply@anthropic.com>
2025-09-29 21:27:52 +00:00
Allan Niemerg
bf4d84839b update README 2025-05-27 11:58:54 -05:00
Allan Niemerg
0d54b3e83e add automatic dataset load 2025-05-27 11:57:17 -05:00
Allan Niemerg
013090579d fix imports and style issues 2025-05-27 11:00:35 -05:00
Allan Niemerg
7a653044a4 add GAIA download to README 2025-05-21 16:18:08 -05:00
Allan Niemerg
7710e151cc This adds the smolagents integration to Atropos, enabling the creation of high-quality agent trajectories for training data. 2025-05-21 15:47:57 -05:00
Teknium
bdb15e5d85
Merge pull request #21 from NousResearch/add-sft-data-env
add an SFT data loading env
2025-05-17 11:30:07 -07:00
shannonsands
b1c9b23956
Merge pull request #50 from NousResearch/readme-updates
Added new env info
2025-05-17 10:22:20 -07:00
artem
651b40968d
Merge pull request #51 from NousResearch/kernelbench_env
Kernelbench env with parallel compilation
2025-05-16 21:38:57 -07:00
Artem Yatsenko
4b1c07232c readme updates 2025-05-17 04:37:09 +00:00
Artem Yatsenko
5396975bab move to its own folder, add readme, citation. 2025-05-17 04:34:49 +00:00
Shannon Sands
edf2beaa32 linting 2025-05-16 20:40:15 -07:00
Artem Yatsenko
9875f5dc06 pre-commit. add KB path as variable 2025-05-17 03:17:14 +00:00
Artem Yatsenko
fbfe06771b add parallel compile. add instructions 2025-05-16 19:53:46 -07:00
Shannon Sands
41caa05a1a removed merge error 2025-05-16 19:49:37 -07:00
Shannon Sands
9753d5a122 resolved conflict 2025-05-16 19:48:15 -07:00
Teknium
c9b6534eae
Merge pull request #44 from NousResearch/instruction-following-algo-environment
Instruction following algo environment
2025-05-16 19:37:00 -07:00
teknium1
20d263a495 add citation to allenai 2025-05-16 19:34:51 -07:00
teknium1
287bbcd356 some cleanup for final merge 2025-05-16 19:24:50 -07:00
Shannon Sands
fd63c76a5c Added new env info 2025-05-16 16:44:33 -07:00
teknium1
daa6f0ff18 add stricter enforcement of think tags 2025-05-16 13:18:20 -07:00
Artem Yatsenko
45aece515f first commit for kernelbench env 2025-05-16 11:29:10 -07:00
teknium1
6ae0703ad6 fix some regex and show special tokens for completions table 2025-05-15 22:29:42 -07:00
teknium1
24c571654e match num_max_requests with groupsize 2025-05-15 15:57:39 -07:00
Shannon Sands
bfb822f1e0 updated APIServerConfig and added requirements.txt and install instructions to README 2025-05-15 12:22:00 -07:00
hjc-puro
dcda88d79b fix validation errors 2025-05-15 04:30:59 -07:00
teknium1
1a9fa016b5 add dependencies to the env readme 2025-05-14 19:44:13 -07:00
teknium1
90e235a3e9 update environments readme 2025-05-14 19:40:32 -07:00
teknium1
2ab8905d4f fix score 2025-05-14 19:35:43 -07:00
teknium1
8a0e107806 change eval set size; since this is a small dataset we need more data for training 2025-05-14 19:18:01 -07:00
teknium1
bcc38567ca update some dataset stuff to use allenai's 2025-05-14 18:39:31 -07:00
teknium1
881af55f9a add instruction following algo env 2025-05-14 18:31:05 -07:00
Shannon Sands
c72a27d376 fixed linting in latest main 2025-05-14 17:29:57 -07:00
Shannon Sands
00dd120067 Merge branch 'main' into blackjack2-env 2025-05-14 17:27:44 -07:00
Shannon Sands
8fad665f6a moved folder location 2025-05-14 17:22:30 -07:00
Shannon Sands
c2bf3f5acd moved folder location 2025-05-14 17:22:18 -07:00
Joe Li
c1ae25c202
Merge pull request #26 from NousResearch/coding_server
add code execution environment
2025-05-14 15:08:10 -07:00
Shannon Sands
3fba8e3527 linting 2025-05-14 14:22:25 -07:00
Shannon Sands
d8ab1a6758 linting 2025-05-14 14:20:54 -07:00
Shannon Sands
1a7c0294fa refactoring for more clarity 2025-05-14 14:18:43 -07:00
Shannon Sands
bb6c205efe Linting 2025-05-14 14:05:52 -07:00
Shannon Sands
67cfd961c5 linting 2025-05-14 14:01:31 -07:00
Shannon Sands
826de9e283 Updated README 2025-05-14 13:57:20 -07:00
Shannon Sands
f5172b45a8 Added README 2025-05-14 13:35:15 -07:00
Shannon Sands
85f462df5e Updated test scripts 2025-05-14 12:05:59 -07:00
Shannon Sands
d6f9d58606 new env runs locally 2025-05-14 11:57:45 -07:00
Shannon Sands
54ae40840d no-thinking env added 2025-05-14 11:28:39 -07:00
Shannon Sands
21cc528b85 move best-of-n selection to util 2025-05-14 10:35:12 -07:00
Shannon Sands
4c00e2b209 move message history out to utils 2025-05-14 10:13:56 -07:00