atropos

mirror of https://github.com/NousResearch/atropos.git synced 2026-04-27 17:23:08 +00:00

Author	SHA1	Message	Date
teknium1	287bbcd356	some cleanup for final merge	2025-05-16 19:24:50 -07:00
teknium1	daa6f0ff18	add stricter enforcement of think tags	2025-05-16 13:18:20 -07:00
teknium1	6ae0703ad6	fix some regex and show special tokens for completions table	2025-05-15 22:29:42 -07:00
teknium1	24c571654e	match num_max_requests with groupsize	2025-05-15 15:57:39 -07:00
hjc-puro	dcda88d79b	fix validation errors	2025-05-15 04:30:59 -07:00
teknium1	1a9fa016b5	add dependencies to the env readme	2025-05-14 19:44:13 -07:00
teknium1	90e235a3e9	update environments readme	2025-05-14 19:40:32 -07:00
teknium1	2ab8905d4f	fix score	2025-05-14 19:35:43 -07:00
teknium1	8a0e107806	change eval set size since this is a small dataset we need mo data for trainn	2025-05-14 19:18:01 -07:00
teknium1	bcc38567ca	update some dataset stuff to use allenai's	2025-05-14 18:39:31 -07:00
teknium1	881af55f9a	add instruction following algo env	2025-05-14 18:31:05 -07:00
dmahan93	6e9405ba95	Fix bad merge	2025-05-12 20:02:54 -05:00
dmahan93	0aaf59fc9a	add trl server add gsm8k example for axolotl checking	2025-05-12 19:04:46 -05:00
dmahan93	96be544228	Merge commit '`71e7a5ca27`' into add-support-for-custom-api-servers	2025-05-12 18:40:35 -05:00
dmahan93	92428fec8f	add gym taxi env	2025-05-09 19:05:01 -05:00
dmahan93	40b12dae60	run pre-commit on all files	2025-05-09 09:54:20 -05:00
dmahan93	b959c30ebf	Merge pull request #31 from NousResearch/fix-math-evals-due-to-updated-dataset fix olympiadbench due to upstream changes	2025-05-09 09:42:06 -05:00
dmahan93	e09ae8d3d3	fix olympiadbench due to upstream changes	2025-05-09 09:41:10 -05:00
hjc-puro	629d8c1731	Merge pull request #14 from NousResearch/2025-05-02-server-cli	2025-05-09 13:37:54 +08:00
dmahan93	70cf61c210	add custom server support	2025-05-08 12:01:49 -05:00
Artem Yatsenko	0f15be68a2	fix multimodal envs. add view_run_multimodal	2025-05-07 21:53:01 +00:00
edmund	2cb1ff0087	Removed mentions of NousResearch/DeepHermes-3-Llama-3-1B-Preview and swapped it for NousResearch/DeepHermes-3-Llama-3-3B-Preview I don't think there is a NousResearch/DeepHermes-3-Llama-3-1B-Preview	2025-05-07 18:03:17 +01:00
teknium1	d2dbab7d22	Add additional completions table info: metric, magnitude, and direction for ground truth	2025-05-04 03:30:50 -07:00
teknium1	c3b80832e9	lowering the defaults for fundamental finance env	2025-05-04 03:05:25 -07:00
hjc-puro	4348dd2ec1	hide complicated openai config override behavior somewhere else	2025-05-03 14:18:50 -07:00
teknium1	a2e36227aa	add metric logging	2025-05-02 02:34:17 -07:00
Dakota Nous	621d00dd80	first commit	2025-04-29 12:10:10 -07:00

27 commits