Andreas Koepf (aider)
|
8ccf077faf
|
docs: Add BibTeX citation for Re-ARC dataset in NOTICE.txt
|
2025-02-25 20:19:11 +01:00 |
|
vncntt
|
5f01049607
|
Add KnightsKnavesDataset (knights_knaves)
Adapted code from https://github.com/AlphaPav/mem-kk-logic/blob/main/data_prep/lib_kk.py
---------
Co-authored-by: Andreas Koepf (aider) <andreas.koepf@provisio.com>
|
2025-02-25 20:15:38 +01:00 |
|
Andreas Köpf
|
ed9292a7f4
|
Merge pull request #205 from open-thought/consolidate_eval_script
Consolidate eval scripts to have single eval.py
|
2025-02-25 19:45:05 +01:00 |
|
Andreas Koepf
|
791f16ec0f
|
use results folder name for eval results
|
2025-02-25 19:41:21 +01:00 |
|
joesharratt1229
|
ffe60ef112
|
finalised readme
|
2025-02-25 18:14:39 +00:00 |
|
joesharratt1229
|
56cc111ab3
|
Merge remote-tracking branch 'origin/consolidate_eval_script' into fix/eval
|
2025-02-25 18:10:07 +00:00 |
|
joesharratt1229
|
9ac6ea4eb2
|
changed structure
|
2025-02-25 16:32:42 +00:00 |
|
joesharratt1229
|
52c3c430b9
|
updated config and read me
|
2025-02-25 16:25:16 +00:00 |
|
joesharratt1229
|
7b39f4a3c7
|
updated read me
|
2025-02-25 15:51:31 +00:00 |
|
joesharratt1229
|
046c46c0bb
|
updated read me
|
2025-02-25 15:46:43 +00:00 |
|
Andreas Koepf
|
878f9bbc76
|
move r1 configs into r1 yaml/r1 subfolder
|
2025-02-25 16:24:30 +01:00 |
|
Andreas Koepf
|
e7ae82a831
|
consolidate eval scripts to have single eval.py
|
2025-02-25 16:13:22 +01:00 |
|
Andreas Köpf
|
bea806fe3c
|
Merge pull request #204 from open-thought/requirements_txt_eval
Add eval/requirements-eval.txt
|
2025-02-25 15:55:09 +01:00 |
|
Andreas Koepf
|
8291956554
|
add aiohttp & tenacity deps to requirements-eval.txt
|
2025-02-25 15:50:11 +01:00 |
|
Andreas Koepf (aider)
|
e48c1f82cd
|
docs: Update installation instructions in eval README
|
2025-02-25 15:37:09 +01:00 |
|
Andreas Koepf (aider)
|
a1b0a0414e
|
docs: Add dependency installation step to eval README setup instructions
|
2025-02-25 15:19:38 +01:00 |
|
Andreas Koepf
|
574edb5c5b
|
remove eval results from main repo
|
2025-02-25 11:02:02 +01:00 |
|
Andreas Koepf (aider)
|
205174c532
|
docs: Add info about reasoning-gym-eval repository for evaluation results
|
2025-02-25 10:53:21 +01:00 |
|
Zafir Stojanovski
|
5ed4395613
|
async
|
2025-02-24 22:07:35 +01:00 |
|
Oliver
|
fe502d5eb2
|
Register CodeIODataset
|
2025-02-24 18:28:35 +00:00 |
|
Oliver
|
43daec67ea
|
Initial scoring algo for codeio
|
2025-02-24 18:27:53 +00:00 |
|
Oliver
|
1795c8ea7a
|
Add tiny sample dataset & efficient sampling
|
2025-02-24 17:58:31 +00:00 |
|
Zafir Stojanovski
|
aac7175c69
|
generate inputs synchronously
|
2025-02-24 15:58:06 +01:00 |
|
Andreas Köpf
|
a4b767fa0e
|
Merge pull request #197 from open-thought/notice_txt_first_version
docs: Add NOTICE.txt file to project
|
2025-02-24 15:30:28 +01:00 |
|
Andreas Koepf
|
0bea658c94
|
docs: Add NOTICE.txt file to project
|
2025-02-24 12:57:28 +01:00 |
|
Andreas Köpf
|
3c589f99bd
|
Merge pull request #195 from open-thought/fix/eval
pinned provider to nebius fixes #192
|
2025-02-24 08:34:45 +01:00 |
|
joesharratt1229
|
cffbff935c
|
pinned provider to nebius
|
2025-02-24 05:01:22 +00:00 |
|
Oliver
|
7b5a12a92c
|
Remove outdated comment
|
2025-02-23 22:24:13 +00:00 |
|
Oliver
|
e07287e1f9
|
Add validation
|
2025-02-23 22:23:45 +00:00 |
|
Andreas Koepf
|
b5f6f7d753
|
bump version, update gallery
|
2025-02-23 22:36:39 +01:00 |
|
Andreas Köpf
|
d115655f0a
|
Merge pull request #191 from zafstojano/env/shortest-path
feat(env): Shortest Path
|
2025-02-23 22:28:43 +01:00 |
|
Andreas Koepf
|
45e452bff6
|
reduce size of default shortest_path maze grid
|
2025-02-23 22:27:17 +01:00 |
|
Oliver
|
342902683f
|
Merge branch 'main' into codeio-sampler
|
2025-02-23 20:28:06 +00:00 |
|
Oliver
|
f787069fd2
|
Add input prediction
|
2025-02-23 20:27:27 +00:00 |
|
Zafir Stojanovski
|
c5f37d5e9f
|
predict actual path
|
2025-02-23 18:24:23 +01:00 |
|
Andreas Köpf
|
eaa8f5253b
|
Merge pull request #194 from open-thought/190_fix_arc_1d_out_of_range
minor arc_1d tweaks
|
2025-02-23 16:40:09 +01:00 |
|
Andreas Koepf
|
469934d9b7
|
minor arc_1d tweaks
|
2025-02-23 16:37:40 +01:00 |
|
Andreas Köpf
|
8e4ed9bae9
|
Merge pull request #193 from open-thought/190_fix_arc_1d_out_of_range
Fix index out of range for arc_1d dataset
|
2025-02-23 13:20:08 +01:00 |
|
Andreas Koepf
|
ec3050a4f6
|
remove unnecessary checks, use tuples
|
2025-02-23 13:17:48 +01:00 |
|
Zafir Stojanovski
|
5109ed89c9
|
pre-commit
|
2025-02-23 13:11:31 +01:00 |
|
Andreas Koepf
|
ba56aa0092
|
add arc_1d size range test
|
2025-02-23 12:58:51 +01:00 |
|
Andreas Koepf
|
7a45b14a49
|
fix index out of range of arc_1d dataset (#190)
|
2025-02-23 12:51:41 +01:00 |
|
Zafir Stojanovski
|
97b3097984
|
shortest path
|
2025-02-23 11:25:00 +01:00 |
|
Zafir Stojanovski
|
96dad6c7f3
|
sampling code
|
2025-02-23 00:40:11 +01:00 |
|
Andreas Koepf
|
e4102a44f6
|
dev minor version one ahead of PyPI released version
|
2025-02-22 16:54:05 +01:00 |
|
Andreas Köpf
|
7a1e387d6e
|
Merge pull request #176 from olliestanley/codeio-experiments
Experiments with CodeI/O techniques for synthesising reasoning data
|
2025-02-22 16:24:17 +01:00 |
|
Zafir Stojanovski
|
e04ca72809
|
greedy coreset sampling
|
2025-02-22 16:15:14 +01:00 |
|
Oliver
|
e718168428
|
Draft CodeIO-derived reasoning problems dataset
|
2025-02-22 00:56:52 +00:00 |
|
Oliver
|
563480329e
|
Outline CodeIO dataset classes
|
2025-02-22 00:21:17 +00:00 |
|
Zafir Stojanovski
|
6bbec2ac4e
|
exploratory notebook
|
2025-02-22 00:46:33 +01:00 |
|