Commit graph

14 commits

Author SHA1 Message Date
Oliver Stanley
f0588307d7 Add a few new CodeI/O samples, resolve numeric answer scoring bug (#332)
* add handful of codeio samples

* scoring fix
2025-03-11 23:55:33 +01:00
Oliver Stanley
35c32cd5e7 Tolerant scoring for CodeI/O based on edit distances (#277)
* add zss dep

* codeio edit distance-based scoring

* edit distance tweaks
2025-03-07 22:49:35 +01:00
Oliver Stanley
3286a68361 First version of CodeI/O reasoning data (#264)
* notebook for prepping first set of raw code files
* updated codeio processing notebook for repo-level processing
* fix for edge case in codeio scoring
* Add reformat notebook
* filtering pass
* add non-determinism filtering
* Tweak CodeIODataset & include first real data
* add basic codeio test, metadata
2025-03-05 22:34:11 +01:00
Oliver
8f05e6108c Fix 2025-02-26 11:17:23 +00:00
Oliver
4bdb8c7d6b Add note on code execution to CodeIODataset 2025-02-25 22:39:06 +00:00
Oliver
ef2f8d1978 Move data file & load into memory on first object creation 2025-02-25 22:36:38 +00:00
Oliver
f895a458c7 Register CodeIODataset 2025-02-24 18:28:35 +00:00
Oliver
efbcfb6eed Initial scoring algo for codeio 2025-02-24 18:27:53 +00:00
Oliver
5a222a398b Add tiny sample dataset & efficient sampling 2025-02-24 17:58:31 +00:00
Oliver
7ff162e9bb Remove outdated comment 2025-02-23 22:24:13 +00:00
Oliver
c0923a6fb8 Add validation 2025-02-23 22:23:45 +00:00
Oliver
40d7dfdb5f Add input prediction 2025-02-23 20:27:27 +00:00
Oliver
489dea7267 Draft CodeIO-derived reasoning problems dataset 2025-02-22 00:56:52 +00:00
Oliver
378cba2de1 Outline CodeIO dataset classes 2025-02-22 00:21:17 +00:00