Oliver Stanley
f490b9f760
Tolerant scoring for CodeI/O based on edit distances ( #277 )
...
* add zss dep
* codeio edit distance-based scoring
* edit distance tweaks
2025-03-07 22:49:35 +01:00
Oliver Stanley
d1e505a8e9
First version of CodeI/O reasoning data ( #264 )
...
* notebook for prepping first set of raw code files
* updated codeio processing notebook for repo-level processing
* fix for edge case in codeio scoring
* Add reformat notebook
* filtering pass
* add non-determinism filtering
* Tweak CodeIODataset & include first real data
* add basic codeio test, metadata
2025-03-05 22:34:11 +01:00
Oliver
5fa06c961f
Fix
2025-02-26 11:17:23 +00:00
Oliver
81c77a495d
Add note on code execution to CodeIODataset
2025-02-25 22:39:06 +00:00
Oliver
0252dd905f
Move data file & load into memory on first object creation
2025-02-25 22:36:38 +00:00
Oliver
fe502d5eb2
Register CodeIODataset
2025-02-24 18:28:35 +00:00
Oliver
43daec67ea
Initial scoring algo for codeio
2025-02-24 18:27:53 +00:00
Oliver
1795c8ea7a
Add tiny sample dataset & efficient sampling
2025-02-24 17:58:31 +00:00
Oliver
7b5a12a92c
Remove outdated comment
2025-02-23 22:24:13 +00:00
Oliver
e07287e1f9
Add validation
2025-02-23 22:23:45 +00:00
Oliver
f787069fd2
Add input prediction
2025-02-23 20:27:27 +00:00
Oliver
e718168428
Draft CodeIO-derived reasoning problems dataset
2025-02-22 00:56:52 +00:00
Oliver
563480329e
Outline CodeIO dataset classes
2025-02-22 00:21:17 +00:00