First version of CodeI/O reasoning data (#264)

* notebook for prepping first set of raw code files
* updated codeio processing notebook for repo-level processing
* fix for edge case in codeio scoring
* Add reformat notebook
* filtering pass
* add non-determinism filtering
* Tweak CodeIODataset & include first real data
* add basic codeio test, metadata
This commit is contained in:
Oliver Stanley 2025-03-05 21:34:11 +00:00 committed by GitHub
parent 7458dbc95d
commit 3286a68361
6 changed files with 1061 additions and 113 deletions

File diff suppressed because one or more lines are too long