reasoning-gym/reasoning_gym/data/codeio-hq.jsonl.gz
Roman Machacek 2c52f33c3a
CodeIO HQ Dataset (#382)
* ADD: CodeIO high quality dataset

Based on the CodeI/O dataset. Annotated using Qwen-Coder and filtered on various metrics, yielding a high-quality dataset that keeps approximately 50% of the original data.

* ADD: Compressed version

* Delete pure json version
2025-04-01 22:34:33 +02:00

2.5 MiB
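
Since the dataset is stored as gzip-compressed JSON Lines, it can be read without unpacking it to disk first. The following is a minimal sketch using only the Python standard library; the helper name `iter_jsonl_gz` is hypothetical and not part of reasoning-gym, and the record schema is not assumed, so the example only lists the keys of the first record.

```python
import gzip
import json
from typing import Any, Iterator


def iter_jsonl_gz(path: str) -> Iterator[Any]:
    """Yield one parsed JSON value per non-empty line of a gzip-compressed JSONL file."""
    # "rt" opens the gzip stream in text mode so lines are decoded as UTF-8 strings.
    with gzip.open(path, "rt", encoding="utf-8") as handle:
        for line in handle:
            line = line.strip()
            if line:  # skip blank lines rather than failing on them
                yield json.loads(line)


if __name__ == "__main__":
    # Assumption: the script is run from the repository root, so this relative path resolves.
    dataset_path = "reasoning_gym/data/codeio-hq.jsonl.gz"
    first_record = next(iter_jsonl_gz(dataset_path))
    # The field names are not documented in the commit, so only inspect what is there.
    if isinstance(first_record, dict):
        print(sorted(first_record.keys()))
```

Iterating lazily keeps memory use low, because only one record is decoded at a time instead of loading the whole file.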