Andreas Koepf
|
0a660a3409
|
ignore single whitespace at beginning and end of answer, use reward = len(oracle_answer) / len(answer)
|
2025-02-14 15:40:12 +01:00 |
|
Andreas Koepf
|
5a88cf2529
|
add simple dataset gallery generation script
|
2025-01-30 22:30:26 +01:00 |
|
Andreas Koepf (aider)
|
e2d3f4b4e6
|
feat: Add seed wrapping at 2^32 to prevent unbounded growth
|
2025-01-30 22:05:14 +01:00 |
|
Andreas Koepf (aider)
|
dc54a7672f
|
refactor: Use self.dataset.seed directly for chunk seed generation
|
2025-01-30 22:02:21 +01:00 |
|
Andreas Koepf (aider)
|
66f99be4a3
|
feat: Add score_answer method to ReseedingDataset
|
2025-01-30 21:59:50 +01:00 |
|
Andreas Koepf (aider)
|
6d59648264
|
feat: Add ReseedingDataset wrapper for infinite procedural datasets
|
2025-01-30 21:56:43 +01:00 |
|
Andreas Koepf
|
c196d622e0
|
extract answer from last answer tag
|
2025-01-28 16:37:19 +00:00 |
|
Andreas Koepf
|
cc0312e446
|
add first example with OpenRLHF
|
2025-01-28 14:40:06 +00:00 |
|
Andreas Koepf
|
e9549f2a63
|
pass config to ProceduralDataset base
|
2025-01-25 00:23:05 +01:00 |
|
Andreas Koepf (aider)
|
2befe97151
|
feat: Add dataset factory with registration and creation functions
|
2025-01-25 00:00:22 +01:00 |
|
Andreas Koepf
|
20069b2a7d
|
formatting
|
2025-01-24 10:34:07 +01:00 |
|
Andreas Koepf (aider)
|
719d760eea
|
feat: Add return type annotation to ProceduralDataset.__next__()
|
2025-01-24 10:16:27 +01:00 |
|
Andreas Koepf (aider)
|
2a6a9655d7
|
feat: Add Sized and Iterable base classes to ProceduralDataset
|
2025-01-24 10:14:42 +01:00 |
|
Andreas Koepf (aider)
|
018bdcef6f
|
feat: Add ProceduralDataset abstract base class for dataset generators
|
2025-01-24 09:51:04 +01:00 |
|