BLEUBERI/eval/WildBench/eval_results/v2.0625/score.v2
2025-06-04 20:36:43 +00:00
eval=claude-3-5-sonnet-20240620 initial commit 2025-06-04 20:36:43 +00:00
eval=gpt-4.1-mini initial commit 2025-06-04 20:36:43 +00:00
eval=gpt-4o-2024-05-13 initial commit 2025-06-04 20:36:43 +00:00