BLEUBERI/eval/README.md
2025-06-04 20:36:43 +00:00

To display benchmark results for the models reported in the paper, run `show_eval_results.sh`.

To run a model on all benchmarks, see `run_all_evals.sh`.