Commit graph

21 commits

Author SHA1 Message Date
pre-commit-ci[bot]
60fb6cae11 [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
2026-02-20 04:58:47 +00:00
Jai Suphavadeeprasit
ccdd5a1ca6 linting 2026-02-19 23:57:47 -05:00
Jai Suphavadeeprasit
527433b5bc change OPD style 2026-02-19 17:08:27 -05:00
Jai Suphavadeeprasit
33f5696171 Merge branch 'pipelineRL' into OnPolicyDistillation 2026-02-19 16:39:21 -05:00
Jai Suphavadeeprasit
bc0f9ee625 debug changes 2026-02-17 08:15:07 -05:00
Jai Suphavadeeprasit
0e81c62e90 on policy changes 2026-02-16 17:39:37 -05:00
Jai Suphavadeeprasit
becadb54b0 Fix math_server_zero.py to support CLI OpenAI arguments
Change ServerBaseline to APIServerConfig in config_init() so that
--openai.base_url and other CLI arguments work for on-policy distillation.

Co-authored-by: Cursor <cursoragent@cursor.com>
2026-02-16 17:18:01 -05:00
Jai Suphavadeeprasit
950be6f0d4 math server 2026-02-13 11:26:25 -05:00
pre-commit-ci[bot]
11f495a381 [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
2026-02-13 11:26:25 -05:00
Jai Suphavadeeprasit
33505fe981 math zero work arounds 2026-02-13 11:26:25 -05:00
Jai Suphavadeeprasit
407a22ba12 Save the eval to the disk 2026-02-13 11:25:49 -05:00
Dakota
e6ac3abdcb add managed vllm server 2025-11-07 13:06:49 -06:00
Dakota
5d6d6bb0dc add docs :) 2025-10-29 11:26:43 -05:00
pre-commit-ci[bot]
0d80da5146 [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
2025-10-24 20:10:29 +00:00
dmahan93
7bf4cfbf80 add managed server to make grabbing logprobs easier w/ tokenized items 2025-10-24 13:09:46 -07:00
pre-commit-ci[bot]
1e6a745491 [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
2025-10-16 17:39:04 +00:00
Dakota
c36ec29656 add sglang specific token level logprob handling and server manager/baseline logprob/token fn 2025-10-16 12:38:03 -05:00
Shannon Sands
581d29ff92 Fix import sorting issues manually
- Move wandb imports to proper third-party import section
- Remove extra blank lines
- Skip pre-commit hooks to resolve CI workflow failures
2025-05-23 15:25:40 +10:00
Shannon Sands
93a5da9e32 Fix linting issues across repository
- Install pre-commit hooks properly
- Fix trailing whitespace and end-of-file issues in metric card generator README
- Fix import sorting across multiple files to comply with isort --profile black
2025-05-23 15:17:27 +10:00
dmahan93
e09ae8d3d3 fix olympiadbench due to upstream changes 2025-05-09 09:41:10 -05:00
Dakota Nous
621d00dd80 first commit 2025-04-29 12:10:10 -07:00