Commit graph

25 commits

Author SHA1 Message Date
pre-commit-ci[bot]
d1b0dee8f7 [pre-commit.ci] auto fixes from pre-commit.com hooks (for more information, see https://pre-commit.ci) 2026-03-13 15:14:09 +00:00
Jai Suphavadeeprasit
62ef2fcc2e training kernel 2026-03-13 11:06:02 -04:00
Jai Suphavadeeprasit
530fed2877 testing set up 2026-03-13 11:04:57 -04:00
Jai Suphavadeeprasit
8d29f49a58 more terminal changes 2026-03-02 14:40:55 -05:00
Jai Suphavadeeprasit
585244559e more readme changes 2026-03-02 11:18:52 -05:00
Jai Suphavadeeprasit
d2ea8cd612 remove KL 2026-03-02 11:18:52 -05:00
Jai Suphavadeeprasit
624b3cdabe feedback fixes: shared layers + hard coded values + warmup steps 2026-03-02 11:18:52 -05:00
pre-commit-ci[bot]
5cfd1929f1 [pre-commit.ci] auto fixes from pre-commit.com hooks (for more information, see https://pre-commit.ci) 2026-03-02 11:18:52 -05:00
Jai Suphavadeeprasit
77c592c909 logprobs 2026-03-02 11:18:52 -05:00
Jai Suphavadeeprasit
238602e855 linting 2026-03-02 11:18:52 -05:00
Jai Suphavadeeprasit
0b61dd047a cleanup 2026-03-02 11:18:52 -05:00
Jai Suphavadeeprasit
04f2850980 python versioning problems 2026-03-02 11:18:52 -05:00
Jai Suphavadeeprasit
54bd9a5ae0 logprob alignment 2026-03-02 11:18:52 -05:00
Jai Suphavadeeprasit
851f0b6e17 debug 2026-03-02 11:18:52 -05:00
Jai Suphavadeeprasit
6a659a8c9d KL 2026-03-02 11:18:52 -05:00
Jai Suphavadeeprasit
c1bb4f33f0 manual testing 2026-03-02 11:18:52 -05:00
Jai Suphavadeeprasit
75c4f5c853 memory enhancements 2026-03-02 11:18:52 -05:00
Jai Suphavadeeprasit
99eaab3192 metric calc diff 2026-03-02 11:18:52 -05:00
Jai Suphavadeeprasit
e2b111fea0 metric calc diff 2026-03-02 11:18:52 -05:00
Jai Suphavadeeprasit
122af0749a readme updates 2026-03-02 11:18:52 -05:00
Jai Suphavadeeprasit
298b1cd782 readme updates 2026-03-02 11:18:52 -05:00
Jai Suphavadeeprasit
d23dfe75b4 readme updates 2026-03-02 11:18:52 -05:00
Jai Suphavadeeprasit
6277bdd6d1 numpy fix 2026-03-02 11:18:52 -05:00
Jai Suphavadeeprasit
35d4a0781b logprob wandb 2026-03-02 11:18:52 -05:00
Jai Suphavadeeprasit
6833d4d820 major refactor 2026-03-02 11:18:52 -05:00