Commit graph

16 commits

Author SHA1 Message Date
Jai Suphavadeeprasit
148a4fd5eb remove training code 2026-03-13 12:52:52 -04:00
Jai Suphavadeeprasit
530fed2877 testing set up 2026-03-13 11:04:57 -04:00
pre-commit-ci[bot]
91afc9e46e [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
2026-03-02 11:18:52 -05:00
Jai Suphavadeeprasit
d2ea8cd612 remove KL 2026-03-02 11:18:52 -05:00
Jai Suphavadeeprasit
624b3cdabe feedback fixes: shared layers + hard coded values + warmup steps 2026-03-02 11:18:52 -05:00
Jai Suphavadeeprasit
fa22bf58d1 model layer stuff 2026-03-02 11:18:52 -05:00
Jai Suphavadeeprasit
e2e8268f2a cleanup 3 2026-03-02 11:18:52 -05:00
Jai Suphavadeeprasit
90281f5993 lora restart saving gradient changes 2026-03-02 11:18:52 -05:00
Jai Suphavadeeprasit
eb123f9596 ditching lora nccl 2026-03-02 11:18:52 -05:00
Jai Suphavadeeprasit
c33f9170c3 nccl loras 2026-03-02 11:18:52 -05:00
pre-commit-ci[bot]
5cfd1929f1 [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
2026-03-02 11:18:52 -05:00
Jai Suphavadeeprasit
672cdbaea8 cleanup 2026-03-02 11:18:52 -05:00
Jai Suphavadeeprasit
c1bb4f33f0 manual testing 2026-03-02 11:18:52 -05:00
Jai Suphavadeeprasit
8a9e6945ee testing 3 2026-03-02 11:18:52 -05:00
Jai Suphavadeeprasit
75c4f5c853 memory enhancements 2026-03-02 11:18:52 -05:00
Jai Suphavadeeprasit
6833d4d820 major refactor 2026-03-02 11:18:52 -05:00