Jai Suphavadeeprasit
|
148a4fd5eb
|
remove training code
|
2026-03-13 12:52:52 -04:00 |
|
Jai Suphavadeeprasit
|
530fed2877
|
testing set up
|
2026-03-13 11:04:57 -04:00 |
|
pre-commit-ci[bot]
|
91afc9e46e
|
[pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
|
2026-03-02 11:18:52 -05:00 |
|
Jai Suphavadeeprasit
|
d2ea8cd612
|
remove KL
|
2026-03-02 11:18:52 -05:00 |
|
Jai Suphavadeeprasit
|
624b3cdabe
|
feedback fixes: shared layers + hard coded values + warmup steps
|
2026-03-02 11:18:52 -05:00 |
|
Jai Suphavadeeprasit
|
fa22bf58d1
|
model layer stuff
|
2026-03-02 11:18:52 -05:00 |
|
Jai Suphavadeeprasit
|
e2e8268f2a
|
cleanup 3
|
2026-03-02 11:18:52 -05:00 |
|
Jai Suphavadeeprasit
|
90281f5993
|
lora restart saving gradient changes
|
2026-03-02 11:18:52 -05:00 |
|
Jai Suphavadeeprasit
|
eb123f9596
|
ditching lora nccl
|
2026-03-02 11:18:52 -05:00 |
|
Jai Suphavadeeprasit
|
c33f9170c3
|
nccl loras
|
2026-03-02 11:18:52 -05:00 |
|
pre-commit-ci[bot]
|
5cfd1929f1
|
[pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
|
2026-03-02 11:18:52 -05:00 |
|
Jai Suphavadeeprasit
|
672cdbaea8
|
cleanup
|
2026-03-02 11:18:52 -05:00 |
|
Jai Suphavadeeprasit
|
c1bb4f33f0
|
manual testing
|
2026-03-02 11:18:52 -05:00 |
|
Jai Suphavadeeprasit
|
8a9e6945ee
|
testing 3
|
2026-03-02 11:18:52 -05:00 |
|
Jai Suphavadeeprasit
|
75c4f5c853
|
memory enhancements
|
2026-03-02 11:18:52 -05:00 |
|
Jai Suphavadeeprasit
|
6833d4d820
|
major refactor
|
2026-03-02 11:18:52 -05:00 |
|