pre-commit-ci[bot] | d1b0dee8f7 | [pre-commit.ci] auto fixes from pre-commit.com hooks; for more information, see https://pre-commit.ci | 2026-03-13 15:14:09 +00:00 |
Jai Suphavadeeprasit | 62ef2fcc2e | training kernel | 2026-03-13 11:06:02 -04:00 |
Jai Suphavadeeprasit | 530fed2877 | testing set up | 2026-03-13 11:04:57 -04:00 |
Jai Suphavadeeprasit | 8d29f49a58 | more terminal changes | 2026-03-02 14:40:55 -05:00 |
Jai Suphavadeeprasit | 585244559e | more readme changes | 2026-03-02 11:18:52 -05:00 |
Jai Suphavadeeprasit | d2ea8cd612 | remove KL | 2026-03-02 11:18:52 -05:00 |
Jai Suphavadeeprasit | 624b3cdabe | feedback fixes: shared layers + hard coded values + warmup steps | 2026-03-02 11:18:52 -05:00 |
pre-commit-ci[bot] | 5cfd1929f1 | [pre-commit.ci] auto fixes from pre-commit.com hooks; for more information, see https://pre-commit.ci | 2026-03-02 11:18:52 -05:00 |
Jai Suphavadeeprasit | 77c592c909 | logprobs | 2026-03-02 11:18:52 -05:00 |
Jai Suphavadeeprasit | 238602e855 | linting | 2026-03-02 11:18:52 -05:00 |
Jai Suphavadeeprasit | 0b61dd047a | cleanup | 2026-03-02 11:18:52 -05:00 |
Jai Suphavadeeprasit | 04f2850980 | python versioning problems | 2026-03-02 11:18:52 -05:00 |
Jai Suphavadeeprasit | 54bd9a5ae0 | logprob alignment | 2026-03-02 11:18:52 -05:00 |
Jai Suphavadeeprasit | 851f0b6e17 | debug | 2026-03-02 11:18:52 -05:00 |
Jai Suphavadeeprasit | 6a659a8c9d | KL | 2026-03-02 11:18:52 -05:00 |
Jai Suphavadeeprasit | c1bb4f33f0 | manual testing | 2026-03-02 11:18:52 -05:00 |
Jai Suphavadeeprasit | 75c4f5c853 | memory enhancements | 2026-03-02 11:18:52 -05:00 |
Jai Suphavadeeprasit | 99eaab3192 | metric calc diff | 2026-03-02 11:18:52 -05:00 |
Jai Suphavadeeprasit | e2b111fea0 | metric calc diff | 2026-03-02 11:18:52 -05:00 |
Jai Suphavadeeprasit | 122af0749a | readme updates | 2026-03-02 11:18:52 -05:00 |
Jai Suphavadeeprasit | 298b1cd782 | readme updates | 2026-03-02 11:18:52 -05:00 |
Jai Suphavadeeprasit | d23dfe75b4 | readme updates | 2026-03-02 11:18:52 -05:00 |
Jai Suphavadeeprasit | 6277bdd6d1 | numpy fix | 2026-03-02 11:18:52 -05:00 |
Jai Suphavadeeprasit | 35d4a0781b | logprob wandb | 2026-03-02 11:18:52 -05:00 |
Jai Suphavadeeprasit | 6833d4d820 | major refactor | 2026-03-02 11:18:52 -05:00 |