Jai Suphavadeeprasit
|
ce85c7d95e
|
H100 bug fixes
|
2026-03-02 11:18:52 -05:00 |
|
Jai Suphavadeeprasit
|
624b3cdabe
|
feedback fixes: shared layers + hard coded values + warmup steps
|
2026-03-02 11:18:52 -05:00 |
|
Jai Suphavadeeprasit
|
fa22bf58d1
|
model layer stuff
|
2026-03-02 11:18:52 -05:00 |
|
Jai Suphavadeeprasit
|
f29a3d04fa
|
gradient flow fix
|
2026-03-02 11:18:52 -05:00 |
|
Jai Suphavadeeprasit
|
dc9df00570
|
vllm restart 2
|
2026-03-02 11:18:52 -05:00 |
|
pre-commit-ci[bot]
|
5cfd1929f1
|
[pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
|
2026-03-02 11:18:52 -05:00 |
|
Jai Suphavadeeprasit
|
8f1f8acbde
|
linting
|
2026-03-02 11:18:52 -05:00 |
|
Jai Suphavadeeprasit
|
238602e855
|
linting
|
2026-03-02 11:18:52 -05:00 |
|
Jai Suphavadeeprasit
|
672cdbaea8
|
cleanup
|
2026-03-02 11:18:52 -05:00 |
|
Jai Suphavadeeprasit
|
04652fd97c
|
checkpointing fixes
|
2026-03-02 11:18:52 -05:00 |
|
Jai Suphavadeeprasit
|
b9414e4076
|
rotary embeddings bug
|
2026-03-02 11:18:52 -05:00 |
|
Jai Suphavadeeprasit
|
122af0749a
|
readme updates
|
2026-03-02 11:18:52 -05:00 |
|
Jai Suphavadeeprasit
|
2790b9bdb6
|
readme updates
|
2026-03-02 11:18:52 -05:00 |
|
Jai Suphavadeeprasit
|
e89d26d7a0
|
logprob wandb
|
2026-03-02 11:18:52 -05:00 |
|
Jai Suphavadeeprasit
|
35d4a0781b
|
logprob wandb
|
2026-03-02 11:18:52 -05:00 |
|
Jai Suphavadeeprasit
|
6833d4d820
|
major refactor
|
2026-03-02 11:18:52 -05:00 |
|