Commit graph

65 commits

Author SHA1 Message Date
Jai Suphavadeeprasit
a6faaee71d vllm weight bridge 2026-02-13 11:26:25 -05:00
pre-commit-ci[bot]
e1aca5ecf5 [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
2026-02-13 11:26:25 -05:00
Jai Suphavadeeprasit
f0468e620e single copy now working as expected 2026-02-13 11:26:25 -05:00
Jai Suphavadeeprasit
dfa87df1f1 prints 2026-02-13 11:26:25 -05:00
Jai Suphavadeeprasit
bcd9bc6e20 clearing more bloat 2026-02-13 11:26:25 -05:00
Jai Suphavadeeprasit
178ad95fe7 Bloat reduction 2026-02-13 11:26:25 -05:00
Jai Suphavadeeprasit
802203a2d3 readme update 2026-02-13 11:26:25 -05:00
Jai Suphavadeeprasit
0f7713a575 clean up 2026-02-13 11:26:25 -05:00
Jai Suphavadeeprasit
b1eccaa597 fused memory 2026-02-13 11:26:25 -05:00
Jai Suphavadeeprasit
ffcd9367f8 other changes 2026-02-13 11:26:25 -05:00
Jai Suphavadeeprasit
99488ab3fe adjusting buffers 2026-02-13 11:26:25 -05:00
Jai Suphavadeeprasit
bf50ed37d9 buffer efficiency 2026-02-13 11:26:25 -05:00
Jai Suphavadeeprasit
dff4065982 debugging 2026-02-13 11:26:25 -05:00
Jai Suphavadeeprasit
7eb5381262 pass all the informaiton 2026-02-13 11:26:25 -05:00
Jai Suphavadeeprasit
b12d0575e1 unsure 2026-02-13 11:26:25 -05:00
Jai Suphavadeeprasit
5921684e9d pytorch underbelly 2026-02-13 11:26:25 -05:00
Jai Suphavadeeprasit
27df785ad5 patched 2026-02-13 11:26:25 -05:00
Jai Suphavadeeprasit
2225b4623f main changes 2026-02-13 11:26:25 -05:00
Jai Suphavadeeprasit
2dc1c2a981 pipeline changes 2026-02-13 11:26:25 -05:00
Jai Suphavadeeprasit
f3c6275263 streamline process 2026-02-13 11:26:25 -05:00
Jai Suphavadeeprasit
19b3116b84 serialization errors 2026-02-13 11:26:25 -05:00
Jai Suphavadeeprasit
f46e5c562d single copy 1 2026-02-13 11:26:25 -05:00
Jai Suphavadeeprasit
3de03d6db3 single copy 2026-02-13 11:26:25 -05:00
Jai Suphavadeeprasit
5ba06c7d4a threading 2026-02-13 11:26:25 -05:00
Jai Suphavadeeprasit
ca1ec60869 improve default 2026-02-13 11:26:25 -05:00
Jai Suphavadeeprasit
eed13670de better debugging 2026-02-13 11:26:25 -05:00
Jai Suphavadeeprasit
3ac4a64f6f patching problem 2026-02-13 11:26:25 -05:00
Jai Suphavadeeprasit
5af1a4a974 basic changes 2026-02-13 11:26:25 -05:00
Jai Suphavadeeprasit
007f4f275d changes 2026-02-13 11:26:25 -05:00
Jai Suphavadeeprasit
80f67f979a error handling 2026-02-13 11:26:25 -05:00
Jai Suphavadeeprasit
9e53076a82 param locations update 2026-02-13 11:26:25 -05:00
Jai Suphavadeeprasit
197fce640f daemon errors 2026-02-13 11:26:25 -05:00
Jai Suphavadeeprasit
3995e0af7d monkey patch fixes 2026-02-13 11:26:25 -05:00
Jai Suphavadeeprasit
7b975f3adc changes based on torchtitan 2 2026-02-13 11:26:25 -05:00
Jai Suphavadeeprasit
53b29472b4 changes based on torchtitan 2026-02-13 11:26:25 -05:00
Jai Suphavadeeprasit
078dd4a333 Cleanup 2026-02-13 11:26:25 -05:00
Jai Suphavadeeprasit
39e94c4278 weight updates async 2026-02-13 11:26:25 -05:00
Jai Suphavadeeprasit
b3874b658a vllm underlying weights 2026-02-13 11:26:25 -05:00
Jai Suphavadeeprasit
b0d35be8a4 IPC updates 2026-02-13 11:26:25 -05:00
Jai Suphavadeeprasit
e278978fa1 health changes 2026-02-13 11:26:25 -05:00
Jai Suphavadeeprasit
f51ae77f54 add missing parameter 2026-02-13 11:26:25 -05:00
Jai Suphavadeeprasit
d6f389f86f readme updates 2026-02-13 11:26:25 -05:00
Jai Suphavadeeprasit
88ccaa0ea5 standardize the training approach 2026-02-13 11:26:25 -05:00
Jai Suphavadeeprasit
ebdbc54842 tracking 2026-02-13 11:26:25 -05:00
Jai Suphavadeeprasit
9498d9576f training bug 2026-02-13 11:26:25 -05:00
Jai Suphavadeeprasit
d978eff127 smol changes 2026-02-13 11:26:25 -05:00
Jai Suphavadeeprasit
adc3ae712b design choice - LoRA and shared vLLM through the bridge 2026-02-13 11:26:25 -05:00
Jai Suphavadeeprasit
22648bd912 gradient checkpointing issue for LoRAs 2026-02-13 11:26:25 -05:00
Jai Suphavadeeprasit
1e7b7cf841 stuff 2026-02-13 11:26:25 -05:00
Jai Suphavadeeprasit
db7414329b generate endpoint with logprobs 2026-02-13 11:26:25 -05:00