Jai Suphavadeeprasit
|
1b8ff075c4
|
adding tests
|
2026-03-13 17:23:59 -04:00 |
|
pre-commit-ci[bot]
|
6c564799bc
|
[pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
|
2026-03-13 21:02:08 +00:00 |
|
Jai Suphavadeeprasit
|
697c594c72
|
changes
|
2026-03-13 16:58:37 -04:00 |
|
pre-commit-ci[bot]
|
82964b6e48
|
[pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
|
2026-03-13 20:13:35 +00:00 |
|
Jai Suphavadeeprasit
|
a8cdb53a4d
|
address problems
|
2026-03-13 16:12:05 -04:00 |
|
Jai Suphavadeeprasit
|
322e7e6623
|
remove comments
|
2026-03-13 13:30:04 -04:00 |
|
pre-commit-ci[bot]
|
994e9c287d
|
[pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
|
2026-03-13 17:21:00 +00:00 |
|
Jai Suphavadeeprasit
|
a1b545c734
|
remove cross tokenization and fix location of configs
|
2026-03-13 13:19:28 -04:00 |
|
Jai Suphavadeeprasit
|
148a4fd5eb
|
remove training code
|
2026-03-13 12:52:52 -04:00 |
|
Jai Suphavadeeprasit
|
862cd3667d
|
clean logging
|
2026-03-13 12:38:52 -04:00 |
|
Jai Suphavadeeprasit
|
600c54f5f8
|
clean log
|
2026-03-13 12:12:33 -04:00 |
|
pre-commit-ci[bot]
|
d1b0dee8f7
|
[pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
|
2026-03-13 15:14:09 +00:00 |
|
Jai Suphavadeeprasit
|
d8857eb69f
|
investigating weird training issue
|
2026-03-13 11:06:02 -04:00 |
|
Jai Suphavadeeprasit
|
3df0e45659
|
investigating weird training issue
|
2026-03-13 11:06:02 -04:00 |
|
Jai Suphavadeeprasit
|
690e670e64
|
investigating weird training issue
|
2026-03-13 11:06:02 -04:00 |
|
Jai Suphavadeeprasit
|
a43b0b7e72
|
training kernel
|
2026-03-13 11:06:02 -04:00 |
|
Jai Suphavadeeprasit
|
7ec622a098
|
training ideas
|
2026-03-13 11:06:02 -04:00 |
|
Jai Suphavadeeprasit
|
c26432b963
|
training kernel
|
2026-03-13 11:06:02 -04:00 |
|
Jai Suphavadeeprasit
|
62ef2fcc2e
|
training kernel
|
2026-03-13 11:06:02 -04:00 |
|
Jai Suphavadeeprasit
|
a54dfe7a13
|
tokenizer bug
|
2026-03-13 11:06:02 -04:00 |
|
Jai Suphavadeeprasit
|
c37516b289
|
tokenizer bug
|
2026-03-13 11:06:02 -04:00 |
|
Jai Suphavadeeprasit
|
fd5b426f9f
|
tokenizer bug
|
2026-03-13 11:06:02 -04:00 |
|
Jai Suphavadeeprasit
|
34a39367dc
|
tokenizer bug
|
2026-03-13 11:06:02 -04:00 |
|
Jai Suphavadeeprasit
|
8a348beccd
|
tokenizer bug
|
2026-03-13 11:06:02 -04:00 |
|
Jai Suphavadeeprasit
|
2f371e03fc
|
tokenizer bug
|
2026-03-13 11:06:02 -04:00 |
|
Jai Suphavadeeprasit
|
b457a678ce
|
tokenizer bug
|
2026-03-13 11:06:02 -04:00 |
|
Jai Suphavadeeprasit
|
3a440f847c
|
tokenizer bug
|
2026-03-13 11:06:02 -04:00 |
|
Jai Suphavadeeprasit
|
c275687fba
|
tokenizer bug
|
2026-03-13 11:06:02 -04:00 |
|
Jai Suphavadeeprasit
|
f1cfc137ec
|
tokenizer bug
|
2026-03-13 11:06:02 -04:00 |
|
Jai Suphavadeeprasit
|
78c0a6d082
|
tokenizer bug
|
2026-03-13 11:06:02 -04:00 |
|
Jai Suphavadeeprasit
|
98a5d3b334
|
testing config
|
2026-03-13 11:06:02 -04:00 |
|
Jai Suphavadeeprasit
|
82be871979
|
testing config
|
2026-03-13 11:06:02 -04:00 |
|
Jai Suphavadeeprasit
|
abba562d4a
|
testing config
|
2026-03-13 11:06:02 -04:00 |
|
Jai Suphavadeeprasit
|
e79af5ff69
|
testing config
|
2026-03-13 11:06:02 -04:00 |
|
Jai Suphavadeeprasit
|
e84686b4fd
|
remove enforce eager
|
2026-03-13 11:06:02 -04:00 |
|
Jai Suphavadeeprasit
|
057c9fe870
|
shorten worker timeout
|
2026-03-13 11:06:02 -04:00 |
|
Jai Suphavadeeprasit
|
d1fd89f992
|
non blocking test
|
2026-03-13 11:06:02 -04:00 |
|
Jai Suphavadeeprasit
|
09ad401995
|
sneaky bug logging
|
2026-03-13 11:06:02 -04:00 |
|
Jai Suphavadeeprasit
|
64794e7c72
|
sneaky bug
|
2026-03-13 11:06:00 -04:00 |
|
Jai Suphavadeeprasit
|
bb2736db4e
|
next
|
2026-03-13 11:05:40 -04:00 |
|
Jai Suphavadeeprasit
|
4f33ab8bf4
|
apparently not so easy
|
2026-03-13 11:04:57 -04:00 |
|
Jai Suphavadeeprasit
|
81f90a67b5
|
forgot something easy
|
2026-03-13 11:04:57 -04:00 |
|
Jai Suphavadeeprasit
|
e5633527ba
|
quicker training
|
2026-03-13 11:04:57 -04:00 |
|
Jai Suphavadeeprasit
|
985311eb94
|
trial
|
2026-03-13 11:04:57 -04:00 |
|
Jai Suphavadeeprasit
|
ad364ac771
|
increase timeout cause vllm is super slow all of a sudden
|
2026-03-13 11:04:57 -04:00 |
|
Jai Suphavadeeprasit
|
d5ca760f36
|
command change
|
2026-03-13 11:04:57 -04:00 |
|
Jai Suphavadeeprasit
|
530fed2877
|
testing set up
|
2026-03-13 11:04:57 -04:00 |
|
Jai Suphavadeeprasit
|
f44eb810bf
|
teacher env init
|
2026-03-13 11:04:57 -04:00 |
|
dmahan93
|
c421582b6f
|
Merge pull request #408 from daspartho/verl-integration-fixes
fix: re-append stop string in math training path
|
2026-03-10 23:08:58 -05:00 |
|
dmahan93
|
1d78069b5d
|
Bump version from 0.3.0 to 0.4.0
|
2026-03-09 23:17:01 -05:00 |
|