Commit graph

1589 commits

Author SHA1 Message Date
Jai Suphavadeeprasit
1b8ff075c4 adding tests 2026-03-13 17:23:59 -04:00
pre-commit-ci[bot]
6c564799bc [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
2026-03-13 21:02:08 +00:00
Jai Suphavadeeprasit
697c594c72 changes 2026-03-13 16:58:37 -04:00
pre-commit-ci[bot]
82964b6e48 [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
2026-03-13 20:13:35 +00:00
Jai Suphavadeeprasit
a8cdb53a4d address problems 2026-03-13 16:12:05 -04:00
Jai Suphavadeeprasit
322e7e6623 remove comments 2026-03-13 13:30:04 -04:00
pre-commit-ci[bot]
994e9c287d [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
2026-03-13 17:21:00 +00:00
Jai Suphavadeeprasit
a1b545c734 remove cross tokenization and fix location of configs 2026-03-13 13:19:28 -04:00
Jai Suphavadeeprasit
148a4fd5eb remove training code 2026-03-13 12:52:52 -04:00
Jai Suphavadeeprasit
862cd3667d clean logging 2026-03-13 12:38:52 -04:00
Jai Suphavadeeprasit
600c54f5f8 clean log 2026-03-13 12:12:33 -04:00
pre-commit-ci[bot]
d1b0dee8f7 [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
2026-03-13 15:14:09 +00:00
Jai Suphavadeeprasit
d8857eb69f investigating weird training issue 2026-03-13 11:06:02 -04:00
Jai Suphavadeeprasit
3df0e45659 investigating weird training issue 2026-03-13 11:06:02 -04:00
Jai Suphavadeeprasit
690e670e64 investigating weird training issue 2026-03-13 11:06:02 -04:00
Jai Suphavadeeprasit
a43b0b7e72 training kernel 2026-03-13 11:06:02 -04:00
Jai Suphavadeeprasit
7ec622a098 training ideas 2026-03-13 11:06:02 -04:00
Jai Suphavadeeprasit
c26432b963 training kernel 2026-03-13 11:06:02 -04:00
Jai Suphavadeeprasit
62ef2fcc2e training kernel 2026-03-13 11:06:02 -04:00
Jai Suphavadeeprasit
a54dfe7a13 tokenizer bug 2026-03-13 11:06:02 -04:00
Jai Suphavadeeprasit
c37516b289 tokenizer bug 2026-03-13 11:06:02 -04:00
Jai Suphavadeeprasit
fd5b426f9f tokenizer bug 2026-03-13 11:06:02 -04:00
Jai Suphavadeeprasit
34a39367dc tokenizer bug 2026-03-13 11:06:02 -04:00
Jai Suphavadeeprasit
8a348beccd tokenizer bug 2026-03-13 11:06:02 -04:00
Jai Suphavadeeprasit
2f371e03fc tokenizer bug 2026-03-13 11:06:02 -04:00
Jai Suphavadeeprasit
b457a678ce tokenizer bug 2026-03-13 11:06:02 -04:00
Jai Suphavadeeprasit
3a440f847c tokenizer bug 2026-03-13 11:06:02 -04:00
Jai Suphavadeeprasit
c275687fba tokenizer bug 2026-03-13 11:06:02 -04:00
Jai Suphavadeeprasit
f1cfc137ec tokenizer bug 2026-03-13 11:06:02 -04:00
Jai Suphavadeeprasit
78c0a6d082 tokenizer bug 2026-03-13 11:06:02 -04:00
Jai Suphavadeeprasit
98a5d3b334 testing config 2026-03-13 11:06:02 -04:00
Jai Suphavadeeprasit
82be871979 testing config 2026-03-13 11:06:02 -04:00
Jai Suphavadeeprasit
abba562d4a testing config 2026-03-13 11:06:02 -04:00
Jai Suphavadeeprasit
e79af5ff69 testing config 2026-03-13 11:06:02 -04:00
Jai Suphavadeeprasit
e84686b4fd remove enforce eager 2026-03-13 11:06:02 -04:00
Jai Suphavadeeprasit
057c9fe870 shorten worker timeout 2026-03-13 11:06:02 -04:00
Jai Suphavadeeprasit
d1fd89f992 non blocking test 2026-03-13 11:06:02 -04:00
Jai Suphavadeeprasit
09ad401995 sneaky bug logging 2026-03-13 11:06:02 -04:00
Jai Suphavadeeprasit
64794e7c72 sneaky bug 2026-03-13 11:06:00 -04:00
Jai Suphavadeeprasit
bb2736db4e next 2026-03-13 11:05:40 -04:00
Jai Suphavadeeprasit
4f33ab8bf4 apparently not so easy 2026-03-13 11:04:57 -04:00
Jai Suphavadeeprasit
81f90a67b5 forgot something easy 2026-03-13 11:04:57 -04:00
Jai Suphavadeeprasit
e5633527ba quicker training 2026-03-13 11:04:57 -04:00
Jai Suphavadeeprasit
985311eb94 trial 2026-03-13 11:04:57 -04:00
Jai Suphavadeeprasit
ad364ac771 increase timeout cause vllm is super slow all of a sudden 2026-03-13 11:04:57 -04:00
Jai Suphavadeeprasit
d5ca760f36 command change 2026-03-13 11:04:57 -04:00
Jai Suphavadeeprasit
530fed2877 testing set up 2026-03-13 11:04:57 -04:00
Jai Suphavadeeprasit
f44eb810bf teacher env init 2026-03-13 11:04:57 -04:00
dmahan93
c421582b6f
Merge pull request #408 from daspartho/verl-integration-fixes
fix: re-append stop string in math training path
2026-03-10 23:08:58 -05:00
dmahan93
1d78069b5d
Bump version from 0.3.0 to 0.4.0 2026-03-09 23:17:01 -05:00