Jai Suphavadeeprasit
|
862cd3667d
|
clean logging
|
2026-03-13 12:38:52 -04:00 |
|
Jai Suphavadeeprasit
|
600c54f5f8
|
clean log
|
2026-03-13 12:12:33 -04:00 |
|
pre-commit-ci[bot]
|
d1b0dee8f7
|
[pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
|
2026-03-13 15:14:09 +00:00 |
|
Jai Suphavadeeprasit
|
d8857eb69f
|
investigating weird training issue
|
2026-03-13 11:06:02 -04:00 |
|
Jai Suphavadeeprasit
|
3df0e45659
|
investigating weird training issue
|
2026-03-13 11:06:02 -04:00 |
|
Jai Suphavadeeprasit
|
690e670e64
|
investigating weird training issue
|
2026-03-13 11:06:02 -04:00 |
|
Jai Suphavadeeprasit
|
a43b0b7e72
|
training kernel
|
2026-03-13 11:06:02 -04:00 |
|
Jai Suphavadeeprasit
|
7ec622a098
|
training ideas
|
2026-03-13 11:06:02 -04:00 |
|
Jai Suphavadeeprasit
|
c26432b963
|
training kernel
|
2026-03-13 11:06:02 -04:00 |
|
Jai Suphavadeeprasit
|
62ef2fcc2e
|
training kernel
|
2026-03-13 11:06:02 -04:00 |
|
Jai Suphavadeeprasit
|
a54dfe7a13
|
tokenizer bug
|
2026-03-13 11:06:02 -04:00 |
|
Jai Suphavadeeprasit
|
c37516b289
|
tokenizer bug
|
2026-03-13 11:06:02 -04:00 |
|
Jai Suphavadeeprasit
|
fd5b426f9f
|
tokenizer bug
|
2026-03-13 11:06:02 -04:00 |
|
Jai Suphavadeeprasit
|
34a39367dc
|
tokenizer bug
|
2026-03-13 11:06:02 -04:00 |
|
Jai Suphavadeeprasit
|
8a348beccd
|
tokenizer bug
|
2026-03-13 11:06:02 -04:00 |
|
Jai Suphavadeeprasit
|
2f371e03fc
|
tokenizer bug
|
2026-03-13 11:06:02 -04:00 |
|
Jai Suphavadeeprasit
|
b457a678ce
|
tokenizer bug
|
2026-03-13 11:06:02 -04:00 |
|
Jai Suphavadeeprasit
|
3a440f847c
|
tokenizer bug
|
2026-03-13 11:06:02 -04:00 |
|
Jai Suphavadeeprasit
|
c275687fba
|
tokenizer bug
|
2026-03-13 11:06:02 -04:00 |
|
Jai Suphavadeeprasit
|
f1cfc137ec
|
tokenizer bug
|
2026-03-13 11:06:02 -04:00 |
|
Jai Suphavadeeprasit
|
78c0a6d082
|
tokenizer bug
|
2026-03-13 11:06:02 -04:00 |
|
Jai Suphavadeeprasit
|
98a5d3b334
|
testing config
|
2026-03-13 11:06:02 -04:00 |
|
Jai Suphavadeeprasit
|
82be871979
|
testing config
|
2026-03-13 11:06:02 -04:00 |
|
Jai Suphavadeeprasit
|
abba562d4a
|
testing config
|
2026-03-13 11:06:02 -04:00 |
|
Jai Suphavadeeprasit
|
e79af5ff69
|
testing config
|
2026-03-13 11:06:02 -04:00 |
|
Jai Suphavadeeprasit
|
e84686b4fd
|
remove enforce eager
|
2026-03-13 11:06:02 -04:00 |
|
Jai Suphavadeeprasit
|
057c9fe870
|
shorten worker timeout
|
2026-03-13 11:06:02 -04:00 |
|
Jai Suphavadeeprasit
|
d1fd89f992
|
non blocking test
|
2026-03-13 11:06:02 -04:00 |
|
Jai Suphavadeeprasit
|
09ad401995
|
sneaky bug logging
|
2026-03-13 11:06:02 -04:00 |
|
Jai Suphavadeeprasit
|
64794e7c72
|
sneaky bug
|
2026-03-13 11:06:00 -04:00 |
|
Jai Suphavadeeprasit
|
bb2736db4e
|
next
|
2026-03-13 11:05:40 -04:00 |
|
Jai Suphavadeeprasit
|
4f33ab8bf4
|
apparently not so easy
|
2026-03-13 11:04:57 -04:00 |
|
Jai Suphavadeeprasit
|
81f90a67b5
|
forgot something easy
|
2026-03-13 11:04:57 -04:00 |
|
Jai Suphavadeeprasit
|
e5633527ba
|
quicker training
|
2026-03-13 11:04:57 -04:00 |
|
Jai Suphavadeeprasit
|
985311eb94
|
trial
|
2026-03-13 11:04:57 -04:00 |
|
Jai Suphavadeeprasit
|
ad364ac771
|
increase timeout cause vllm is super slow all of a sudden
|
2026-03-13 11:04:57 -04:00 |
|
Jai Suphavadeeprasit
|
d5ca760f36
|
command change
|
2026-03-13 11:04:57 -04:00 |
|
Jai Suphavadeeprasit
|
530fed2877
|
testing set up
|
2026-03-13 11:04:57 -04:00 |
|
Jai Suphavadeeprasit
|
f44eb810bf
|
teacher env init
|
2026-03-13 11:04:57 -04:00 |
|
dmahan93
|
c421582b6f
|
Merge pull request #408 from daspartho/verl-integration-fixes
fix: re-append stop string in math training path
|
2026-03-10 23:08:58 -05:00 |
|
dmahan93
|
1d78069b5d
|
Bump version from 0.3.0 to 0.4.0
|
2026-03-09 23:17:01 -05:00 |
|
dmahan93
|
6facf0add5
|
Merge pull request #405 from NousResearch/add-openai-endpoint-for-managed-server
add tool call parsing based on vllm impl and an openai server endpoint
|
2026-03-09 23:16:00 -05:00 |
|
dmahan93
|
f198c1738e
|
Merge conflict commit
|
2026-03-09 23:13:43 -05:00 |
|
Partho Das
|
632ab0161c
|
Revert "rm hardcoded same score check"
This reverts commit f02c24204d.
|
2026-03-10 01:42:44 +05:30 |
|
dmahan93
|
c0db13978a
|
Merge pull request #409 from NousResearch/pre-commit-ci-update-config
[pre-commit.ci] pre-commit autoupdate
|
2026-03-09 14:43:08 -05:00 |
|
pre-commit-ci[bot]
|
880bb4a632
|
[pre-commit.ci] pre-commit autoupdate
updates:
- [github.com/psf/black-pre-commit-mirror: 26.1.0 → 26.3.0](https://github.com/psf/black-pre-commit-mirror/compare/26.1.0...26.3.0)
- [github.com/astral-sh/ruff-pre-commit: v0.15.4 → v0.15.5](https://github.com/astral-sh/ruff-pre-commit/compare/v0.15.4...v0.15.5)
- [github.com/codespell-project/codespell: v2.4.1 → v2.4.2](https://github.com/codespell-project/codespell/compare/v2.4.1...v2.4.2)
|
2026-03-09 16:44:14 +00:00 |
|
Partho Das
|
cdc23ba5dc
|
Revert "allow serve openai overrides"
This reverts commit bd98a82bbc.
|
2026-03-08 04:42:09 +05:30 |
|
Partho Das
|
cd3a9163c7
|
Revert "eval max_token_length consistent with training config"
This reverts commit 5f52befd38.
|
2026-03-08 04:42:02 +05:30 |
|
J-SUPHA
|
1f676f2185
|
Merge pull request #406 from NousResearch/logprobsfn
Unified get_logprobs interface across the server stack
|
2026-03-05 17:36:22 -05:00 |
|
Jai Suphavadeeprasit
|
eb50099361
|
test_get_logprobs_input_ids_only_passthrough
|
2026-03-05 17:04:45 -05:00 |
|