| Jai Suphavadeeprasit | 2f371e03fc | tokenizer bug | 2026-03-13 11:06:02 -04:00 |
| Jai Suphavadeeprasit | 78c0a6d082 | tokenizer bug | 2026-03-13 11:06:02 -04:00 |
| Jai Suphavadeeprasit | 09ad401995 | sneaky bug logging | 2026-03-13 11:06:02 -04:00 |
| Jai Suphavadeeprasit | 64794e7c72 | sneaky bug | 2026-03-13 11:06:00 -04:00 |
| Jai Suphavadeeprasit | bb2736db4e | next | 2026-03-13 11:05:40 -04:00 |
| Jai Suphavadeeprasit | f44eb810bf | teacher env init | 2026-03-13 11:04:57 -04:00 |
| dmahan93 | f198c1738e | Merge conflict commit | 2026-03-09 23:13:43 -05:00 |
| Jai Suphavadeeprasit | b91922082e | managed_Server pass through and centralize sem logic | 2026-03-05 15:46:33 -05:00 |
| dmahan93 | f4875c5dc6 | make preserve thinking optional | 2026-03-04 15:44:12 -06:00 |
| Jai Suphavadeeprasit | c85a3e5ee7 | readme language | 2026-03-03 23:44:29 -05:00 |
| pre-commit-ci[bot] | efc90bfb1b | [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci | 2026-03-04 04:18:12 +00:00 |
| Jai Suphavadeeprasit | 1eeb31065f | fixing comments | 2026-03-03 23:16:05 -05:00 |
| pre-commit-ci[bot] | 8f304d44fd | [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci | 2026-03-04 03:08:19 +00:00 |
| Jai Suphavadeeprasit | 5aaf7a346c | prompt logprobs simplicity | 2026-03-03 22:06:49 -05:00 |
| Jai Suphavadeeprasit | f1c20591b6 | prompt logprobs | 2026-03-03 21:58:05 -05:00 |
| Jai Suphavadeeprasit | 439b9b129b | prompt logprobs | 2026-03-03 21:58:05 -05:00 |
| dmahan93 | 12d61d197f | add env using the tool api stuff | 2026-03-03 19:51:30 -06:00 |
| dmahan93 | c8eb63f33d | readme updates for tool calling | 2026-03-03 12:22:10 -06:00 |
| pre-commit-ci[bot] | e98100e5f6 | [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci | 2026-03-03 17:21:00 +00:00 |
| Jai Suphavadeeprasit | 323a8a2601 | readme updates | 2026-03-03 12:19:55 -05:00 |
| Jai Suphavadeeprasit | b9291aa29f | init commit | 2026-03-03 11:32:09 -05:00 |
| dmahan93 | 8f21bb57ed | add better warning message | 2026-03-02 23:21:25 -06:00 |
| dmahan93 | add42a2afb | add tool call parsing based on vllm impl and an openai server endpoint | 2026-03-02 23:17:13 -06:00 |
| pre-commit-ci[bot] | 216c1f5899 | [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci | 2026-02-27 21:17:58 +00:00 |
| Jai Suphavadeeprasit | 35587cbdc0 | logger changes | 2026-02-27 16:17:03 -05:00 |
| pre-commit-ci[bot] | 64d3ee1bd6 | [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci | 2026-02-27 18:16:06 +00:00 |
| Jai Suphavadeeprasit | f343b24a6a | narrow down scope | 2026-02-27 11:14:42 -05:00 |
| Jai Suphavadeeprasit | e5297148f9 | dynamic system prompt fixed | 2026-02-20 14:50:43 -05:00 |
| Jai Suphavadeeprasit | fc248dd65b | clean | 2026-02-20 12:01:50 -05:00 |
| Jai Suphavadeeprasit | 55f7cbd091 | dynamic system prompts | 2026-02-20 03:14:05 -05:00 |
| Jai Suphavadeeprasit | e615eb1f50 | assertions | 2026-02-20 02:16:49 -05:00 |
| Jai Suphavadeeprasit | 559d649a26 | proper fallback | 2026-02-20 01:45:41 -05:00 |
| Jai Suphavadeeprasit | 3910a58f9b | refactor base | 2026-02-20 01:45:41 -05:00 |
| Jai Suphavadeeprasit | 1c90fc71b0 | on policy clean up | 2026-02-20 01:45:41 -05:00 |
| Jai Suphavadeeprasit | 79e392c446 | post merge changes | 2026-02-20 01:45:41 -05:00 |
| Jai Suphavadeeprasit | c89854a350 | debug changes | 2026-02-20 01:45:41 -05:00 |
| Jai Suphavadeeprasit | 0510ca9b72 | found bug | 2026-02-20 01:45:41 -05:00 |
| Jai Suphavadeeprasit | fb23014dcc | base env debugging | 2026-02-20 01:45:41 -05:00 |
| Jai Suphavadeeprasit | ea2b388435 | base env debugging | 2026-02-20 01:45:41 -05:00 |
| Jai Suphavadeeprasit | e814007575 | base env debugging | 2026-02-20 01:45:41 -05:00 |
| Jai Suphavadeeprasit | b492ac4fce | on policy changes | 2026-02-20 01:45:41 -05:00 |
| Jai Suphavadeeprasit | 6bc962c746 | initial commit | 2026-02-20 01:45:41 -05:00 |
| Dakota | 7d6aeb9bbf | add tokenizer name config to set the vllm/sglang tokenizer to something different if needed | 2026-02-09 15:26:29 -06:00 |
| Dakota | 9ff24bf370 | change to 128 tokens to support low length rejection | 2026-02-04 16:23:30 -06:00 |
| Dakota | 10f651289c | Add dummy openai managed server | 2026-02-04 15:16:36 -06:00 |
| VolodymyrBg | e0744adf28 | Update README.md | 2026-01-29 10:23:53 +02:00 |
| VolodymyrBg | dd02df0d76 | Update base.py | 2026-01-29 10:22:51 +02:00 |
| balyan.sid@gmail.com | 5a20abdce7 | switch eval to use managed server adapter impl. moved managed server adapter | 2026-01-23 23:26:29 +05:30 |
| Siddharth Balyan | 7f28c52994 | Merge branch 'main' into sid/verifiers | 2026-01-16 11:50:27 +05:30 |
| teknium | 681616844d | linter.... | 2026-01-15 07:44:53 +00:00 |