Commit graph

186 commits

Author SHA1 Message Date
Nina
b3feae5eef
Update server.py 2025-11-07 19:03:54 +01:00
Nina
74b5412c2b
Update server.py 2025-11-07 19:02:26 +01:00
Nina
16a40a5617
Update server.py 2025-11-07 19:01:48 +01:00
Nina
97107ca868
Update server.py 2025-11-07 19:01:09 +01:00
Nina
a5a8b07848
Update server.py 2025-11-07 19:00:32 +01:00
dmahan93
c96b8a1255
Merge pull request #267 from bobtajson/main
fix: correct typo and improve code quality
2025-11-06 18:36:37 -08:00
dmahan93
b1e164eef5
Merge pull request #264 from NousResearch/add-logprob-server-manager-fn
add sglang specific token level logprob handling and server manager/b…
2025-10-29 13:53:39 -07:00
Dakota
578175a709 fix pre-commit 2025-10-29 14:47:50 -05:00
Dakota
3c8fc32288 fix test case 2025-10-29 14:38:16 -05:00
Dakota
5d6d6bb0dc add docs :) 2025-10-29 11:26:43 -05:00
Dakota
c3a118f50d fix tests 2025-10-29 10:55:10 -05:00
Dakota
d5400460e8 made masked logprobs coherently decided on 2025-10-29 10:52:38 -05:00
Dakota
e57c396f86 ran pre-commit 2025-10-29 10:45:27 -05:00
Dakota
17bb7bdf15 revert base.py 2025-10-29 10:11:05 -05:00
dmahan93
c483840f59 set prompt logprobs to a masked value 2025-10-26 11:58:55 -07:00
dmahan93
c22f8ca81b Merge remote-tracking branch 'origin/add-logprob-server-manager-fn' into add-logprob-server-manager-fn 2025-10-24 23:18:37 -07:00
dmahan93
5d662bf1aa add chat example and fix bug in managed_server 2025-10-24 23:15:56 -07:00
pre-commit-ci[bot]
0d80da5146 [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
2025-10-24 20:10:29 +00:00
dmahan93
7bf4cfbf80 add managed server to make grabbing logprobs easier w/ tokenized items 2025-10-24 13:09:46 -07:00
bobtajson
6e2d36bd2a
Update base.py 2025-10-23 10:27:23 +02:00
bobtajson
59ad9643d6
Update type_definitions.py 2025-10-23 10:26:51 +02:00
bobtajson
3157e6d7e8
Update metrics.py 2025-10-23 10:26:21 +02:00
bobtajson
16c6897b21
Update metrics.py 2025-10-23 10:26:03 +02:00
bobtajson
fd5483d510
Update metrics.py 2025-10-23 10:25:23 +02:00
pre-commit-ci[bot]
312f8859e3 [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
2025-10-16 18:47:07 +00:00
Dakota
d240dbb3b7 Merge remote-tracking branch 'origin/add-logprob-server-manager-fn' into add-logprob-server-manager-fn 2025-10-16 13:46:03 -05:00
Dakota
134cbc09d0 update openai/trl_vllm server with new fn 2025-10-16 13:45:55 -05:00
pre-commit-ci[bot]
1e6a745491 [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
2025-10-16 17:39:04 +00:00
Dakota
c36ec29656 add sglang specific token level logprob handling and server manager/baseline logprob/token fn 2025-10-16 12:38:03 -05:00
pre-commit-ci[bot]
0840c26e94 [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
2025-10-15 04:19:25 +00:00
ropresearch
e5b8fb8654 clean up 2025-10-10 11:50:39 -04:00
ropresearch
baf4b2d8a8 gzip compression for atropos api 2025-10-10 01:26:52 -04:00
dmahan93
36243bd3f4
Merge pull request #253 from NousResearch/rop/gen-params
group temps, sample temps, and logprob api params
2025-10-01 12:58:03 -05:00
ropresearch
6a20b90549 added gen params for latest examples endpoint 2025-10-01 13:05:37 -04:00
ropresearch
b9ecb0cc7f docs update 2025-09-25 17:00:05 -04:00
ropresearch
c3fc68879c group temps, sample temps, and logprob api params 2025-09-25 16:41:58 -04:00
pre-commit-ci[bot]
e02d2c373e [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
2025-09-21 22:33:59 +00:00
Ragnar
60addb9a7d
Update server.py 2025-09-22 00:32:39 +02:00
shannonsands
1a808e2038
Revert "Fix multiple scored data groups (#223)"
This reverts commit 67b3144113.
2025-08-29 17:55:45 +10:00
shannonsands
67b3144113
Fix multiple scored data groups (#223)
* removed changes to other files

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* fail on scores empty

---------

Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2025-08-29 15:47:32 +10:00
Dakota
11f1303da0 add error logging to collect_trajectories so they don't fail silently 2025-08-15 16:34:21 -05:00
shannonsands
9f23c732dd
qwen tokenizer wrapper & fixed jinja template for tool handling (#224)
* added qwen tokenizer wrapper & fixed jinja template for tool handling issues in the official HF one

* moved jinja template into its own file
2025-07-30 11:57:15 +10:00
Teknium
62cee8ac66
Merge pull request #209 from NousResearch/add-pairwise-judge-environment
Add LLM as a judge environment for eval and train based on RewardBench
2025-07-16 13:37:09 -07:00
pre-commit-ci[bot]
3d2d9e67fa [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
2025-07-15 11:42:46 +00:00
Alexey Gorbatovski
53984580c8 Bug fix 2025-07-15 14:37:55 +03:00
hjc-puro
04e69d4a19 appease precommit 2025-07-12 22:51:39 +00:00
hjc-puro
a94e4c9bf0 autoscale metrics table 2025-07-12 22:41:14 +00:00
hjc-puro
6e9baaf9d8 table 2025-07-11 09:52:19 +00:00
hjc-puro
72210cf4ad rename fn 2025-07-11 04:04:55 +00:00
hjc-puro
d133ba3867 comment 2025-07-11 03:54:03 +00:00