teknium1
|
8a0e107806
|
change eval set size; since this is a small dataset, we need more data for training
|
2025-05-14 19:18:01 -07:00 |
|
teknium1
|
bcc38567ca
|
update some dataset stuff to use allenai's
|
2025-05-14 18:39:31 -07:00 |
|
teknium1
|
881af55f9a
|
add instruction following algo env
|
2025-05-14 18:31:05 -07:00 |
|
shannonsands
|
d011d687af
|
Merge pull request #38 from NousResearch/blackjack2-env
Blackjack2 env
|
2025-05-14 17:30:37 -07:00 |
|
Shannon Sands
|
c72a27d376
|
fixed linting in latest main
|
2025-05-14 17:29:57 -07:00 |
|
Shannon Sands
|
00dd120067
|
Merge branch 'main' into blackjack2-env
|
2025-05-14 17:27:44 -07:00 |
|
Shannon Sands
|
8fad665f6a
|
moved folder location
|
2025-05-14 17:22:30 -07:00 |
|
Shannon Sands
|
c2bf3f5acd
|
moved folder location
|
2025-05-14 17:22:18 -07:00 |
|
Joe Li
|
c1ae25c202
|
Merge pull request #26 from NousResearch/coding_server
add code execution environment
|
2025-05-14 15:08:10 -07:00 |
|
Shannon Sands
|
3fba8e3527
|
linting
|
2025-05-14 14:22:25 -07:00 |
|
Shannon Sands
|
d8ab1a6758
|
linting
|
2025-05-14 14:20:54 -07:00 |
|
Shannon Sands
|
1a7c0294fa
|
refactoring for more clarity
|
2025-05-14 14:18:43 -07:00 |
|
Shannon Sands
|
bb6c205efe
|
Linting
|
2025-05-14 14:05:52 -07:00 |
|
Shannon Sands
|
67cfd961c5
|
linting
|
2025-05-14 14:01:31 -07:00 |
|
Shannon Sands
|
826de9e283
|
Updated README
|
2025-05-14 13:57:20 -07:00 |
|
Shannon Sands
|
f5172b45a8
|
Added README
|
2025-05-14 13:35:15 -07:00 |
|
Shannon Sands
|
85f462df5e
|
Updated test scripts
|
2025-05-14 12:05:59 -07:00 |
|
Shannon Sands
|
d6f9d58606
|
new env runs locally
|
2025-05-14 11:57:45 -07:00 |
|
Shannon Sands
|
54ae40840d
|
no-thinking env added
|
2025-05-14 11:28:39 -07:00 |
|
Shannon Sands
|
21cc528b85
|
move best-of-n selection to util
|
2025-05-14 10:35:12 -07:00 |
|
Shannon Sands
|
4c00e2b209
|
move message history out to utils
|
2025-05-14 10:13:56 -07:00 |
|
dmahan93
|
e9ca0cb9b8
|
Merge pull request #43 from NousResearch/make-run-api-not-reload
Make run api not reload
|
2025-05-14 10:17:59 -05:00 |
|
dmahan93
|
df3fa0af9f
|
move argparse to main()
|
2025-05-14 09:57:08 -05:00 |
|
dmahan93
|
96a3476bec
|
Refactor API server initialization to accept command-line arguments, and default reload to False
|
2025-05-14 09:12:14 -05:00 |
|
Teknium
|
4f0c464382
|
Update README.md
Add Supported Trainers section - Axolotl and Example Trainer subsections
|
2025-05-13 15:54:50 -07:00 |
|
dmahan93
|
41342ff31b
|
Merge pull request #42 from NousResearch/release-workflow
Create upload_to_pypi.yml for releases
|
2025-05-13 13:32:29 -05:00 |
|
dmahan93
|
deabd3bd10
|
Update upload_to_pypi.yml to remove test uploads
|
2025-05-13 13:31:24 -05:00 |
|
dmahan93
|
9b54cdf6d4
|
Update pyproject.toml
|
2025-05-13 13:20:54 -05:00 |
|
dmahan93
|
c447877518
|
Update upload_to_pypi.yml with test...
|
2025-05-13 13:19:05 -05:00 |
|
dmahan93
|
efbf52519d
|
Update upload_to_pypi.yml
|
2025-05-13 13:13:51 -05:00 |
|
dmahan93
|
f7552c9c6f
|
make default not slurm
|
2025-05-13 13:11:04 -05:00 |
|
dmahan93
|
c79d2c34c1
|
Create upload_to_pypi.yml
|
2025-05-13 13:07:35 -05:00 |
|
dmahan93
|
cb8641dc9d
|
Merge pull request #28 from NousResearch/add-support-for-custom-api-servers
add custom server support
|
2025-05-13 12:42:23 -05:00 |
|
dmahan93
|
24bd8192d9
|
remove transformer version as latest is working
|
2025-05-13 11:19:27 -05:00 |
|
dmahan93
|
553d45cc7e
|
start servers as healthy so they can get a chance to check...
|
2025-05-13 11:17:23 -05:00 |
|
dmahan93
|
42c47ebc49
|
add docs :)
|
2025-05-13 10:28:17 -05:00 |
|
dmahan93
|
8b864e9786
|
move server_type to serverbaseline instead so it can be used as well for server instantiation
|
2025-05-13 10:21:58 -05:00 |
|
dmahan93
|
6fc356e76e
|
fix type checking to instantiate an appropriate class instead of the abstract server class
|
2025-05-13 10:09:36 -05:00 |
|
dmahan93
|
e2128b817e
|
restructure config_init...
|
2025-05-13 10:00:45 -05:00 |
|
dmahan93
|
bb658631e9
|
fix bad import from restructuring...
|
2025-05-13 09:58:16 -05:00 |
|
dmahan93
|
df62979b90
|
refactor to not mess up process...
|
2025-05-13 09:22:07 -05:00 |
|
dmahan93
|
6e9405ba95
|
Fix bad merge
|
2025-05-12 20:02:54 -05:00 |
|
dmahan93
|
0aaf59fc9a
|
add trl server
add gsm8k example for axolotl checking
|
2025-05-12 19:04:46 -05:00 |
|
dmahan93
|
96be544228
|
Merge commit '71e7a5ca27' into add-support-for-custom-api-servers
|
2025-05-12 18:40:35 -05:00 |
|
dmahan93
|
71e7a5ca27
|
Merge pull request #41 from NousResearch/workaround-provider-ignoring-n-kwarg-openai-api
Add n kwarg being ignored workaround
|
2025-05-12 18:19:47 -05:00 |
|
Shannon Sands
|
8cd9e4d776
|
made private collect_trajectory re changes
|
2025-05-13 07:58:48 +10:00 |
|
Shannon Sands
|
36f6822d71
|
Merge branch 'main' into blackjack2-env
|
2025-05-13 07:54:04 +10:00 |
|
Shannon Sands
|
d980acfaf9
|
linting
|
2025-05-13 07:52:03 +10:00 |
|
Shannon Sands
|
e480c30b8b
|
removed new fn
|
2025-05-13 07:49:28 +10:00 |
|
Teknium
|
1d78951d63
|
Update README.md
More updates for clarity
|
2025-05-12 11:31:42 -07:00 |
|