fuder.eth
|
1862b193ee
|
Update README.md (#118)
|
2025-05-28 10:24:12 +10:00 |
|
Shannon Sands
|
bfb822f1e0
|
updated APIServerConfig and added requirements.txt and install instructions to README
|
2025-05-15 12:22:00 -07:00 |
|
Shannon Sands
|
00dd120067
|
Merge branch 'main' into blackjack2-env
|
2025-05-14 17:27:44 -07:00 |
|
Shannon Sands
|
8fad665f6a
|
moved folder location
|
2025-05-14 17:22:30 -07:00 |
|
Shannon Sands
|
c2bf3f5acd
|
moved folder location
|
2025-05-14 17:22:18 -07:00 |
|
Shannon Sands
|
3fba8e3527
|
linting
|
2025-05-14 14:22:25 -07:00 |
|
Shannon Sands
|
d8ab1a6758
|
linting
|
2025-05-14 14:20:54 -07:00 |
|
Shannon Sands
|
1a7c0294fa
|
refactoring for more clarity
|
2025-05-14 14:18:43 -07:00 |
|
Shannon Sands
|
bb6c205efe
|
Linting
|
2025-05-14 14:05:52 -07:00 |
|
Shannon Sands
|
67cfd961c5
|
linting
|
2025-05-14 14:01:31 -07:00 |
|
Shannon Sands
|
826de9e283
|
Updated README
|
2025-05-14 13:57:20 -07:00 |
|
Shannon Sands
|
f5172b45a8
|
Added README
|
2025-05-14 13:35:15 -07:00 |
|
Shannon Sands
|
85f462df5e
|
Updated test scripts
|
2025-05-14 12:05:59 -07:00 |
|
Shannon Sands
|
d6f9d58606
|
new env runs locally
|
2025-05-14 11:57:45 -07:00 |
|
Shannon Sands
|
54ae40840d
|
no-thinking env added
|
2025-05-14 11:28:39 -07:00 |
|
Shannon Sands
|
21cc528b85
|
move best-of-n selection to util
|
2025-05-14 10:35:12 -07:00 |
|
Shannon Sands
|
4c00e2b209
|
move message history out to utils
|
2025-05-14 10:13:56 -07:00 |
|
Shannon Sands
|
8cd9e4d776
|
made private collect_trajectory re changes
|
2025-05-13 07:58:48 +10:00 |
|
Shannon Sands
|
e480c30b8b
|
removed new fn
|
2025-05-13 07:49:28 +10:00 |
|
Shannon Sands
|
220b92be47
|
Linting and cleanup
|
2025-05-10 21:15:00 +10:00 |
|
Shannon Sands
|
6617d402b3
|
Doing exact V* calc
|
2025-05-10 20:24:31 +10:00 |
|
Shannon Sands
|
a049dde6b1
|
Adding thinking reward
|
2025-05-10 19:50:30 +10:00 |
|
Shannon Sands
|
840ff20921
|
Fixed typo, revising reward function
|
2025-05-10 19:45:06 +10:00 |
|
Shannon Sands
|
7fe1a40368
|
readd multistep masking
|
2025-05-10 09:24:55 +10:00 |
|
Shannon Sands
|
9efd8c1529
|
linting
|
2025-05-10 08:44:35 +10:00 |
|
Shannon Sands
|
06c4a9e65c
|
linting
|
2025-05-10 08:43:03 +10:00 |
|
Shannon Sands
|
0248cc1227
|
Removed old code, added comments
|
2025-05-10 08:39:52 +10:00 |
|
Shannon Sands
|
ba604d44f9
|
update local server
|
2025-05-10 08:18:41 +10:00 |
|
Shannon Sands
|
c506bb147e
|
simplified config and reward
|
2025-05-10 08:04:39 +10:00 |
|
Shannon Sands
|
7e95c0b67d
|
moving test sever
|
2025-05-10 07:47:44 +10:00 |
|
Shannon Sands
|
a7dfd377da
|
moving env to clean branch
|
2025-05-10 07:44:29 +10:00 |
|