Commit graph

48 commits

Author SHA1 Message Date
AlxAI
6e4821735d first attempt at two part orders, unformatted -> formatted 2025-06-29 23:05:48 +02:00
sam-paech
b5a84867a1 add order history 2025-06-29 01:53:03 +10:00
sam-paech
ebf26cf8a6 add simplified prompts 2025-06-27 14:42:05 +10:00
sam-paech
840c6b0ad9 add experiment runner 2025-06-22 18:05:07 +10:00
sam-paech
7d50b31e34 add resuming support + critical state analysis mode 2025-06-22 14:41:29 +10:00
AlxAI
f29cf242e1 together AI client 2025-06-19 15:14:30 -04:00
sam-paech
f939dd9634 add temperature support, default temp=0, add random seed string to system prompt 2025-06-19 10:12:38 +10:00
AlxAI
77e7921b9c Support for o3-pro with openai responses api 2025-06-14 17:28:01 -04:00
AlxAI
cf556c119d analyze any game results appended with FULL_GAME 2025-06-03 13:54:18 -04:00
AlxAI
fa17592e75 Update utils.py 2025-06-02 13:58:07 -04:00
AlxAI
b1e5cd7a89 Update utils.py 2025-05-29 10:29:39 -04:00
AlxAI
7e35e005db more parsing 2025-05-26 13:50:57 -04:00
AlxAI
21d852308e more flexible 2025-05-25 17:41:00 -04:00
AlxAI
a2781c3568 lie detection improved! 2025-05-25 12:02:54 -04:00
AlxAI
4b92dd5af0 updating analysis with lie detection (it's not great yet) 2025-05-24 20:44:23 -04:00
AlxAI
81e3dcfe3f analyze game moments with number of failures per model 2025-05-21 19:38:07 -07:00
AlxAI
94a69be25a full diaries 2025-05-20 22:12:18 -07:00
AlxAI
9322ada62b analyze moments, run big models well 2025-05-20 20:04:19 -07:00
AlxAI
db827de273 first diary 2025-05-18 17:23:47 -04:00
AlxAI
c50ac85758 analyze game moments attempt 1 2025-05-17 22:28:22 -04:00
AlxAI
7fe6544667 fixed prompts, improved negotiations, and diaries 2025-05-17 20:00:14 -04:00
AlxAI
4b32940bf6 fixing prompts 2025-05-15 11:22:24 -04:00
AlxAI
02bde92188 Improving prompts with XML 2025-05-13 21:46:40 -04:00
AlxAI
0bd6428729 BIG UPDATES logging everything, better structure of moves, everything runs fast af 2025-05-11 19:10:18 -04:00
sam-paech
0c7b0157b5 add private diary summaries 2025-05-11 18:38:37 +10:00
AlxAI
4cdcd64a9e Support more openrouter models 2025-05-05 17:38:59 -04:00
AlxAI
53e6a8fd6a saving logs 2025-05-05 10:46:34 -04:00
AlxAI
1dc25702b6 relationships work! Everything is ready for big runs 2025-05-04 11:21:51 -04:00
AlxAI
02118dc98b async!! 2025-04-29 22:28:53 -04:00
AlxAI
47f55423da cleaning up a little 2025-04-28 21:09:48 -04:00
AlxAI
70f4438b2e state! 2025-04-07 17:25:12 -07:00
AlxAI
0242d7446b Revert "Merge branch 'main' into animation"
This reverts commit d7f93f587a, reversing changes made to d505c7ea6c.
2025-03-04 20:31:14 -08:00
AlxAI
d7f93f587a Merge branch 'main' into animation 2025-03-04 20:26:35 -08:00
Oam Patel
1f8ac5ae20 add optional planning phase 2025-02-27 02:10:48 +00:00
AlxAI
cb04ad0be5 attempt at fixing recursive summarization 2025-02-25 06:55:21 -08:00
AlxAI
eb3de01956 dramatically improving logging thanks to new 3.7sonnet cursor agent mode 2025-02-24 15:49:37 -08:00
AlxAI
b54a8252d6 fix convoy first attempt at summaries 2025-02-23 18:18:47 -08:00
AlxAI
2693b01014 Lots of improvements to prompting putting the right information in for negotiation and phase summaries - CONVOYS BROKEN RN 2025-02-23 11:18:37 -08:00
AlxAI
6b0863cb5b dramatically simplify phase summary 2025-02-20 22:24:24 -08:00
AlxAI
72327cfb22 Randomization for powers and models + enhance order instructions
Also improved plotting to show model + power
2025-02-20 15:58:41 -08:00
AlxAI
b886fd7bfc with phase summaries and country prompts 2025-02-19 21:26:54 -08:00
Oam Patel
e3db81f00e no press + context includes orders and results 2025-02-18 19:29:04 +00:00
Oam Patel
9443384c89 fix parsing for negotiations 2025-02-17 06:22:56 +00:00
Oam Patel
2f5ec5cdf7 ruff 2025-02-17 05:20:33 +00:00
Oam Patel
a0404acff9 multi message sending + further planning 2025-02-17 05:20:06 +00:00
Oam Patel
67e06f1e40 big refactor + big prompt changes + context prompt 2025-02-17 04:02:25 +00:00
Oam Patel
1b042cf6c6 script + error logging for order decoding 2025-02-16 01:27:16 +00:00
Oam Patel
a1aed6674f refactor 2025-02-16 00:24:57 +00:00