Commit graph

351 commits

Author SHA1 Message Date
sam-paech
8d7c9c47cf revert openrouter default temp 2025-07-10 21:58:45 +10:00
sam-paech
1f154a7073 fixes for state updates 2025-07-10 21:52:22 +10:00
sam-paech
3d591a11d1 critical state & plotting fixes 2025-07-10 12:20:30 +10:00
sam-paech
70a876bcee add relationship plots 2025-07-10 10:35:17 +10:00
sam-paech
af3fb8ce48 fix sentiment calcs 2025-07-10 07:49:47 +10:00
sam-paech
2338e3c6e6 bug fixes 2025-07-10 07:06:34 +10:00
Tyler Marques
754cf8e23b
Merge pull request #52 from EveryInc/experiment-runner-updates
Harness defaults, client.py updates & new experiment runner stats
2025-07-04 10:20:24 -07:00
Tyler Marques
ca5e415d89
Removing file accidentally added
Signed-off-by: Tyler Marques <me@tylermarques.com>
2025-07-04 09:20:57 -07:00
sam-paech
e351aa3841 savegame fix + chart updates 2025-07-05 00:27:02 +10:00
sam-paech
7edc7c465f fix prompt errors & add per-power prompt dir functionality 2025-07-04 11:31:57 +10:00
sam-paech
22cf3202c7 cleanup 2025-07-04 09:35:57 +10:00
sam-paech
05016cafba clients raise errors to run_llm_and_log 2025-07-04 09:26:46 +10:00
sam-paech
1b16579c54 fix imports 2025-07-04 08:59:42 +10:00
sam-paech
73a3c0596a resolving issues raised in pr 2025-07-04 08:38:57 +10:00
Tyler Marques
ad817695ef
Ruff format and check 2025-07-03 14:27:50 -07:00
Tyler Marques
aadedd6512
Moving more environment variables to central config 2025-07-03 12:50:49 -07:00
Tyler Marques
540c2003e8
Adding PowerEnum as a separate model
The PowerEnum correctly handles some misspellings. It can be easily
expanded to handle more within the _POWER_ALIASES dict.
2025-07-03 12:26:50 -07:00
sam-paech
a241e34496 fix to respect model ids passed as args when resuming 2025-07-03 09:59:47 +10:00
sam-paech
4fc1f370be allow specification of base url + api key for each model 2025-07-03 08:55:10 +10:00
sam-paech
f29ac9c1c5 all client types can be specified with a prefix in front of the model id 2025-07-03 07:41:29 +10:00
sam-paech
1cb24f1884 set minimalist prompts as harness defaults 2025-07-03 07:27:48 +10:00
sam-paech
cbd475c0c2 add new comparative stats to experiment runner 2025-07-03 03:30:47 +10:00
Tyler Marques
7905f9a0b8
Merge pull request #51 from peregrinates/ordersdata 2025-07-01 20:12:45 -07:00
peregrinates
00ce085e35
make a long form dataframe of each order in game from log data 2025-07-01 22:49:00 -04:00
AlxAI
46cabc1786 circular import 2025-06-30 12:12:52 +02:00
AlxAI
01a0f227a9 Update experiment_runner.py 2025-06-30 12:08:56 +02:00
AlxAI
7b633a0ec8 making phase summaries and unformatted prompts params in lm_game as well as making the models changeable in utils 2025-06-30 12:03:37 +02:00
AlxAI
861a5a222f fixed un-needed changes 2025-06-30 09:32:26 +02:00
AlxAI
0f01618354 Update formatter.py 2025-06-30 08:01:35 +02:00
AlxAI
6e4821735d first attempt at two part orders, unformatted -> formatted 2025-06-29 23:05:48 +02:00
Tyler Marques
01c6dde114
Merge pull request #45 from EveryInc/order-history
Order history
2025-06-29 10:10:37 -07:00
Tyler Marques
dd7aba87b0
Merge pull request #44 from EveryInc/simple_prompts
add simplified prompts
2025-06-29 10:07:00 -07:00
Tyler Marques
43a2482d44
Merge pull request #48 from EveryInc/feature/default-reviewers
Adding code owners for default PR reviewers
2025-06-29 10:03:42 -07:00
Tyler Marques
282598d173
Adding code owners for default PR reviewers
Signed-off-by: Tyler Marques <me@tylermarques.com>
2025-06-29 10:00:43 -07:00
sam-paech
b5a84867a1 add order history 2025-06-29 01:53:03 +10:00
sam-paech
ebf26cf8a6 add simplified prompts 2025-06-27 14:42:05 +10:00
Tyler Marques
0bd909b30b
Adding run with uv instructions
Signed-off-by: Tyler Marques <me@tylermarques.com>
2025-06-23 09:45:23 -07:00
Tyler Marques
46525efe6b
Merge pull request #43 from EveryInc/feature/uv-dependences
Repo cleanup and uv tool
2025-06-23 09:34:50 -07:00
Tyler Marques
d17672d258
Fixing some missing dependencies for lm_game.py
Signed-off-by: Tyler Marques <me@tylermarques.com>
2025-06-23 09:24:28 -07:00
Tyler Marques
a93a89f7cb
Bit of a cleanup
Moved all the files that don't need to be at top level to the
experiments folder.

Started using uv to init the project, which should make the install easier.
2025-06-23 09:18:20 -07:00
AlxAI
5b85b9f89e Create statistical_game_analysis.py 2025-06-23 08:53:51 -04:00
Tyler Marques
617ccdb1a0
Merge pull request #42 from EveryInc/experiment-runner
Experiment runner
2025-06-22 09:34:55 -07:00
Tyler Marques
4bc5774248
Merge pull request #41 from EveryInc/resuming-and-critical-state-analysis
add resuming support + critical state analysis mode
2025-06-22 09:33:39 -07:00
sam-paech
840c6b0ad9 add experiment runner 2025-06-22 18:05:07 +10:00
sam-paech
7d50b31e34 add resuming support + critical state analysis mode 2025-06-22 14:41:29 +10:00
AlxAI
477ffc144d failed sets, convoys, etc. analyzing csv's 2025-06-19 21:17:50 -04:00
AlxAI
3c879941bc Merge branch 'main' of https://github.com/EveryInc/AI_Diplomacy 2025-06-19 19:48:59 -04:00
AlxAI
adf2af00f3 add ability to check for convoys and supports 2025-06-19 19:48:58 -04:00
Tyler Marques
800ae57ef0
Merge pull request #40 from sam-paech/simplify-prompts
revamp diary summarisation
2025-06-19 15:11:06 -07:00
Tyler Marques
b652c032c4
Merge branch 'main' into simplify-prompts 2025-06-19 15:10:59 -07:00