Commit graph

423 commits

Author SHA1 Message Date
Jake Boggs
7efcb965bd Add zip file 2025-05-18 17:45:44 -04:00
Josh
8ff2b02ce0 Working POC 2025-05-18 14:44:16 -07:00
Jake Boggs
09eed6584d Add example output 2025-05-18 17:43:38 -04:00
Jonah Philion
98e103d2f2 copy over gsmk 2025-05-18 14:42:58 -07:00
Earl Potters
b9a1940d12 Enhance evaluation process in DynastAI with detailed logging
- Added print statements to track the evaluation process, including the number of test scenarios and accuracy after evaluation.
- Improved JSON validation with additional logging for missing fields and invalid data types.
- Introduced counters for valid and invalid scenario generations during scoring.
- Enhanced feedback on scenario generation and processing errors for better debugging.
2025-05-18 21:23:33 +00:00
Jake Boggs
71a87edd50 Initial version 2025-05-18 17:21:16 -04:00
Joshua Jerin
e018440d66 initial - env 2025-05-18 14:16:01 -07:00
arihanv
fa42662c54 feat: optimizer evaluator 2025-05-18 14:15:32 -07:00
Earl Potters
fba2a42bb7 Enhance DynastAI scenario generation with structured point system and history tracking
- Updated point system guidelines to restrict values for Piety, Stability, Power, and Wealth to a range of -20 to 20.
- Introduced a category field for scenario classification.
- Added examples for scenario generation to improve clarity.
- Implemented choice history tracking to maintain a record of player decisions and their effects on the kingdom's state.
- Enhanced prompt formatting to include current kingdom state and recent choices for better context in scenario generation.
2025-05-18 21:13:13 +00:00
Alexander Speicher
1e7c3d7cc1 init 2025-05-18 13:55:40 -07:00
Adam Blumenfeld
3fde5cbda8 update gitignore 2025-05-18 20:52:54 +00:00
Olivier D'Ancona
366f82d5bc Merge branch 'main' of github.com:ODAncona/NousWhiteHouse 2025-05-18 13:41:24 -07:00
Olivier D'Ancona
0fbb112eec initialized with grpo 2025-05-18 13:41:01 -07:00
Josh
94038876f4 Add tokenizer. Fix typing 2025-05-18 13:32:59 -07:00
Josh
659247fc00 Fix environment issues. Safely ran python3 accessibility_env.py --help 2025-05-18 13:07:08 -07:00
Aditya Mehta
b62cf5d74f Update README.md 2025-05-18 13:02:35 -07:00
Aditya Mehta
9472d2de23 Update README.md 2025-05-18 13:01:01 -07:00
Aditya Mehta
7e671925ff Update README.md 2025-05-18 13:00:43 -07:00
Aditya Mehta
23c6f767ce Update README.md 2025-05-18 13:00:34 -07:00
Aditya Mehta
e6b71b335f Update README.md 2025-05-18 12:59:54 -07:00
Aditya Mehta
d8f0749e1b Update README.md 2025-05-18 12:59:35 -07:00
Aditya Mehta
b776d19f7f Update README.md 2025-05-18 12:58:58 -07:00
Josh
737139994a feat: Initial setup for AccessibilityEnv directory and placeholder files 2025-05-18 12:52:04 -07:00
Olivier D'Ancona
f052f14484 uv init 2025-05-18 12:48:21 -07:00
Olivier D'Ancona
89fad09688 init hackathon 2025-05-18 12:43:28 -07:00
Earl Potters
e0dabe1225 Add DynastAI game environment with FastAPI backend and web frontend
- Introduced `dynastai_local_server.py` for local development and testing.
- Implemented `dynastai_server.py` as the main server entry point.
- Created FastAPI endpoints for game state management, card generation, and player choices.
- Developed a web interface with HTML, CSS, and JavaScript for user interaction.
- Added game logic for managing metrics and decision effects.
- Included configuration management and utility functions.
- Established a testing framework for API and environment functionality.
- Updated README.md with project overview and setup instructions.
2025-05-18 19:37:31 +00:00
Teknium
bdb15e5d85 Merge pull request #21 from NousResearch/add-sft-data-env
add an SFT data loading env
2025-05-17 11:30:07 -07:00
shannonsands
b1c9b23956 Merge pull request #50 from NousResearch/readme-updates
Added new env info
2025-05-17 10:22:20 -07:00
artem
651b40968d Merge pull request #51 from NousResearch/kernelbench_env
Kernelbench env with parallel compilation
2025-05-16 21:38:57 -07:00
Artem Yatsenko
4b1c07232c readme updates 2025-05-17 04:37:09 +00:00
Artem Yatsenko
5396975bab move to its own folder, add readme, citation. 2025-05-17 04:34:49 +00:00
Shannon Sands
edf2beaa32 linting 2025-05-16 20:40:15 -07:00
Artem Yatsenko
9875f5dc06 pre-commit. add KB path as variable 2025-05-17 03:17:14 +00:00
Artem Yatsenko
fbfe06771b add parallel compile. add instructions 2025-05-16 19:53:46 -07:00
Shannon Sands
41caa05a1a removed merge error 2025-05-16 19:49:37 -07:00
Shannon Sands
9753d5a122 resolved conflict 2025-05-16 19:48:15 -07:00
Teknium
c9b6534eae Merge pull request #44 from NousResearch/instruction-following-algo-environment
Instruction following algo environment
2025-05-16 19:37:00 -07:00
teknium1
20d263a495 add citation to allenai 2025-05-16 19:34:51 -07:00
teknium1
287bbcd356 some cleanup for final merge 2025-05-16 19:24:50 -07:00
Shannon Sands
fd63c76a5c Added new env info 2025-05-16 16:44:33 -07:00
teknium1
daa6f0ff18 add stricter enforcement of think tags 2025-05-16 13:18:20 -07:00
Artem Yatsenko
45aece515f first commit for kernelbench env 2025-05-16 11:29:10 -07:00
teknium1
6ae0703ad6 fix some regex and show special tokens for completions table 2025-05-15 22:29:42 -07:00
teknium1
24c571654e match num_max_requests with groupsize 2025-05-15 15:57:39 -07:00
Shannon Sands
c8afc0aa5d lint 2025-05-15 12:34:41 -07:00
Shannon Sands
bfb822f1e0 updated APIServerConfig and added requirements.txt and install instructions to README 2025-05-15 12:22:00 -07:00
Shannon Sands
bda3bb99be added requirements.txt and updated README 2025-05-15 12:17:28 -07:00
Shannon Sands
eaaecb71f8 lint 2025-05-15 12:14:39 -07:00
Shannon Sands
bcb2b7206d updated env APIServerConfig and excluded uv.lock 2025-05-15 12:14:09 -07:00
Shannon Sands
bba93552f5 Revert "merged latest"
This reverts commit d768ad68aa.
2025-05-15 12:11:05 -07:00