Commit graph

36 commits

Author SHA1 Message Date
Andreas Koepf
80eff8acb6 bump version, update gallery 2025-02-23 22:36:39 +01:00
Andreas Koepf
a1a305c8d7 dev minor version one ahead of PyPI released version 2025-02-22 16:54:05 +01:00
Andreas Koepf
5e7e205639 update GALLERY.my, bump version 2025-02-20 23:03:54 +01:00
Andreas Köpf
52b44c47d5 reasoning-gym-server & cli tool (#154)
* feat: Add initial server structure with configuration, registry, and middleware

* feat: Add chain_sum dataset to experiment registry test

* fix: Update test_registry to use DatasetSpec for composite config validation

* refactor: Update Pydantic config to use json_schema_extra and ConfigDict

* feat: Add Pydantic models for API request/response data

* feat: Implement basic experiment management endpoints with tests

* feat: Implement composite configuration endpoints for experiments

* fix: Add missing DatasetConfigUpdate import in server.py

* refactor: Update dataset config update method to properly merge config updates

* fix: Correctly retrieve current dataset config in composite endpoint

* feat: Add basic CLI structure with experiments and config commands

* feat: Add initial CLI tool with basic experiment management commands

* refactor: Reorganize CLI package structure and fix import paths

* refactor: Implement initial CLI commands for experiment management

* feat: Implement HTTP client for Reasoning Gym server in RGC CLI tool

* fix: Move print statements inside try block to resolve SyntaxError

* fix: Resolve SyntaxError in edit_config function by adding missing except block

* feat: Add default app instance in server module for easier uvicorn startup

* docs: Add README.md with server and RGC tool documentation

* remove unused files

* refactor: Remove unsupported type annotation in registry.py

* refactor: Move ExperimentRegistry to coaching module and add Experiment class

* fix: Add missing CompositeDataset import in test_registry.py

* refactor: Implement lazy ASGI app creation for server initialization

* feat: Add health check command to RGC CLI for server connection

* feat: Add version tracking support to CompositeDataset

* feat: Add DatasetVersionManager for tracking dataset versions

* feat: Add entry_id metadata and score_answer_with_id method to CompositeDataset

* feat: Add entry_id metadata combining version and index

* fix: Resolve undefined variable by storing version_id before use

* test: Add comprehensive unit tests for score_answer_with_id() function

* test: Add comprehensive version tracking test for dataset config updates

* feat: Validate dataset weights are positive in CompositeDataset initialization

* feat: Add weight update and normalization methods to CompositeDataset

* refactor: Centralize weight normalization in CompositeDataset and allow zero-weight datasets

* feat: Add negative weight validation to CompositeDataset constructor

* feat: Add duplicate dataset name check in CompositeDataset and update test

* refactor: Move duplicate dataset name check inside dataset iteration loop

* refactor: Update CompositeDataset weight management to use config as source of truth

* refactor: Move duplicate dataset name check to CompositeConfig.validate()

* test: Update composite dataset weight test assertions and validation

* feat: Add methods to add and remove datasets in CompositeDataset

* refactor: Remove weight normalization and use unnormalized weights directly

* refactor: Remove redundant total weight check in update_dataset_weights

* feat: Add batch generation and scoring endpoints to server

* fix: Import BatchEntry in server.py to resolve undefined name error

* refactor: Update ReasoningGymDataset to use server for batch generation and scoring

* fix: Add missing List and Dict type imports

* feat: Add get_batch() and score_outputs() methods to RGClient

* test: Add unit tests for generate_batch and score_outputs endpoints

* refactor: Add DatasetVersionManager to Experiment class and CompositeDataset constructor

* feat: Add validation for base_index and batch_size in generate_batch endpoint

* refactor: Remove unused BatchRequest type from imports

* refactor: Convert models to use Pydantic exclusively

* test: Update scoring endpoint tests to use correct request model format

* refactor: Rename ScoreItem to AnswerItem and update related code

* feat: Update scoring endpoint to return ordered ScoringResponse with scores and entry_ids

* fix: Add missing ScoringResponse import in server.py

* move verl ppo sample with server into own file

* refactor: Use Pydantic models for get_batch() and score_outputs() in RGClient

* refactor: Update client methods to use Pydantic models for type safety

* refactor: Use Pydantic models for experiment and dataset config operations

* refactor: Clean up duplicate methods and improve error handling in main.py

* first bits of rg server use for verl

* refactor: Optimize scoring with single HTTP request in _score_output

* fix: Correct experiment creation with ExperimentCreate object

* grpo tests with server
2025-02-19 22:41:33 +01:00
Andreas Koepf
82ebcf4ac6 bump version, update GALLERY.md 2025-02-14 23:34:59 +01:00
Andreas Koepf
6c5ee5b915 update GALLERY.md & bump version 2025-02-11 23:43:56 +01:00
Andreas Koepf
0c7fbb5001 bump version 2025-02-09 00:39:48 +01:00
Andreas Koepf
4e49806d22 add ArcAgiDataset class, fix score_entry() metadata params 2025-02-08 23:18:18 +01:00
Andreas Koepf
9887a1beed fix tool.hatch.build section in pyproject.toml 2025-02-07 19:02:43 +01:00
Andreas Koepf
2a363c8610 bump version to 0.1.14 2025-02-07 18:28:06 +01:00
Andreas Koepf (aider)
6ec8f782d7 feat: Add pyyaml dependency to project configuration 2025-02-04 19:07:52 +01:00
Rich Jones
1490abb573 adds zebrapuzzles 2025-02-03 14:34:57 +01:00
Andreas Koepf
63a8c94d85 bump version to 0.1.3, uploaded to pypi 2025-02-02 22:26:24 +01:00
Andreas Koepf (aider)
3fbda91b7b deps: Add pytz dependency to pyproject.toml 2025-02-01 02:15:45 +01:00
Andreas Koepf
44f32e3862 Add time interval dataset class 2025-02-01 02:10:48 +01:00
Andreas Koepf (aider)
ceae294332 fix: Remove duplicate project section and reorganize pyproject.toml 2025-01-31 12:15:27 +01:00
Andreas Koepf (aider)
df2e20532b fix: Move project metadata to correct section in pyproject.toml 2025-01-31 12:13:41 +01:00
Andreas Koepf (aider)
252347fed2 ci: Add GitHub Actions workflow for running tests 2025-01-31 00:10:12 +01:00
Andreas Koepf
e1b2dc83e2 bump version 2025-01-30 23:19:58 +01:00
Andreas Koepf
25540b6634 lint 2025-01-30 22:55:04 +01:00
Rich Jones
189f5bf7a1 game of life via cellpylib 2025-01-30 20:08:44 +01:00
Rich Jones
3b11e4c296 difficulty levels 2025-01-30 16:24:28 +01:00
Andreas Köpf
4b02834aa0 Merge branch 'main' into miserlou/figlet 2025-01-29 23:59:27 +01:00
Rich Jones
781ed6ef78 dep typo 2025-01-29 22:23:00 +01:00
Rich Jones
d6bab78fa1 figlet fonts 2025-01-29 20:38:38 +01:00
Rich Jones
f749539138 inital stab at rubikscube 2025-01-29 15:53:29 +01:00
Andreas Koepf
c3b6af35f0 min python 3.11 to support StrEnum 2025-01-26 22:17:43 +01:00
Andreas Koepf
ad9f0d265c fix unit tests, lower python dependency to 3.9 2025-01-26 16:55:17 +01:00
Andreas Koepf
3a492f2b7d add Apache-2.0 license info to pyproject.toml 2025-01-25 17:48:11 +01:00
Andreas Koepf
ad5453df73 add sympy dep 2025-01-24 15:15:32 +01:00
Andreas Koepf
cf07ee8eae bump version, remove accidentially checked in file 2025-01-23 23:37:24 +01:00
Andreas Koepf (aider)
00430d9d6a feat: Add data file packaging configuration and __init__.py for reasoning_gym.data 2025-01-23 18:57:34 +01:00
Andreas Koepf (aider)
82afdb26e9 feat: Add Black and isort pre-commit hooks with line length configuration 2025-01-23 11:02:13 +01:00
Andreas Koepf (aider)
64641c4ee0 chore: Update project homepage and bug tracker URLs 2025-01-23 10:58:29 +01:00
Andreas Koepf (aider)
f98c16a6d3 chore: Update license to Apache 2.0 and author information 2025-01-23 10:55:01 +01:00
Andreas Koepf (aider)
4dc36f79e7 build: Initialize reasoning_gym package structure with packaging and development setup 2025-01-23 10:50:54 +01:00