mirror of
https://github.com/open-thought/reasoning-gym.git
synced 2026-04-19 12:58:07 +00:00
fix(README): Arxiv link (#447)
parent
17a8431013
commit
b3f81a6609
1 changed file with 7 additions and 1 deletion
@@ -8,7 +8,7 @@
 <!-- badges -->
 <p align="center">
 <a href="https://arxiv.org/abs/2505.24760">
-<img src="https://img.shields.io/badge/arXiv-2306.00777-b31b1b.svg?style=for-the-badge" alt="Paper PDF">
+<img src="https://img.shields.io/badge/arXiv-2505.24760-b31b1b.svg?style=for-the-badge" alt="Paper PDF">
 </a>
 </p>
 </p>
@@ -80,6 +80,12 @@ If you have ideas for dataset generators please create an issue here or contact
 [](https://discord.gg/gpumode)


+## 🚀 Projects Using Reasoning Gym
+
+Following is a list of awesome projects building on top of Reasoning Gym:
+- [Verifiers: Reinforcement Learning with LLMs in Verifiable Environments](https://github.com/willccbb/verifiers)
+- [(NVIDIA) ProRL: Prolonged Reinforcement Learning Expands Reasoning Boundaries in Large Language Models](https://arxiv.org/abs/2505.24864)
+
 ## 📝 Citation

 If you use this library in your research, please cite the paper: