reasoning-gym/examples/trl/requirements.txt
joesharratt1229 a8e11e71be
Test training with trl (#70)
* first trl grpo implementation
* added config yaml file
* added read me and dependencies
* updated reward format func
2025-02-07 07:42:32 +01:00

10 lines
267 B
Text

torch --index-url https://download.pytorch.org/whl/cu124
torchvision --index-url https://download.pytorch.org/whl/cu124
torchaudio --index-url https://download.pytorch.org/whl/cu124
datasets
peft
transformers
trl
wandb
huggingface_hub
flash-attn --no-build-isolation