GRPO-Training / requirements.txt
satyanayak's picture
peft model logic added
7d089e3
raw
history blame contribute delete
80 Bytes
gradio>=4.19.2
transformers>=4.38.0
torch>=2.2.0
accelerate>=0.27.0
peft>=0.9.0