ValueFX9507/Tifa-DeepsexV3-14b-GGUF-Q6 Reinforcement Learning • 15B • Updated 17 days ago • 2.74k • 17
ValueFX9507/Tifa-DeepsexV2-7b-MGRPO-GGUF-Q8 Reinforcement Learning • 8B • Updated Mar 28 • 3.9k • 182
ValueFX9507/Tifa-DeepsexV2-7b-MGRPO-GGUF-F16 Reinforcement Learning • 8B • Updated Mar 25 • 2.99k • 85
Nellyw888/VeriReason-Qwen2.5-7b-RTLCoder-Verilog-GRPO-reasoning-tb Reinforcement Learning • 8B • Updated May 31 • 1.04k • 4