reward for Non-Verifiable Queries
#12 opened 13 days ago
by
DaleMeng
SFT data release?
#11 opened 16 days ago
by
canac84073
Suggestion for the Unsloth team!
#10 opened 27 days ago
by
owao
Add project page to model card
#9 opened 29 days ago
by
nielsr

Could you release scores on other benchmarks?
#8 opened about 1 month ago
by
lieren2023
performance on Deepseek v3 using your distilled data
#7 opened about 1 month ago
by
QiongC
Please add this model to HuggingChat
2
#4 opened about 2 months ago
by
devopsML

Will you open-source your RL training data?
1
#3 opened about 2 months ago
by
leo98xh
Is there any plan to launch a 14B model?
#1 opened about 2 months ago
by
player225