Ning Ding
stingning
AI & ML interests
NLP
Recent Activity
authored
a paper
9 days ago
The Entropy Mechanism of Reinforcement Learning for Reasoning Language
Models
upvoted
a
paper
9 days ago
The Entropy Mechanism of Reinforcement Learning for Reasoning Language
Models
authored
a paper
about 1 month ago
TTRL: Test-Time Reinforcement Learning
Organizations
stingning's activity
The dataset viewer show 774k rows but the dataset has much more
1
#2 opened almost 2 years ago
by
lhoestq

Vocab size 32001 causes problems for quantisation
๐
5
2
#1 opened almost 2 years ago
by
TheBloke
