Ning Ding's picture

Ning Ding

stingning

·

https://www.stingning.cn

ningding97

AI & ML interests

NLP

Recent Activity

authored a paper 9 days ago

The Entropy Mechanism of Reinforcement Learning for Reasoning Language Models

upvoted a paper 9 days ago

The Entropy Mechanism of Reinforcement Learning for Reasoning Language Models

authored a paper about 1 month ago

TTRL: Test-Time Reinforcement Learning

View all activity

Organizations

stingning's activity

New activity in stingning/ultrachat almost 2 years ago

The dataset viewer show 774k rows but the dataset has much more

#2 opened almost 2 years ago by

New activity in openbmb/UltraLM-13b almost 2 years ago

Vocab size 32001 causes problems for quantisation

#1 opened almost 2 years ago by