Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
4
9
Linfeng Song
freesunshine0316
Follow
0 followers
ยท
3 following
https://freesunshine0316.github.io/
LinfengSong1
freesunshine0316
AI & ML interests
Researcher @Tencent AI Lab working on reasoning and RLAIF with LLM, especially search + RL. Working on NLP since 2010.
Recent Activity
authored
a paper
1 day ago
The Trickle-down Impact of Reward (In-)consistency on RLHF
authored
a paper
1 day ago
Iterative Nash Policy Optimization: Aligning LLMs with General Preferences via No-Regret Learning
authored
a paper
1 day ago
Do NOT Think That Much for 2+3=? On the Overthinking of o1-Like LLMs
View all activity
Organizations
Papers
15
arxiv:
2507.06804
arxiv:
2505.23754
arxiv:
2505.10962
arxiv:
2504.11456
Expand 15 papers
models
0
None public yet
datasets
0
None public yet