Chen

Lansechen · Lanselott

AI & ML interests

None yet

Recent Activity

upvoted a paper 28 days ago

GHPO: Adaptive Guidance for Stable and Efficient LLM Reinforcement Learning

liked a model about 2 months ago

tngtech/DeepSeek-TNG-R1T2-Chimera

updated a model 3 months ago

Lansechen/Qwen-2.5-Base-7B-gen8-math3to5-ghpo-cold20-3Dhint-prompt1-epoch3-cosine0516-v1

Organizations

None yet

upvoted a paper 28 days ago

GHPO: Adaptive Guidance for Stable and Efficient LLM Reinforcement Learning

Paper • 2507.10628 • Published Jul 14 • 1

upvoted a collection 5 months ago

March 21 Releases

39 items • Updated Mar 22 • 10