arxiv:2412.07393
Dongfang Li
crazyofapple
AI & ML interests
None yet
Recent Activity
liked
a model
about 17 hours ago
deepseek-ai/DeepSeek-V3.2
upvoted
a
paper
3 months ago
KaLM-Embedding: Superior Training Data Brings A Stronger Embedding Model
upvoted
a
paper
4 months ago
Stabilizing Long-term Multi-turn Reinforcement Learning with Gated
Rewards