Dan Zhang's picture

2 3

Dan Zhang

zd21

·

https://zhangdan0602.github.io/

AI & ML interests

None yet

Recent Activity

updated a collection 2 days ago

updated a model 2 days ago

zd21/DeepSeek-TD1-PRM

published a model 2 days ago

zd21/DeepSeek-TD1-PRM

View all activity

Organizations

None yet

upvoted 2 collections about 1 month ago

TDRM

Learning Smooth Reward Models with Temporal Difference for LLM RL and Inference • 14 items • Updated 2 days ago • 2

GLM-4.5

GLM-4.5: An open-source large language model designed for intelligent agents by Z.ai • 11 items • Updated 30 days ago • 230

upvoted a paper 10 months ago

AndroidLab: Training and Systematic Benchmarking of Android Autonomous Agents

Paper • 2410.24024 • Published Oct 31, 2024 • 51