Kai Zhang
drogozhang
AI & ML interests
NLP
Recent Activity
authored
a paper
5 days ago
LLaVA-Critic-R1: Your Critic Model is Secretly a Strong Policy Model
upvoted
a
paper
5 days ago
VerlTool: Towards Holistic Agentic Reinforcement Learning with Tool Use