Kai Zhang's picture

Kai Zhang

drogozhang

·

https://drogozhang.github.io

AI & ML interests

NLP

Recent Activity

authored a paper 7 days ago

LLaVA-Critic-R1: Your Critic Model is Secretly a Strong Policy Model

upvoted a paper 7 days ago

VerlTool: Towards Holistic Agentic Reinforcement Learning with Tool Use

upvoted a paper 7 days ago

Attributes as Textual Genes: Leveraging LLMs as Genetic Algorithm Simulators for Conditional Synthetic Data Generation

View all activity

Organizations

authored a paper 7 days ago

LLaVA-Critic-R1: Your Critic Model is Secretly a Strong Policy Model

Paper • 2509.00676 • Published 10 days ago • 76

upvoted 3 papers 7 days ago

VerlTool: Towards Holistic Agentic Reinforcement Learning with Tool Use

Paper • 2509.01055 • Published 9 days ago • 63

Attributes as Textual Genes: Leveraging LLMs as Genetic Algorithm Simulators for Conditional Synthetic Data Generation

Paper • 2509.02040 • Published 8 days ago • 13

LLaVA-Critic-R1: Your Critic Model is Secretly a Strong Policy Model

Paper • 2509.00676 • Published 10 days ago • 76

upvoted 4 papers 3 months ago

AR-RAG: Autoregressive Retrieval Augmentation for Image Generation

Paper • 2506.06962 • Published Jun 8 • 29

AceReason-Nemotron 1.1: Advancing Math and Code Reasoning through SFT and RL Synergy

Paper • 2506.13284 • Published Jun 16 • 27

VisCoder: Fine-Tuning LLMs for Executable Python Visualization Code Generation

Paper • 2506.03930 • Published Jun 4 • 26

PhyX: Does Your Model Have the "Wits" for Physical Reasoning?

Paper • 2505.15929 • Published May 21 • 49

upvoted 2 papers 4 months ago

ARM: Adaptive Reasoning Model

Paper • 2505.20258 • Published May 26 • 45

AceReason-Nemotron: Advancing Math and Code Reasoning through Reinforcement Learning

Paper • 2505.16400 • Published May 22 • 34

liked 2 models 5 months ago

osunlp/Dreamer-7B-Reddit

Image-Text-to-Text • 8B • Updated Apr 9 • 6 • 1

osunlp/Dreamer-7B-Classifieds

Image-Text-to-Text • 8B • Updated Apr 9 • 6 • 1

liked a dataset 5 months ago

osunlp/Dreamer-V1-Data

Viewer • Updated Apr 9 • 3.12M • 347 • 3

upvoted a collection 5 months ago

WebDreamer

Is Your LLM Secretly a World Model of the Internet? Model-Based Planning for Web Agents • 6 items • Updated Apr 14 • 5

updated a collection 5 months ago

WebDreamer

Is Your LLM Secretly a World Model of the Internet? Model-Based Planning for Web Agents • 6 items • Updated Apr 14 • 5