3 23 20

Kai Zhang

drogozhang

https://drogozhang.github.io

AI & ML interests

NLP

Recent Activity

authored a paper 4 days ago

LLaVA-Critic-R1: Your Critic Model is Secretly a Strong Policy Model

upvoted a paper 5 days ago

VerlTool: Towards Holistic Agentic Reinforcement Learning with Tool Use

upvoted a paper 5 days ago

Attributes as Textual Genes: Leveraging LLMs as Genetic Algorithm Simulators for Conditional Synthetic Data Generation

View all activity

Organizations

upvoted 3 papers 5 days ago

VerlTool: Towards Holistic Agentic Reinforcement Learning with Tool Use

Paper • 2509.01055 • Published 7 days ago • 60

Attributes as Textual Genes: Leveraging LLMs as Genetic Algorithm Simulators for Conditional Synthetic Data Generation

Paper • 2509.02040 • Published 5 days ago • 13

LLaVA-Critic-R1: Your Critic Model is Secretly a Strong Policy Model

Paper • 2509.00676 • Published 8 days ago • 74

upvoted 5 papers 3 months ago

AR-RAG: Autoregressive Retrieval Augmentation for Image Generation

Paper • 2506.06962 • Published Jun 8 • 29

AceReason-Nemotron 1.1: Advancing Math and Code Reasoning through SFT and RL Synergy

Paper • 2506.13284 • Published Jun 16 • 27

upvoted a paper 4 months ago

AceReason-Nemotron: Advancing Math and Code Reasoning through Reinforcement Learning

Paper • 2505.16400 • Published May 22 • 34

upvoted a collection 5 months ago

WebDreamer

Collection

Is Your LLM Secretly a World Model of the Internet? Model-Based Planning for Web Agents • 6 items • Updated Apr 14 • 5

upvoted a paper 6 months ago

Is Your LLM Secretly a World Model of the Internet? Model-Based Planning for Web Agents

Paper • 2411.06559 • Published Nov 10, 2024 • 15

upvoted a paper 10 months ago

AAAR-1.0: Assessing AI's Potential to Assist Research

Paper • 2410.22394 • Published Oct 29, 2024 • 16

upvoted 2 papers 11 months ago

PathMMU: A Massive Multimodal Expert-Level Benchmark for Understanding and Reasoning in Pathology

Paper • 2401.16355 • Published Jan 29, 2024 • 2

TemporalBench: Benchmarking Fine-grained Temporal Understanding for Multimodal Video Models

Paper • 2410.10818 • Published Oct 14, 2024 • 17

upvoted 3 papers about 1 year ago

MMMU-Pro: A More Robust Multi-discipline Multimodal Understanding Benchmark

Paper • 2409.02813 • Published Sep 4, 2024 • 32

ChatDoctor: A Medical Chat Model Fine-tuned on LLaMA Model using Medical Domain Knowledge

Paper • 2303.14070 • Published Mar 24, 2023 • 10

MuirBench: A Comprehensive Benchmark for Robust Multi-image Understanding

Paper • 2406.09411 • Published Jun 13, 2024 • 19

upvoted a paper over 1 year ago

TravelPlanner: A Benchmark for Real-World Planning with Language Agents

Paper • 2402.01622 • Published Feb 2, 2024 • 38

upvoted a collection over 1 year ago

Multimodal Embeddings

Collection

13 items • Updated Oct 19, 2024 • 1

upvoted a paper over 1 year ago

MagicLens: Self-Supervised Image Retrieval with Open-Ended Instructions

Paper • 2403.19651 • Published Mar 28, 2024 • 23

Kai Zhang

AI & ML interests

Recent Activity

Organizations

drogozhang's activity