3 23 20

Kai Zhang

drogozhang

https://drogozhang.github.io

AI & ML interests

NLP

Recent Activity

authored a paper 6 days ago

LLaVA-Critic-R1: Your Critic Model is Secretly a Strong Policy Model

upvoted a paper 6 days ago

VerlTool: Towards Holistic Agentic Reinforcement Learning with Tool Use

upvoted a paper 6 days ago

Attributes as Textual Genes: Leveraging LLMs as Genetic Algorithm Simulators for Conditional Synthetic Data Generation

View all activity

Organizations

authored a paper 6 days ago

LLaVA-Critic-R1: Your Critic Model is Secretly a Strong Policy Model

Paper • 2509.00676 • Published 9 days ago • 76

authored 2 papers 11 months ago

Long-LRM: Long-sequence Large Reconstruction Model for Wide-coverage Gaussian Splats

Paper • 2410.12781 • Published Oct 16, 2024 • 6

TemporalBench: Benchmarking Fine-grained Temporal Understanding for Multimodal Video Models

Paper • 2410.10818 • Published Oct 14, 2024 • 17

authored 2 papers about 1 year ago

MMMU-Pro: A More Robust Multi-discipline Multimodal Understanding Benchmark

Paper • 2409.02813 • Published Sep 4, 2024 • 32

MuirBench: A Comprehensive Benchmark for Robust Multi-image Understanding

Paper • 2406.09411 • Published Jun 13, 2024 • 19

authored 2 papers over 1 year ago

MagicLens: Self-Supervised Image Retrieval with Open-Ended Instructions

Paper • 2403.19651 • Published Mar 28, 2024 • 23

TravelPlanner: A Benchmark for Real-World Planning with Language Agents

Paper • 2402.01622 • Published Feb 2, 2024 • 38

authored 2 papers almost 2 years ago

ImagenHub: Standardizing the evaluation of conditional image generation models

Paper • 2310.01596 • Published Oct 2, 2023 • 19

MMMU: A Massive Multi-discipline Multimodal Understanding and Reasoning Benchmark for Expert AGI

Paper • 2311.16502 • Published Nov 27, 2023 • 36

authored 4 papers about 2 years ago

MagicBrush: A Manually Annotated Dataset for Instruction-Guided Image Editing

Paper • 2306.10012 • Published Jun 16, 2023 • 36

ChatDoctor: A Medical Chat Model Fine-tuned on LLaMA Model using Medical Domain Knowledge

Paper • 2303.14070 • Published Mar 24, 2023 • 10

Automatic Evaluation of Attribution by Large Language Models

Paper • 2305.06311 • Published May 10, 2023

Adaptive Chameleon or Stubborn Sloth: Unraveling the Behavior of Large Language Models in Knowledge Clashes

Paper • 2305.13300 • Published May 22, 2023 • 2

Kai Zhang

AI & ML interests

Recent Activity

Organizations

drogozhang's activity