HAODONG DUAN

KennyUTC

https://kennymckormick.github.io

AI & ML interests

Video Understanding; Multi-Modal Learning

KennyUTC's activity

upvoted a paper about 24 hours ago

OneIG-Bench: Omni-dimensional Nuanced Evaluation for Image Generation

Paper • 2506.07977 • Published 1 day ago • 37

upvoted a paper 8 days ago

MMSI-Bench: A Benchmark for Multi-Image Spatial Intelligence

Paper • 2505.23764 • Published 12 days ago • 3

authored a paper 21 days ago

Visual Agentic Reinforcement Fine-Tuning

Paper • 2505.14246 • Published 22 days ago • 31

upvoted a paper 21 days ago

Visual Agentic Reinforcement Fine-Tuning

Paper • 2505.14246 • Published 22 days ago • 31

upvoted a paper about 1 month ago

Unified Multimodal Chain-of-Thought Reward Model through Reinforcement Fine-Tuning

Paper • 2505.03318 • Published May 6 • 93

liked a Space about 1 month ago

OpenVLM Subjective Leaderboard

🌎

VLMEvalKit Subjective Benchmark Results

upvoted a paper about 2 months ago

InternVL3: Exploring Advanced Training and Test-Time Recipes for Open-Source Multimodal Models

Paper • 2504.10479 • Published Apr 14 • 270

authored a paper 2 months ago

MM-IFEngine: Towards Multimodal Instruction Following

Paper • 2504.07957 • Published Apr 10 • 34

upvoted 3 papers 2 months ago

VCR-Bench: A Comprehensive Evaluation Framework for Video Chain-of-Thought Reasoning

Paper • 2504.07956 • Published Apr 10 • 46

MM-IFEngine: Towards Multimodal Instruction Following

Paper • 2504.07957 • Published Apr 10 • 34

SmolVLM: Redefining small and efficient multimodal models

Paper • 2504.05299 • Published Apr 7 • 189

updated a dataset 2 months ago

VLMEval/OpenVLMRecords

Updated Apr 8 • 447 • 8

authored a paper 2 months ago

Envisioning Beyond the Pixels: Benchmarking Reasoning-Informed Visual Editing

Paper • 2504.02826 • Published Apr 3 • 68

upvoted a paper 2 months ago

Envisioning Beyond the Pixels: Benchmarking Reasoning-Informed Visual Editing

Paper • 2504.02826 • Published Apr 3 • 68

commented on a paper 2 months ago

Envisioning Beyond the Pixels: Benchmarking Reasoning-Informed Visual Editing

Paper • 2504.02826 • Published Apr 3 • 68

authored a paper 3 months ago

LEGO-Puzzles: How Good Are MLLMs at Multi-Step Spatial Reasoning?

Paper • 2503.19990 • Published Mar 25 • 34

upvoted a paper 3 months ago

LEGO-Puzzles: How Good Are MLLMs at Multi-Step Spatial Reasoning?

Paper • 2503.19990 • Published Mar 25 • 34

commented on a paper 3 months ago

LEGO-Puzzles: How Good Are MLLMs at Multi-Step Spatial Reasoning?

Paper • 2503.19990 • Published Mar 25 • 34

liked 2 datasets 3 months ago

PhoenixZ/MM-AlignBench

Updated Mar 1 • 25 • 4

PhoenixZ/OmniAlign-V-DPO

Viewer • Updated Mar 1 • 133k • 144 • 6