guiminghardychen

g-h-chen

AI & ML interests

None yet

upvoted a paper 7 months ago

FusionAudio-1.2M: Towards Fine-grained Audio Captioning with Multimodal Contextual Fusion

Paper • 2506.01111 • Published Jun 1, 2025 • 31

upvoted a collection 9 months ago

VisionLM

1867 items • Updated 14 days ago • 139

upvoted a paper 9 months ago

SFT or RL? An Early Investigation into Training R1-Like Reasoning Large Vision-Language Models

Paper • 2504.11468 • Published Apr 10, 2025 • 30

upvoted a collection 9 months ago

VLAA-Thinker

7 items • Updated Sep 3, 2025 • 5

upvoted a paper 9 months ago

InternLM-XComposer2.5-Reward: A Simple Yet Effective Multi-Modal Reward Model

Paper • 2501.12368 • Published Jan 21, 2025 • 45

upvoted 2 papers over 1 year ago

DSBench: How Far Are Data Science Agents to Becoming Data Science Experts?

Paper • 2409.07703 • Published Sep 12, 2024 • 67

HuatuoGPT-Vision, Towards Injecting Medical Visual Knowledge into Multimodal LLMs at Scale

Paper • 2406.19280 • Published Jun 27, 2024 • 63