Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
guiminghardychen's picture
9 7 6

guiminghardychen

g-h-chen
21world's profile picture
·
  • g-h-chen

AI & ML interests

None yet

Organizations

UCSC-VLAA's profile picture dulab's profile picture

upvoted a paper 7 months ago

FusionAudio-1.2M: Towards Fine-grained Audio Captioning with Multimodal Contextual Fusion

Paper • 2506.01111 • Published Jun 1, 2025 • 31
upvoted a collection 9 months ago

VisionLM

Collection
1867 items • Updated 14 days ago • 139
upvoted a paper 9 months ago

SFT or RL? An Early Investigation into Training R1-Like Reasoning Large Vision-Language Models

Paper • 2504.11468 • Published Apr 10, 2025 • 30
upvoted a collection 9 months ago

VLAA-Thinker

Collection
7 items • Updated Sep 3, 2025 • 5
upvoted a paper 9 months ago

InternLM-XComposer2.5-Reward: A Simple Yet Effective Multi-Modal Reward Model

Paper • 2501.12368 • Published Jan 21, 2025 • 45
upvoted 2 papers over 1 year ago

DSBench: How Far Are Data Science Agents to Becoming Data Science Experts?

Paper • 2409.07703 • Published Sep 12, 2024 • 67

HuatuoGPT-Vision, Towards Injecting Medical Visual Knowledge into Multimodal LLMs at Scale

Paper • 2406.19280 • Published Jun 27, 2024 • 63
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs