Joserf Huang's picture

10

Joserf Huang

JoserfHuang

AI & ML interests

None yet

Recent Activity

upvoted a paper 8 days ago

ViStoryBench: Comprehensive Benchmark Suite for Story Visualization

upvoted a paper 8 days ago

Time Blindness: Why Video-Language Models Can't See What Humans Can?

upvoted a paper 8 days ago

AlphaOne: Reasoning Models Thinking Slow and Fast at Test Time

View all activity

Organizations

None yet

JoserfHuang's activity

upvoted 7 papers 8 days ago

ViStoryBench: Comprehensive Benchmark Suite for Story Visualization

Paper • 2505.24862 • Published 11 days ago • 31

Time Blindness: Why Video-Language Models Can't See What Humans Can?

Paper • 2505.24867 • Published 11 days ago • 75

AlphaOne: Reasoning Models Thinking Slow and Fast at Test Time

Paper • 2505.24863 • Published 11 days ago • 90

ProRL: Prolonged Reinforcement Learning Expands Reasoning Boundaries in Large Language Models

Paper • 2505.24864 • Published 11 days ago • 118

Taming LLMs by Scaling Learning Rates with Gradient Grouping

Paper • 2506.01049 • Published 9 days ago • 36

From Guidelines to Practice: A New Paradigm for Arabic Language Model Evaluation

Paper • 2506.01920 • Published 8 days ago • 4

MIKU-PAL: An Automated and Standardized Multi-Modal Method for Speech Paralinguistic and Affect Labeling

Paper • 2505.15772 • Published 20 days ago • 2

upvoted 3 papers 2 months ago

ILLUME+: Illuminating Unified MLLM with Dual Visual Tokenization and Diffusion Refinement

Paper • 2504.01934 • Published Apr 2 • 23

PaperBench: Evaluating AI's Ability to Replicate AI Research

Paper • 2504.01848 • Published Apr 2 • 36

MergeVQ: A Unified Framework for Visual Generation and Representation with Disentangled Token Merging and Quantization

Paper • 2504.00999 • Published Apr 1 • 92