Jaehong Yoon's picture

1 18 2

Jaehong Yoon

jaehong31

·

https://jaehong31.github.io/

AI & ML interests

None yet

Recent Activity

upvoted a paper 3 days ago

Video-RTS: Rethinking Reinforcement Learning and Test-Time Scaling for Efficient and Enhanced Video Reasoning

upvoted a paper 19 days ago

MEXA: Towards General Multimodal Reasoning with Dynamic Multi-Expert Aggregation

upvoted a paper 26 days ago

EgoLife: Towards Egocentric Life Assistant

View all activity

Organizations

authored 5 papers about 1 month ago

Frame Guidance: Training-Free Guidance for Frame-Level Control in Video Diffusion Models

Paper • 2506.07177 • Published Jun 8 • 22

Bitwidth Heterogeneous Federated Learning with Progressive Weight Dequantization

Paper • 2202.11453 • Published Feb 23, 2022

Training-free Guidance in Text-to-Video Generation via Multimodal Planning and Structured Noise Initialization

Paper • 2504.08641 • Published Apr 11 • 7

Video-Skill-CoT: Skill-based Chain-of-Thoughts for Domain-Adaptive Video Reasoning

Paper • 2506.03525 • Published Jun 4 • 6

EPiC: Efficient Video Camera Control Learning with Precise Anchor-Video Guidance

Paper • 2505.21876 • Published May 28 • 9

authored a paper 4 months ago

RSQ: Learning from Important Tokens Leads to Better Quantized LLMs

Paper • 2503.01820 • Published Mar 3 • 2

authored a paper 5 months ago

On the Trustworthiness of Generative Foundation Models: Guideline, Assessment, and Perspective

Paper • 2502.14296 • Published Feb 20 • 46

authored 5 papers 8 months ago

DreamRunner: Fine-Grained Storytelling Video Generation with Retrieval-Augmented Motion Adaptation

Paper • 2411.16657 • Published Nov 25, 2024 • 20

VideoRepair: Improving Text-to-Video Generation via Misalignment Evaluation and Localized Refinement

Paper • 2411.15115 • Published Nov 22, 2024 • 9

Glider: Global and Local Instruction-Driven Expert Router

Paper • 2410.07172 • Published Oct 9, 2024

Adapt-$\infty$: Scalable Lifelong Multimodal Instruction Tuning via Dynamic Data Selection

Paper • 2410.10636 • Published Oct 14, 2024

SAFREE: Training-Free and Adaptive Guard for Safe Text-to-Image And Video Generation

Paper • 2410.12761 • Published Oct 16, 2024

authored 8 papers about 1 year ago

Personalized Subgraph Federated Learning

Paper • 2206.10206 • Published Jun 21, 2022

Text-Conditioned Sampling Framework for Text-to-Image Generation with Masked Generative Models

Paper • 2304.01515 • Published Apr 4, 2023

Analyzing and Mitigating Object Hallucination in Large Vision-Language Models

Paper • 2310.00754 • Published Oct 1, 2023

On the Soft-Subnetwork for Few-shot Class Incremental Learning

Paper • 2209.07529 • Published Sep 15, 2022 • 1

Forget-free Continual Learning with Soft-Winning SubNetworks

Paper • 2303.14962 • Published Mar 27, 2023 • 1

Mementos: A Comprehensive Benchmark for Multimodal Large Language Model Reasoning over Image Sequences

Paper • 2401.10529 • Published Jan 19, 2024 • 1

EnvGen: Generating and Adapting Environments via LLMs for Training Embodied Agents

Paper • 2403.12014 • Published Mar 18, 2024

ECoFLaP: Efficient Coarse-to-Fine Layer-Wise Pruning for Vision-Language Models

Paper • 2310.02998 • Published Oct 4, 2023 • 1