Jaehyun Jun

btjhjeon

https://btjhjeon.github.io/

AI & ML interests

Multimodal

btjhjeon's activity

upvoted a paper 2 days ago

Audio Flamingo 2: An Audio-Language Model with Long-Audio Understanding and Expert Reasoning Abilities

Paper • 2503.03983 • Published 4 days ago • 18

updated a collection 2 days ago

Multimodal LLM

172 items • Updated 2 days ago • 14

upvoted a paper 2 days ago

Token-Efficient Long Video Understanding for Multimodal LLMs

Paper • 2503.04130 • Published 4 days ago • 64

upvoted a paper 3 days ago

Enhancing Abnormality Grounding for Vision Language Models with Knowledge Descriptions

Paper • 2503.03278 • Published 5 days ago • 12

liked a model 3 days ago

microsoft/Phi-4-multimodal-instruct

Automatic Speech Recognition • Updated 2 days ago • 231k • 1.04k

upvoted 2 papers 3 days ago

KodCode: A Diverse, Challenging, and Verifiable Synthetic Dataset for Coding

Paper • 2503.02951 • Published 5 days ago • 25

ABC: Achieving Better Control of Multimodal Embeddings using VLMs

Paper • 2503.00329 • Published 9 days ago • 18

updated 2 collections 5 days ago

Multimodal Dataset

30 items • Updated 5 days ago • 2

Multimodal LLM

172 items • Updated 2 days ago • 14

upvoted a paper 5 days ago

Phi-4-Mini Technical Report: Compact yet Powerful Multimodal Language Models via Mixture-of-LoRAs

Paper • 2503.01743 • Published 6 days ago • 65

updated 2 collections 5 days ago

Multimodal Alignment

16 items • Updated 5 days ago • 2

Multimodal Reasoning

8 items • Updated 5 days ago • 1

liked 2 models 6 days ago

Qwen/Qwen2.5-VL-7B-Instruct

Image-Text-to-Text • Updated 4 days ago • 3.46M • 634

deepseek-ai/DeepSeek-R1

Text Generation • Updated 14 days ago • 3.64M • 11k

updated 2 collections 9 days ago

Multimodal LLM

172 items • Updated 2 days ago • 14

Multimodal Reasoning

8 items • Updated 5 days ago • 1

upvoted a paper 9 days ago

MedVLM-R1: Incentivizing Medical Reasoning Capability of Vision-Language Models (VLMs) via Reinforcement Learning

Paper • 2502.19634 • Published 11 days ago • 56

updated a collection 10 days ago

Multimodal Benchmarks

78 items • Updated 10 days ago • 8

upvoted a paper 11 days ago

Kanana: Compute-efficient Bilingual Language Models

Paper • 2502.18934 • Published 12 days ago • 60