1 1 20

Jeonghun Baek

ku21fan

https://jeonghunbaek.net/

AI & ML interests

Computer vision, Text recognition

Recent Activity

updated a dataset 9 days ago

hal-utokyo/Manga109

updated a model 3 months ago

hal-utokyo/MangaLMM

upvoted a paper 3 months ago

MangaVQA and MangaLMM: A Benchmark and Specialized Model for Multimodal Manga Understanding

View all activity

Organizations

updated a dataset 9 days ago

hal-utokyo/Manga109

Preview • Updated 9 days ago • 44 • 9

updated a model 3 months ago

hal-utokyo/MangaLMM

Image-Text-to-Text • 8B • Updated Jun 1 • 1.27k • 7

upvoted a paper 3 months ago

MangaVQA and MangaLMM: A Benchmark and Specialized Model for Multimodal Manga Understanding

Paper • 2505.20298 • Published May 26 • 6

New activity in hal-utokyo/MangaLMM 3 months ago

Add model card, link to paper and code

#1 opened 3 months ago by

nielsr

updated 2 datasets 3 months ago

JMMMU/JMMMU

Viewer • Updated May 30 • 1.32k • 327 • 16

hal-utokyo/Manga109-s

Preview • Updated May 23 • 31 • 14

liked a Space 3 months ago

MangaLMM Demo

📚

The official demo of MangaLMM

published a model 3 months ago

hal-utokyo/MangaLMM

Image-Text-to-Text • 8B • Updated Jun 1 • 1.27k • 7

liked a dataset 4 months ago

nvidia/Aegis-AI-Content-Safety-Dataset-2.0

Viewer • Updated Jun 9 • 33.4k • 2.94k • 45

liked a dataset 5 months ago

lmms-lab/LLaVA-OneVision-Data

Viewer • Updated May 24 • 3.94M • 15.6k • 215

liked 2 models 6 months ago

microsoft/Phi-4-multimodal-instruct

Automatic Speech Recognition • 6B • Updated May 1 • 450k • 1.48k

microsoft/Phi-4-mini-instruct

Text Generation • 4B • Updated May 1 • 197k • 585

authored a paper 10 months ago

JMMMU: A Japanese Massive Multi-discipline Multimodal Understanding Benchmark for Culture-aware Evaluation

Paper • 2410.17250 • Published Oct 22, 2024 • 15

liked a Space 11 months ago

JMMMU Leaderboard

🥇

Evaluating LMMs on Japanese subjects

liked a Space about 1 year ago

140

TextDiffuser 2

📚

Generate images from text prompts with layout planning

liked a model about 1 year ago

xtuner/llava-llama-3-8b-v1_1-gguf

Image-to-Text • 8B • Updated Apr 30, 2024 • 3.86k • 215

liked 2 models over 1 year ago

rinna/japanese-clip-vit-b-16

Feature Extraction • 0.2B • Updated Mar 23 • 27.6k • 22

stabilityai/japanese-stable-clip-vit-l-16

Feature Extraction • 0.4B • Updated Jul 10, 2024 • 93 • 26

liked a dataset over 1 year ago

sean0042/KorMedMCQA

Viewer • Updated Dec 9, 2024 • 7.49k • 2.48k • 27

liked a model over 1 year ago

MBZUAI/MobiLlama-05B

Text Generation • Updated Feb 28, 2024 • 438 • 41

Jeonghun Baek

AI & ML interests

Recent Activity

Organizations

ku21fan's activity

Add model card, link to paper and code

MangaLMM Demo

JMMMU Leaderboard

TextDiffuser 2