5 43 10

Misaki Wang

MisakiWang

AI & ML interests

None yet

Recent Activity

upvoted a paper 2 days ago

MultiRef: Controllable Image Generation with Multiple Visual References

upvoted a paper 15 days ago

Are We on the Right Way for Assessing Document Retrieval-Augmented Generation?

upvoted a collection about 2 months ago

Janus

View all activity

Organizations

None yet

upvoted a paper 2 days ago

MultiRef: Controllable Image Generation with Multiple Visual References

Paper • 2508.06905 • Published 14 days ago • 18

upvoted a paper 15 days ago

Are We on the Right Way for Assessing Document Retrieval-Augmented Generation?

Paper • 2508.03644 • Published 18 days ago • 24

upvoted a collection about 2 months ago

Janus

Collection

Janus is a novel autoregressive framework that unifies multimodal understanding and generation. • 8 items • Updated Feb 18 • 17

liked a Space 2 months ago

338

3D Arena

🏢

Vote and view 3D leaderboard

upvoted an article 2 months ago

Article

Introducing the Chatbot Guardrails Arena

and 3 others •

Mar 21, 2024

• 5

upvoted 2 papers 6 months ago

Wikipedia in the Era of LLMs: Evolution and Risks

Paper • 2503.02879 • Published Mar 4 • 22

SurveyX: Academic Survey Automation via Large Language Models

Paper • 2502.14776 • Published Feb 20 • 101

liked a model 6 months ago

black-forest-labs/FLUX.1-dev

Text-to-Image • Updated Jun 27 • 1.44M • • 11.2k

liked a Space 6 months ago

3.1k

The Ultra-Scale Playbook

🌌

The ultimate guide to training LLM on large GPU Clusters

upvoted 3 papers 7 months ago

Demystifying Long Chain-of-Thought Reasoning in LLMs

Paper • 2502.03373 • Published Feb 5 • 59

The Differences Between Direct Alignment Algorithms are a Blur

Paper • 2502.01237 • Published Feb 3 • 115

OmniHuman-1: Rethinking the Scaling-Up of One-Stage Conditioned Human Animation Models

Paper • 2502.01061 • Published Feb 3 • 222

upvoted a collection 7 months ago

DeepSeek-R1

Collection

10 items • Updated May 29 • 788

liked a Space 7 months ago

2.02k

PuLID-FLUX

🤗

Generate images from text prompts and ID images

upvoted a paper 7 months ago

MiniMax-01: Scaling Foundation Models with Lightning Attention

Paper • 2501.08313 • Published Jan 14 • 298

upvoted a paper 8 months ago

rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking

Paper • 2501.04519 • Published Jan 8 • 283

liked a model 8 months ago

HuggingFaceM4/Idefics3-8B-Llama3

Image-Text-to-Text • 8B • Updated Dec 2, 2024 • 90.9k • 292

liked 2 datasets 8 months ago

HuggingFaceM4/OBELICS

Viewer • Updated Aug 22, 2023 • 276M • 4.74k • 156

DAMO-NLP-SG/multimodal_textbook

Updated Mar 17 • 617 • 147

upvoted a paper 9 months ago

Perception Tokens Enhance Visual Reasoning in Multimodal Language Models

Paper • 2412.03548 • Published Dec 4, 2024 • 17