3 51 21

Junjie Chen

coderchen01

https://junjie-chen.info

AI & ML interests

Efficient AI, Multimodal AI, Generative AI, Spatial Intelligence

Recent Activity

upvoted an article 6 days ago

A Dive into Text-to-Video Models

upvoted a paper 12 days ago

Paper2Poster: Towards Multimodal Poster Automation from Scientific Papers

upvoted a paper 13 days ago

Emerging Properties in Unified Multimodal Pretraining

View all activity

Organizations

None yet

coderchen01's activity

upvoted an article 6 days ago

Article

A Dive into Text-to-Video Models

•

May 8, 2023

• 39

upvoted a paper 12 days ago

Paper2Poster: Towards Multimodal Poster Automation from Scientific Papers

Paper • 2505.21497 • Published 13 days ago • 96

upvoted a paper 13 days ago

Emerging Properties in Unified Multimodal Pretraining

Paper • 2505.14683 • Published 20 days ago • 129

authored a paper 14 days ago

Shifting AI Efficiency From Model-Centric to Data-Centric Compression

Paper • 2505.19147 • Published 15 days ago • 145

upvoted a paper 14 days ago

Shifting AI Efficiency From Model-Centric to Data-Centric Compression

Paper • 2505.19147 • Published 15 days ago • 145

upvoted a paper 19 days ago

Latent Flow Transformer

Paper • 2505.14513 • Published 20 days ago • 27

liked a model about 1 month ago

Cylingo/Xinyuan-LLM-14B-0428

Text Generation • Updated May 1 • 5.45k • 9

upvoted a paper about 1 month ago

Uni3C: Unifying Precisely 3D-Enhanced Camera and Human Motion Controls for Video Generation

Paper • 2504.14899 • Published Apr 21 • 21

upvoted a paper about 2 months ago

CLASH: Evaluating Language Models on Judging High-Stakes Dilemmas from Multiple Perspectives

Paper • 2504.10823 • Published Apr 15 • 14

liked a model about 2 months ago

Qwen/Qwen2.5-Omni-7B

Any-to-Any • Updated Apr 30 • 245k • 1.65k

upvoted a paper 2 months ago

TextCrafter: Accurately Rendering Multiple Texts in Complex Visual Scenes

Paper • 2503.23461 • Published Mar 30 • 95

liked 2 Spaces 2 months ago

317

Qwen2.5 Omni 7B Demo

🏆

Generate text and speech responses from text, images, or audio input

Deep Reinforcement Learning Leaderboard

🚀

Display and search trained RL models on a leaderboard

liked a Space 3 months ago

284

VBench Leaderboard

📊

Upload and analyze video model evaluation data

liked a model 3 months ago

kuleshov-group/bd3lm-owt-block_size16

Text Generation • Updated Apr 13 • 1.64k • 14

upvoted a paper 3 months ago

TPDiff: Temporal Pyramid Video Diffusion Model

Paper • 2503.09566 • Published Mar 12 • 46

liked a Space 3 months ago

Model Atlas

🗺

A demo for exploring and analyzing large-scale model repos

upvoted 3 papers 3 months ago

Charting and Navigating Hugging Face's Model Atlas

Paper • 2503.10633 • Published Mar 13 • 84

Motion Anything: Any to Motion Generation

Paper • 2503.06955 • Published Mar 10 • 33

Block Diffusion: Interpolating Between Autoregressive and Diffusion Language Models

Paper • 2503.09573 • Published Mar 12 • 72