Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

LMMs-Lab

community
https://www.lmms-lab.com/
lmmslab
EvolvingLMMs-Lab
Activity Feed

AI & ML interests

Feeling and building the multimodal intelligence.

Recent Activity

mwxely  authored a paper 1 day ago
Visual Generation in the New Era: An Evolution from Atomic Mapping to Agentic World Modeling
kcz358  authored a paper 1 day ago
Visual Generation in the New Era: An Evolution from Atomic Mapping to Agentic World Modeling
THUdyh  authored a paper 28 days ago
Video-MME-v2: Towards the Next Stage in Benchmarks for Comprehensive Video Understanding
View all activity

Papers

Visual Generation in the New Era: An Evolution from Atomic Mapping to Agentic World Modeling

A Simple Baseline for Streaming Video Understanding

View all Papers

Bo Li's profile picturePu Fanyi's profile pictureZhang Peiyuan's profile pictureZhang Yuanhan's profile pictureChunyuan Li's profile pictureHaotian Liu's profile picturekcz's profile pictureKairui's profile pictureNguyen Quang Trung's profile picturePham Ba Cong's profile pictureJinming Wu's profile pictureYingluo Li's profile pictureDevin Thang's profile pictureJingkang Yang's profile pictureZihao Deng's profile pictureYezhen Wang's profile pictureXinyu Huang's profile pictureXiyao Wang's profile pictureGao Yiming's profile pictureJinghao Guo's profile pictureDo Duc Anh's profile pictureyiyexy's profile picturewkzhang's profile picturexiangan's profile pictureHaiwen Diao's profile pictureJiankangDeng's profile pictureZhongang Cai's profile pictureyl-1993's profile picturewangyubo's profile pictureYANG Zhitao's profile pictureZuhao Yang's profile pictureYuwei Niu's profile pictureYuhao Dong's profile picture
lmms-lab 's Papers 5
Submitted by
taesiri
85

Visual Generation in the New Era: An Evolution from Atomic Mapping to Agentic World Modeling

lmms-lab LMMs-Lab
75 4
Submitted by
Yujiao Shen
73

A Simple Baseline for Streaming Video Understanding

lmms-lab LMMs-Lab
119 7
Submitted by
yiyexy
52

OneVision-Encoder: Codec-Aligned Sparsity as a Foundational Principle for Multimodal Intelligence

lmms-lab LMMs-Lab
344 4
Submitted by
Zuhao Yang
189

LongVT: Incentivizing "Thinking with Long Videos" via Native Tool Calling

lmms-lab LMMs-Lab
225 7
Submitted by
kcz
96

OpenMMReasoner: Pushing the Frontiers for Multimodal Reasoning with an Open and General Recipe

lmms-lab LMMs-Lab
161 3
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs