Perflow-Shuai/data_distillation_longvila_reformated_onlyspatial Viewer • Updated 20 days ago • 134 • 22
Perflow-Shuai/data_distillation_longvila_reformated_onlyspatial Viewer • Updated 20 days ago • 134 • 22
LISA++: An Improved Baseline for Reasoning Segmentation with Large Language Model Paper • 2312.17240 • Published Dec 28, 2023 • 1
Lyra: An Efficient and Speech-Centric Framework for Omni-Cognition Paper • 2412.09501 • Published Dec 12, 2024 • 49
VisionZip: Longer is Better but Not Necessary in Vision Language Models Paper • 2412.04467 • Published Dec 5, 2024 • 111
LiDAR-LLM: Exploring the Potential of Large Language Models for 3D LiDAR Understanding Paper • 2312.14074 • Published Dec 21, 2023
VideoEspresso: A Large-Scale Chain-of-Thought Dataset for Fine-Grained Video Reasoning via Core Frame Selection Paper • 2411.14794 • Published Nov 22, 2024 • 13
MC-MoE: Mixture Compressor for Mixture-of-Experts LLMs Gains More Paper • 2410.06270 • Published Oct 8, 2024 • 1
SEED-Story: Multimodal Long Story Generation with Large Language Model Paper • 2407.08683 • Published Jul 11, 2024 • 26