2 6 5

ZhiqiLi

RealZhiqiLi

AI & ML interests

None yet

Recent Activity

liked a dataset 1 day ago

weaverbirdllm/famma

liked a dataset 6 days ago

generalagents/showdown-clicks

liked a dataset 6 days ago

zwq2018/embodied_reasoner

View all activity

Organizations

RealZhiqiLi's activity

liked a dataset 1 day ago

weaverbirdllm/famma

Viewer • Updated 8 days ago • 4.1k • 424 • 13

liked 2 datasets 6 days ago

generalagents/showdown-clicks

Viewer • Updated 16 days ago • 557 • 1.09k • 8

zwq2018/embodied_reasoner

Preview • Updated 13 days ago • 1.58k • 10

upvoted a paper about 1 month ago

Token-Efficient Long Video Understanding for Multimodal LLMs

Paper • 2503.04130 • Published Mar 6 • 93

New activity in nvidia/Eagle2-9B 3 months ago

Sequence packing logic

#3 opened 3 months ago by

orrzohar

authored 8 papers 3 months ago

FB-BEV: BEV Representation from Forward-Backward View Transformations

Paper • 2308.02236 • Published Aug 4, 2023

Panoptic SegFormer: Delving Deeper into Panoptic Segmentation with Transformers

Paper • 2109.03814 • Published Sep 8, 2021

Efficient Deformable ConvNets: Rethinking Dynamic and Sparse Operator for Vision Applications

Paper • 2401.06197 • Published Jan 11, 2024 • 1

DriveMLM: Aligning Multi-Modal Large Language Models with Behavioral Planning States for Autonomous Driving

Paper • 2312.09245 • Published Dec 14, 2023

Video Mamba Suite: State Space Model as a Versatile Alternative for Video Understanding

Paper • 2403.09626 • Published Mar 14, 2024 • 15

InternImage: Exploring Large-Scale Vision Foundation Models with Deformable Convolutions

Paper • 2211.05778 • Published Nov 10, 2022

Driving with InternVL: Oustanding Champion in the Track on Driving with Language of the Autonomous Grand Challenge at CVPR 2024

Paper • 2412.07247 • Published Dec 10, 2024

Eagle 2: Building Post-Training Data Strategies from Scratch for Frontier Vision-Language Models

Paper • 2501.14818 • Published Jan 20 • 4

New activity in huggingface/HuggingDiscussions 3 months ago

[FEEDBACK] Daily Papers

127

#32 opened 10 months ago by

kramp

upvoted a paper 3 months ago

Eagle 2: Building Post-Training Data Strategies from Scratch for Frontier Vision-Language Models

Paper • 2501.14818 • Published Jan 20 • 4

liked a model 3 months ago

nvidia/Eagle2-1B

Image-Text-to-Text • Updated 22 days ago • 2.02k • 19

upvoted a paper 4 months ago

Expanding Performance Boundaries of Open-Source Multimodal Models with Model, Data, and Test-Time Scaling

Paper • 2412.05271 • Published Dec 6, 2024 • 154

upvoted a paper 8 months ago

Eagle: Exploring The Design Space for Multimodal LLMs with Mixture of Encoders

Paper • 2408.15998 • Published Aug 28, 2024 • 88

updated a dataset 9 months ago

RealZhiqiLi/VLMEvalImages

Updated Jul 25, 2024 • 5

upvoted a paper 11 months ago

Chameleon: Mixed-Modal Early-Fusion Foundation Models

Paper • 2405.09818 • Published May 16, 2024 • 132