Peter Grasch's picture

3

Peter Grasch

petergrasch

AI & ML interests

None yet

Recent Activity

upvoted a paper about 13 hours ago

MM-Spatial: Exploring 3D Spatial Understanding in Multimodal LLMs

authored a paper about 13 hours ago

MM-Spatial: Exploring 3D Spatial Understanding in Multimodal LLMs

authored a paper 3 months ago

FastVLM: Efficient Vision Encoding for Vision Language Models

View all activity

Organizations

None yet

petergrasch's activity

upvoted a paper about 13 hours ago

MM-Spatial: Exploring 3D Spatial Understanding in Multimodal LLMs

Paper • 2503.13111 • Published 3 days ago • 6

authored a paper about 13 hours ago

MM-Spatial: Exploring 3D Spatial Understanding in Multimodal LLMs

Paper • 2503.13111 • Published 3 days ago • 6

authored a paper 3 months ago

FastVLM: Efficient Vision Encoding for Vision Language Models

Paper • 2412.13303 • Published Dec 17, 2024 • 13

upvoted a collection 4 months ago

AIMv2

A collection of AIMv2 vision encoders that supports a number of resolutions, native resolution, and a distilled checkpoint. • 19 items • Updated Nov 22, 2024 • 74

authored a paper 6 months ago

MM1.5: Methods, Analysis & Insights from Multimodal LLM Fine-tuning

Paper • 2409.20566 • Published Sep 30, 2024 • 56

upvoted a paper 9 months ago

Understanding Alignment in Multimodal LLMs: A Comprehensive Study

Paper • 2407.02477 • Published Jul 2, 2024 • 23

authored a paper 9 months ago

Understanding Alignment in Multimodal LLMs: A Comprehensive Study

Paper • 2407.02477 • Published Jul 2, 2024 • 23