CogSense Collection Toward Cognitive Supersensing in Multimodal Large Language Model • 3 items • Updated 6 days ago • 2
MedSAM3: Delving into Segment Anything with Medical Concepts Paper • 2511.19046 • Published Nov 24, 2025 • 53 • 4
Toward Cognitive Supersensing in Multimodal Large Language Model Paper • 2602.01541 • Published 11 days ago • 16
Toward Cognitive Supersensing in Multimodal Large Language Model Paper • 2602.01541 • Published 11 days ago • 16
Drive as You Speak: Enabling Human-Like Interaction with Large Language Models in Autonomous Vehicles Paper • 2309.10228 • Published Sep 19, 2023
On-Board Vision-Language Models for Personalized Autonomous Vehicle Motion Control: System Design and Real-World Validation Paper • 2411.11913 • Published Nov 17, 2024
MedSAM3: Delving into Segment Anything with Medical Concepts Paper • 2511.19046 • Published Nov 24, 2025 • 53
MedSAM3: Delving into Segment Anything with Medical Concepts Paper • 2511.19046 • Published Nov 24, 2025 • 53
Qwen/Qwen3-VL-235B-A22B-Thinking Image-Text-to-Text • 236B • Updated Nov 26, 2025 • 2.46M • • 375
SocialGesture: Delving into Multi-person Gesture Understanding Paper • 2504.02244 • Published Apr 3, 2025
Fine-Grained Preference Optimization Improves Spatial Reasoning in VLMs Paper • 2506.21656 • Published Jun 26, 2025 • 16
Fine-Grained Preference Optimization Improves Spatial Reasoning in VLMs Paper • 2506.21656 • Published Jun 26, 2025 • 16