Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Posts
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
sascha-kirch 's Collections
3D Reconstruction
Diffusion Models
DL Perception
Foundation Models
State-Space models

Foundation Models

updated Dec 6, 2024
Upvote
-

  • No "Zero-Shot" Without Exponential Data: Pretraining Concept Frequency Determines Multimodal Model Performance

    Paper • 2404.04125 • Published Apr 4, 2024 • 30

  • Scaling (Down) CLIP: A Comprehensive Analysis of Data, Architecture, and Training Strategies

    Paper • 2404.08197 • Published Apr 12, 2024 • 30

  • Probing the 3D Awareness of Visual Foundation Models

    Paper • 2404.08636 • Published Apr 12, 2024 • 13

  • AM-RADIO: Agglomerative Model -- Reduce All Domains Into One

    Paper • 2312.06709 • Published Dec 10, 2023 • 2

  • Grounding DINO 1.5: Advance the "Edge" of Open-Set Object Detection

    Paper • 2405.10300 • Published May 16, 2024 • 31

  • Depth Anything V2

    Paper • 2406.09414 • Published Jun 13, 2024 • 103

  • Florence-VL: Enhancing Vision-Language Models with Generative Vision Encoder and Depth-Breadth Fusion

    Paper • 2412.04424 • Published Dec 5, 2024 • 63
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs