Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
SkalskiP 's Collections
CVPR 2025
Zero-Shot Detection and Segmentation
OpenAI Vision API
LMMs - Large Multimodal Models

CVPR 2025

updated 18 days ago

A collection of models and demos linked to papers presented at CVPR 2025.

Upvote
1

  • Running on Zero
    MCP
    26
    26

    Gaze LLE

    👀

    Gaze Target Estimation


  • Running on Zero
    268
    268

    vggt

    🏆

    VGGT (CVPR 2025)


  • Running on Zero
    28
    28

    UniK3D Demo

    🏢

    UniK3D (CVPR 2025)


  • Running on Zero
    174
    174

    DepthCrafter

    🦀

    a super consistent video depth model


  • Running on L40S
    167
    167

    Video Depth Anything

    👀

    Generate depth video from input video


  • Running on Zero
    783
    783

    MMAudio — generating synchronized audio from video/text

    🔊

    Generate audio from video or text prompts


  • Running on Zero
    33
    33

    Semantic Draw Canvas X Animagine XL 3.1

    🔥

    Create and share 2K arts in 30s with Animagine XL 3.1


  • Running on Zero
    16
    16

    MINIMA

    📈


  • Runtime error
    33
    33

    EdgeTAM

    🚀

    On-Device Track Anything Model


  • Runtime error
    51
    51

    HSMR

    💀

    Convert images of humans to biomechanically accurate 3D skeletons


  • Running on L4
    176
    176

    MatAnyone

    🤡

    Gradio demo for MatAnyone


  • Running on Zero
    115
    115

    Molmo 7B D 0924

    👁


  • Running on Zero
    40
    40

    Magma UI

    📚

    Magma-8B model for UI Agents


  • Running on Zero
    234
    234

    ShowUI

    💻

    Generate clickable coordinates on a screenshot

Upvote
1
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs