Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Stoney Kang's picture
166 18

Stoney Kang

sikang99
shtefcs's profile picture
·

AI & ML interests

Remote Control based on Vision

Recent Activity

updated a collection 3 days ago
VLA Models
upvoted a paper 3 days ago
A Survey on Vision-Language-Action Models: An Action Tokenization Perspective
upvoted a paper 3 days ago
Depth Anything at Any Condition
View all activity

Organizations

TeamGRIT, Co. Ltd.'s profile picture

Collections 9

VLM, MLLM
  • UrbanLLaVA: A Multi-modal Large Language Model for Urban Intelligence with Spatial Reasoning and Understanding

    Paper • 2506.23219 • Published 7 days ago • 6
Diffusion Model
  • MMaDA: Multimodal Large Diffusion Language Models

    Paper • 2505.15809 • Published May 21 • 89
  • Diffusion World Model

    Paper • 2402.03570 • Published Feb 5, 2024 • 8
VLM, MLLM
  • UrbanLLaVA: A Multi-modal Large Language Model for Urban Intelligence with Spatial Reasoning and Understanding

    Paper • 2506.23219 • Published 7 days ago • 6
Diffusion Model
  • MMaDA: Multimodal Large Diffusion Language Models

    Paper • 2505.15809 • Published May 21 • 89
  • Diffusion World Model

    Paper • 2402.03570 • Published Feb 5, 2024 • 8
View 9 collections

models 0

None public yet

datasets 0

None public yet
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs