xintaozhen/MiniVLA

Image-Text-to-Text
Transformers
ONNX
Safetensors
English
vision-language-action
edge-deployment
tensorRT
qwen
MiniVLA
12.9 GB
  • 1 contributor
History: 24 commits
Latest commit: e1e3d75 (verified), "Upload 2 files", by xintaozhen, 11 days ago
  • Results
    Upload 2 files 11 days ago
  • models
    Upload 7 files 12 days ago
  • qwen25-0_5b-trtllm
    Upload 10 files 12 days ago
  • qwen25-0_5b-with-extra-tokenizer
    Upload 9 files 12 days ago
  • tensorRT
    Upload vision_encoder_fp16.onnx 12 days ago
  • .gitattributes
    1.8 kB
    Upload 2 files 11 days ago
  • MiniVLA_Architecture.jpg
    1.6 MB
    Upload 2 files 11 days ago
  • MiniVLA_Architecture.svg
    399 kB
    Upload 2 files 11 days ago
  • README.md
    7.24 kB
    Update README.md 11 days ago
  • System_Architecture.jpg
    974 kB
    Upload 2 files 11 days ago
  • System_Architecture.svg
    110 kB
    Upload 2 files 11 days ago