Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Hcompany
/
Holo1-3B
like
83
Follow
H company
317
Image-Text-to-Text
Transformers
Safetensors
English
qwen2_5_vl
image-to-text
multimodal
action
agent
conversational
text-generation-inference
arxiv:
2401.13919
arxiv:
2506.02865
License:
other
Model card
Files
Files and versions
Community
5
Train
Deploy
Use this model
6d27086
Holo1-3B
Commit History
Create screenspot_eval.py
6d27086
verified
plcedoz38
commited on
May 22
Update README.md
24abde5
verified
plcedoz38
commited on
May 22
Upload calendar_example.jpg
d9d8f98
verified
plcedoz38
commited on
May 22
Update README.md
64b97e0
verified
plcedoz38
commited on
May 21
Delete LICENSE
1c104cb
verified
plcedoz38
commited on
May 21
Update README.md
f0e31b8
verified
plcedoz38
commited on
May 21
Update README.md
2e4a2c8
verified
plcedoz38
commited on
May 21
Create LICENSE
e17a862
verified
plcedoz38
commited on
May 21
Update README.md
5d51d08
verified
plcedoz38
commited on
May 21
Upload folder using huggingface_hub
6e5d15d
verified
plcedoz38
commited on
May 20
initial commit
cd85305
verified
plcedoz38
commited on
May 20