microsoft/kosmos-2-patch14-224

Image-to-Text
Transformers
PyTorch
Safetensors
kosmos-2
image-text-to-text
image-captioning

Can I use Kosmos-2 for a VQA classification task?

#19 opened 7 months ago by
dutta18

Fix task tag

#18 opened 11 months ago by
merve

Support for multiple image/text pairs and/or in-context learning

#17 opened 12 months ago by
kushinm

To run on the GPU

#16 opened about 1 year ago by
jshenoy

Detailed captioning with one or multiple bounding boxes

#15 opened about 1 year ago by
ytmaimai

How to install Kosmos-2 on Windows?

1
#14 opened about 1 year ago by
rox7677

Bounding Box Confidence Scores

2
#11 opened over 1 year ago by
m22cs058

1 Click Windows, RunPod & Linux Installer for Kosmos-2 with Batch Image captioning feature - not an issue

1
#10 opened over 1 year ago by
MonsterMMORPG

IndexError with draw_entity_boxes_on_image

#9 opened over 1 year ago by
SrikanthChellappa

How to get the model to describe the picture in detail

2
#7 opened over 1 year ago by
li111111111123

Can I pass the prompt via a POST request to the HF Inference endpoint / API?

3
#5 opened over 1 year ago by
jamesdhope

Runtime error when Deploying this model into Spaces

3
#4 opened over 1 year ago by
hfsriks8

The size of Kosmos-2.pt

1
#2 opened over 1 year ago by
sanshi2023

KeyError: 'kosmos-2'

❤️ 2
19
#1 opened over 1 year ago by
yingss