moondream2
a tiny vision language model
a tiny vision language model
MoonDream 2 Vision Model on the Browser: Candle/Rust/WASM
Generate text from images and prompts
Generate images from text prompts with various styles
Meta Llama3 8b with Llava Multimodal capabilities
Generate text and segment images using PaliGemma
Answer questions about images by chatting
Generate image descriptions
Microsoft Phi-3 Vision 128k with Multimodal capabilities
let's talk about the meaning of life
Convert images to grayscale
Analyze images to generate captions, detect objects, or perform OCR
Generate detailed captions for images
Describe images in detail with text
Interact with Florence-2 to analyze images and generate descriptions
A private and powerful multimodal AI chatbot that runs local
Create images from descriptions or images
Generate text based on an image and prompt
Ask questions about images
Describe images using text prompts
GOT - OCR (from : UCAS, Beijing)
Chat about images with detailed descriptions
Huggingface space for JanusFlow-1.3B
Ask questions about images to get answers
Generate captions for images