singhsidhukuldeep posted an update May 30
🦅 Falcon has landed... again!
And now it not only reads but sees as well 📖👀

Here is a summary of the Falcon-11B-VLM model:

Model Type: Causal decoder-only model 🔄.

Parameters: 11 billion 🌌.

Vision Integration: Combines the pretrained CLIP ViT-L/14 vision encoder with the recently released Falcon2-11B chat-finetuned model, trained on image-text data 🖼️📚.

Training: Pretrained on over 5,000 billion tokens from RefinedWeb, enhanced with curated corpora 📊.

Dynamic Encoding: Enhances perception of fine-grained details in images 🔍.

Training Hardware: 16 A100 80GB GPUs with ZeRO and Flash-Attention 2 🖥️.

Tokenizer: Falcon-7B/11B tokenizer 🧩.

Languages Supported: 🌍 Primarily English, with capabilities in German 🇩🇪, Spanish 🇪🇸, French 🇫🇷, Italian 🇮🇹, Dutch 🇳🇱, Romanian 🇷🇴, Czech 🇨🇿, Swedish 🇸🇪, and more 🗣️🌍.

License: Open source under the TII Falcon License 2.0, based on Apache 2.0 📜.

Model: tiiuae/falcon-11B-vlm