Muhammad Ramzan

iamramzan

https://linktr.ee/ramzanshaheen

i_amramzan
iamramzan
iamramzanai

AI & ML interests

GenAI, Vision & Co

iamramzan's collections (5)

Shaheen Collection 🦅

deepseek-ai/DeepSeek-R1

Text Generation • 685B • Updated Mar 27 • 880k • 12.5k
Abirate/english_quotes

Viewer • Updated Oct 25, 2022 • 2.51k • 3.75k • 90
fka/awesome-chatgpt-prompts

Viewer • Updated Jan 6 • 203 • 27.3k • 8.27k
DAMO-NLP-SG/multimodal_textbook

Updated Mar 17 • 2.32k • 145

Vision Foundation Models 🧩

Foundation models for computer vision.

Grounding DINO Demo 💻

Running • 84 • Cutting edge open-vocabulary object detection app
Owlv2 👀

Running • 87 • State-of-the-art Zero-shot Object Detection
BLIP2 with transformers 🌖

Runtime error • 41 • BLIP2 (cutting edge image captioning) in 🤗transformers
IDEFICS Playground 🐨

Runtime error • 377

Comprehensive Computer Vision Backbones 🧩

This collection offers a variety of pre-trained computer vision backbones ideal for fine-tuning.

microsoft/resnet-50

Image Classification • 0.0B • Updated Feb 13, 2024 • 281k • 428
google/vit-base-patch16-224-in21k

Image Feature Extraction • 0.1B • Updated Feb 5, 2024 • 3.7M • 350
google/vit-base-patch32-224-in21k

Image Feature Extraction • 0.1B • Updated Dec 8, 2022 • 8.65k • 19
facebook/dinov2-large

Image Feature Extraction • 0.3B • Updated Sep 6, 2023 • 678k • 86

Top Vision-Language Papers 🖼️💬📝

A curated list of papers on vision-language models, with the most influential ones at the top.

Improved Baselines with Visual Instruction Tuning

Paper • 2310.03744 • Published Oct 5, 2023 • 38
DeepSeek-VL: Towards Real-World Vision-Language Understanding

Paper • 2403.05525 • Published Mar 8, 2024 • 47
Qwen-VL: A Frontier Large Vision-Language Model with Versatile Abilities

Paper • 2308.12966 • Published Aug 24, 2023 • 9
LLaVA-Gemma: Accelerating Multimodal Foundation Models with a Compact Language Model

Paper • 2404.01331 • Published Mar 29, 2024 • 28

Cutting-Edge Object Detection Models 🥥

facebook/detr-resnet-50

Object Detection • 0.0B • Updated Apr 10, 2024 • 491k • 874
facebook/detr-resnet-101-dc5

Object Detection • 0.1B • Updated Sep 6, 2023 • 3.68k • 19
facebook/detr-resnet-50-dc5

Object Detection • 0.0B • Updated Sep 7, 2023 • 1.96k • 6
google/owlvit-base-patch32

Zero-Shot Object Detection • 0.2B • Updated Dec 12, 2023 • 147k • 135
