Collection: RADIO • A collection of Foundation Vision Models that combine multiple models (CLIP, DINOv2, SAM, etc.) • 14 items • Updated about 9 hours ago
Article: SmolVLA: Efficient Vision-Language-Action Model trained on LeRobot Community Data • By danaaubakirova and 8 others • 4 days ago
Paper: BLIP3-o: A Family of Fully Open Unified Multimodal Models-Architecture, Training and Dataset • arXiv:2505.09568 • Published 23 days ago
Article: The Transformers Library: standardizing model definitions • By lysandre and 3 others • 23 days ago
Paper: Skywork-VL Reward: An Effective Reward Model for Multimodal Understanding and Reasoning • arXiv:2505.07263 • Published 26 days ago
Article: Finally, a Replacement for BERT: Introducing ModernBERT • By bclavie and 14 others • Dec 19, 2024
Paper: OpenVision: A Fully-Open, Cost-Effective Family of Advanced Vision Encoders for Multimodal Learning • arXiv:2505.04601 • Published about 1 month ago
Article: Vision Language Models (Better, Faster, Stronger) • By merve and 4 others • 26 days ago
Paper: D-FINE: Redefine Regression Task in DETRs as Fine-grained Distribution Refinement • arXiv:2410.13842 • Published Oct 17, 2024
Collection: D-FINE • State-of-the-art real-time object detection models under the Apache 2.0 license • 15 items • Updated May 5
Collection: Gemma 3 QAT • Quantization-Aware Trained (QAT) Gemma 3 checkpoints that preserve quality comparable to half precision while using 3x less memory • 19 items • Updated Apr 18
Paper: The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits • arXiv:2402.17764 • Published Feb 27, 2024
Article: MIEB: The Benchmark That Stress-Tests Image-Text Embeddings Like Never Before • By isaacchung and 2 others • Apr 24