
BioCLIP
All BioCLIP project Datasets, Models, and Demos.
Zero-Shot Image Classification • Updated • 5.96k • 14Note BioCLIP 2 is a foundation model for biology organismal images. It is trained on TreeOfLife-200M on the basis of a CLIP model (ViT-14/L) pre-trained on LAION-2B. BioCLIP 2 yields state-of-the-art performance in recognizing various species. More importantly, it demonstrates emergent properties beyond species classification after extensive hierarchical contrastive training.
imageomics/bioclip
Zero-Shot Image Classification • Updated • 165k • 51Note BioCLIP is a foundation model for the tree of life, built using CLIP architecture as a vision model for general organismal biology. It is trained on TreeOfLife-10M, our specially-created dataset covering over 450K taxa--the most biologically diverse ML-ready dataset available to date.
imageomics/TreeOfLife-200M
Viewer • Updated • 214M • 2.65k • 12Note Nearly 214 million images representing 952,257 taxa across the tree of life used to train BioCLIP 2. This dataset combines images and metadata from four core biodiversity data providers: Global Biodiversity Information Facility (GBIF), Encyclopedia of Life (EOL), BIOSCAN-5M, and FathomNet to more than double the number of unique taxa covered by TreeOfLife-10M.
imageomics/TreeOfLife-10M
Viewer • Updated • 6.13M • 7.32k • 30Note Over 10 million images covering 454 thousand taxa in the tree of life used to train BioCLIP. This dataset of images of biological organisms paired with their associated taxonomic labels expands on the foundation established by existing high-quality datasets, such as iNat21 and BIOSCAN-1M, by further incorporating newly curated images from the Encyclopedia of Life (eol.org), which supplies most of TreeOfLife-10M’s data diversity.
imageomics/rare-species
Viewer • Updated • 12k • 2.99k • 13Note This dataset was generated alongside TreeOfLife-10M as a benchmark for BioCLIP; data (images and text) were pulled from Encyclopedia of Life (EOL) to generate a dataset consisting of rare species for zero-shot-classification and more refined image classification tasks. Here, we use "rare species" to mean species listed on The International Union for Conservation of Nature (IUCN) Red List as Near Threatened, Vulnerable, Endangered, Critically Endangered, and Extinct in the Wild.
imageomics/IDLE-OO-Camera-Traps
Viewer • Updated • 2.59k • 441Note IDLE-OO Camera Traps is a 5-dataset benchmark of camera trap images from the Labeled Information Library of Alexandria: Biology and Conservation (LILA BC) with a total of 2,586 images for species classification. Each of the 5 benchmarks is balanced to have the same number of images for each species within it (between 310 and 1120 images), representing between 16 and 39 species.
imageomics/bioclip-vit-b-16-inat-only
Zero-Shot Image Classification • Updated • 4Note This model is trained on iNat21, different from BioCLIP which is trained on TreeOfLife-10M.
- 7
Bioclip 2 Demo
😻Classify images to identify plant and animal species
- 67
Bioclip Demo
🐘Identify plant and animal species from images
BioCLIP 2: Emergent Properties from Scaling Hierarchical Contrastive Learning
Paper • 2505.23883 • Published • 1BIOCLIP: A Vision Foundation Model for the Tree of Life
Paper • 2311.18803 • Published • 1