A collection of AIMv2 vision encoders that supports a number of resolutions, native resolution, and a distilled checkpoint.
-
apple/aimv2-large-patch14-224
Image Feature Extraction β’ Updated β’ 2.74k β’ 41 -
apple/aimv2-huge-patch14-224
Image Feature Extraction β’ Updated β’ 250 β’ 7 -
apple/aimv2-1B-patch14-224
Image Feature Extraction β’ Updated β’ 180 β’ 4 -
apple/aimv2-3B-patch14-224
Image Feature Extraction β’ Updated β’ 33 β’ 2