A collection of AIMv2 vision encoders that supports a number of resolutions, native resolution, and a distilled checkpoint.
-
apple/aimv2-large-patch14-224
Image Feature Extraction • 0.3B • Updated • 751 • 51 -
apple/aimv2-huge-patch14-224
Image Feature Extraction • 0.7B • Updated • 191 • 9 -
apple/aimv2-1B-patch14-224
Image Feature Extraction • 1B • Updated • 99 • 7 -
apple/aimv2-3B-patch14-224
Image Feature Extraction • 3B • Updated • 67 • 3