Apple
Verified
AI & ML interests
None defined yet.
Recent Activity
View all activity
Benchmark for the design of efficient continual learning of image-text models over years.
AIM: Autoregressive Image Models
A collection of AIMv2 vision encoders that supports a number of resolutions, native resolution, and a distilled checkpoint.
-
apple/aimv2-large-patch14-224
Image Feature Extraction • 0.3B • Updated • 948 • 57 -
apple/aimv2-huge-patch14-224
Image Feature Extraction • 0.7B • Updated • 70 • 11 -
apple/aimv2-1B-patch14-224
Image Feature Extraction • 1B • Updated • 72 • 7 -
apple/aimv2-3B-patch14-224
Image Feature Extraction • 3B • Updated • 10 • 3
MobileCLIP: Mobile-friendly image-text models with SOTA zero-shot capabilities.
DataCompDR: Improved datasets for training image-text SOTA models.
-
MobileCLIP: Fast Image-Text Models through Multi-Modal Reinforced Training
Paper • 2311.17049 • Published • 2 -
apple/mobileclip_s0_timm
Image Classification • Updated • 201 • 10 -
apple/mobileclip_s1_timm
Image Classification • Updated • 165 • 2 -
apple/mobileclip_s2_timm
Image Classification • Updated • 90 • 5
Depth Pro: Sharp Monocular Metric Depth in Less Than a Second
CLIP Models trained using DFN-2B/DFN-5B datasets
DCLM Models + Datasets
A collection of AIMv2 vision encoders that supports a number of resolutions, native resolution, and a distilled checkpoint.
-
apple/aimv2-large-patch14-224
Image Feature Extraction • 0.3B • Updated • 948 • 57 -
apple/aimv2-huge-patch14-224
Image Feature Extraction • 0.7B • Updated • 70 • 11 -
apple/aimv2-1B-patch14-224
Image Feature Extraction • 1B • Updated • 72 • 7 -
apple/aimv2-3B-patch14-224
Image Feature Extraction • 3B • Updated • 10 • 3
MobileCLIP: Mobile-friendly image-text models with SOTA zero-shot capabilities.
DataCompDR: Improved datasets for training image-text SOTA models.
-
MobileCLIP: Fast Image-Text Models through Multi-Modal Reinforced Training
Paper • 2311.17049 • Published • 2 -
apple/mobileclip_s0_timm
Image Classification • Updated • 201 • 10 -
apple/mobileclip_s1_timm
Image Classification • Updated • 165 • 2 -
apple/mobileclip_s2_timm
Image Classification • Updated • 90 • 5
Benchmark for the design of efficient continual learning of image-text models over years.
Depth Pro: Sharp Monocular Metric Depth in Less Than a Second
CLIP Models trained using DFN-2B/DFN-5B datasets
AIM: Autoregressive Image Models
DCLM Models + Datasets