view article Article Visualize and understand GPU memory in PyTorch By qgallouedec β’ Dec 24, 2024 β’ 224
view article Article You could have designed state of the art positional encoding By FL33TW00D-HF β’ Nov 25, 2024 β’ 287
Whisper Collection OpenAI Whisper speech recognition models in MLX format β’ 48 items β’ Updated Oct 1, 2024 β’ 45
What matters when building vision-language models? Paper β’ 2405.02246 β’ Published May 3, 2024 β’ 104
Idefics2 πΆ Collection Idefics2-8B is a foundation vision-language model. In this collection, you will find the models, datasets and demo related to its creation. β’ 11 items β’ Updated May 6, 2024 β’ 91
Zero-Shot Detection and Segmentation Collection Demos of projects focused on zero-shot detection and segmentation. β’ 4 items β’ Updated Feb 7, 2024 β’ 3