Adam Fields
adamelliotfields
AI & ML interests
Diffusers and transformers
Organizations
None yet
vision language models
papers and models
- Molmo and PixMo: Open Weights and Open Data for State-of-the-Art Multimodal Models
  Paper • 2409.17146 • Published • 121
- Qwen2-VL: Enhancing Vision-Language Model's Perception of the World at Any Resolution
  Paper • 2409.12191 • Published • 77
- mistralai/Pixtral-12B-2409
  Updated • 2.71k • 664
- HuggingFaceTB/SmolVLM-Instruct
  Image-Text-to-Text • 2B • Updated • 70.4k • 542
image generation
diffusion and transformer models 🧨
- CompVis/stable-diffusion-v1-4
  Text-to-Image • Updated • 554k • 6.91k
- stable-diffusion-v1-5/stable-diffusion-v1-5
  Text-to-Image • Updated • 3.04M • 800
- benjamin-paine/stable-diffusion-v1-5
  Text-to-Image • Updated • 11.4k • 70
- Comfy-Org/stable-diffusion-v1-5-archive
  Updated • 458k • 67
cv papers
computer vision papers
- Rich feature hierarchies for accurate object detection and semantic segmentation
  Paper • 1311.2524 • Published • 1
- DeepPose: Human Pose Estimation via Deep Neural Networks
  Paper • 1312.4659 • Published • 1
- Generative Adversarial Networks
  Paper • 1406.2661 • Published • 5
- scikit-image: Image processing in Python
  Paper • 1407.6245 • Published • 1
small language models
under 7b
video generation
datasets
for machine learning projects
papers
machine learning and neural network papers
- SMOTE: Synthetic Minority Over-sampling Technique
  Paper • 1106.1813 • Published • 1
- Scikit-learn: Machine Learning in Python
  Paper • 1201.0490 • Published • 1
- Learning Phrase Representations using RNN Encoder-Decoder for Statistical Machine Translation
  Paper • 1406.1078 • Published
- Distributed Representations of Sentences and Documents
  Paper • 1405.4053 • Published
favorites
wow
- TikSlop
  The 100% Latent Video App
  Running on CPU Upgrade • 306
- AI WebTV
  Display a loading screen with a spinner
  Paused • 503
- Hub API Playground
  Try the Hugging Face API through the playground
  Running on CPU Upgrade • 131
- LoRA Studio
  Browse and run thousands of community trained LoRAs
  Running • 584