Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
1323
161
248
Merve Noyan
merve
Follow
CryptoFrancesc's profile picture
cmhungsteve's profile picture
dubiousx's profile picture
6110 followers
Ā·
226 following
https://github.com/merveenoyan/smol-vision
mervenoyann
merveenoyan
merve.bsky.social
AI & ML interests
VLMs, vision & co
Recent Activity
posted
an
update
about 16 hours ago
Everything that happened this week in open AI, a recap š¤ https://huggingface.co/collections/merve/jan-17-releases-678a673a9de4a4675f215bf5 š Multimodal - MiniCPM-o 2.6 is a new sota any-to-any model by OpenBMB (vision, speech and text!) - VideoChat-Flash-Qwen2.5-2B is new video multimodal models by OpenGVLab that come in sizes 2B & 7B in resolutions 224 & 448 - ByteDance released larger SA2VA that comes in 26B parameters - Dataset: VRC-Bench is a new diverse benchmark for multimodal LLM reasoning performance š¬ LLMs - MiniMax-Text-01 is a new huge language model (456B passive 45.9B active params) by MiniMaxAI with context length of 4M tokens š¤Æ - Dataset: Sky-T1-data-17k is a diverse dataset used to train Sky-T1-32B - kyutai released Helium-1-Preview-2B is a new small multilingual LM - Wayfarer-12B is a new LLM able to write D&D š§š»āāļø - ReaderLM-v2 is a new HTML parsing model by Jina AI - Dria released, Dria-Agent-a-3B, new agentic coding model (Pythonic function calling) based on Qwen2.5 Coder - Unsloth released Phi-4, faster and memory efficient Llama 3.3 š¼ļø Vision - MatchAnything is a new foundation model for matching - FitDit is a high-fidelity VTON model based on DiT architecture š£ļø Audio - OuteTTS-0.3-1B is a new multilingual text-to-speech model with voice cloning and emotion control capabilities š Retrieval - lightblue released a new reranker based on Qwen2.5 LB-reranker-0.5B-v1.0 that can handle 95+ languages - cde-small-v2 is a new sota small retrieval model by @jxm
updated
a collection
about 17 hours ago
Jan 17 Releases āļø
updated
a collection
about 17 hours ago
Jan 17 Releases āļø
View all activity
Articles
Introducing smolagents: simple agents that write actions in code.
18 days ago
ā¢
505
Welcome PaliGemma 2 ā New vision language models by Google
Dec 5, 2024
ā¢
127
SmolVLM - small yet mighty Vision Language Model
Nov 26, 2024
ā¢
156
Llama can now see and run on your device - welcome Llama 3.2
Sep 25, 2024
ā¢
181
Preference Optimization for Vision Language Models
Jul 10, 2024
ā¢
55
Fine-tuning Florence-2 - Microsoft's Cutting-edge Vision Language Models
Jun 24, 2024
ā¢
183
PaliGemma ā Google's Cutting-Edge Open Vision Language Model
May 14, 2024
ā¢
233
Vision Language Models Explained
Apr 11, 2024
ā¢
245
Introduction to Quantization cooked in š¤ with šš§āš³
Aug 25, 2023
ā¢
25
Deploy MusicGen in no time with Inference Endpoints
Aug 4, 2023
ā¢
4
Open-Source Text Generation & LLM Ecosystem at Hugging Face
Jul 17, 2023
ā¢
2
Jupyter X Hugging Face
Mar 23, 2023
ā¢
2
Using Machine Learning to Aid Survivors and Race through Time
Mar 3, 2023
ā¢
6
Introducing Skops
Aug 12, 2022
ā¢
1
Announcing the Hugging Face Fellowship Program
May 17, 2022
ā¢
6
Showcase Your Projects in Spaces using Gradio
Oct 5, 2021
ā¢
6
Hosting your Models and Datasets on Hugging Face Spaces using Streamlit
Oct 5, 2021
ā¢
3
Organizations
merve
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
New activity in
internlm/internlm-xcomposer2d5-ol-7b
about 17 hours ago
fix task tag
#2 opened about 17 hours ago by
merve
New activity in
5CD-AI/Vintern-1B-v2
about 17 hours ago
fix task tag
#9 opened about 17 hours ago by
merve
New activity in
erax-ai/EraX-VL-7B-V2.0-Preview
about 17 hours ago
fix task tag
#2 opened about 17 hours ago by
merve
New activity in
omkarthawakar/LlamaV-o1
about 17 hours ago
fix task tag
#1 opened about 17 hours ago by
merve
New activity in
black-forest-labs/FLUX.1-Canny-dev-lora
5 days ago
Fix pipeline tag
#3 opened 5 days ago by
merve
New activity in
black-forest-labs/FLUX.1-Depth-dev-lora
5 days ago
Fix pipeline tag
#4 opened 5 days ago by
merve
New activity in
black-forest-labs/FLUX.1-Fill-dev
5 days ago
Fix pipeline tag
#27 opened 5 days ago by
merve
New activity in
jinaai/jina-clip-v2
5 days ago
Fix pipeline tag
1
#28 opened 5 days ago by
merve
New activity in
showlab/ShowUI-2B
5 days ago
Fix pipeline tag to increase visibility
#11 opened 5 days ago by
merve
New activity in
vidore/colpali-v1.2
6 days ago
Fix task tag
#10 opened 6 days ago by
merve
New activity in
vidore/colpali-v1.3-hf
6 days ago
Fix task tag
#3 opened 6 days ago by
merve
New activity in
keremberke/yolov5m-smoke
8 days ago
Enable download stats -- fix library name
#1 opened 8 days ago by
merve
New activity in
arnabdhar/YOLOv8-Face-Detection
8 days ago
Fix metadata
#7 opened 8 days ago by
merve
New activity in
Ultralytics/YOLOv5
8 days ago
Enable download stats
1
#2 opened 8 days ago by
merve
New activity in
Ultralytics/YOLOv8
8 days ago
Enable download stats
1
#1 opened 8 days ago by
merve
New activity in
Ultralytics/YOLO11
8 days ago
Update library name
1
#1 opened 8 days ago by
merve
New activity in
StephanST/WALDO30
8 days ago
Add library
#4 opened 8 days ago by
merve
License
5
#3 opened 9 days ago by
merve
New activity in
WHL95/PRIME-RL-Eurus-2-7B-PRIME
9 days ago
Zero A100 Grant
#1 opened 9 days ago by
merve
New activity in
ByteDance/Sa2VA-1B
9 days ago
Demo
2
#2 opened 9 days ago by
merve
Load more