Hugging Face
44.8 TFLOPS
Omar Sanseviero
osanseviero
2822 followers · 449 following
https://osanseviero.github.io/hackerllama/
osanseviero
osanseviero
omarsanseviero
AI & ML interests
Llamas, model merging, massive ASR for data collection, 3D ML, on-device ML, quantization, model judging, ML in browser, healthcare applications, education, intersection of art and ML.🦙
Articles
Llama can now see and run on your device - welcome Llama 3.2 • Sep 25 • 164
Fine-tuning LLMs to 1.58bit: extreme quantization made easy • Sep 18 • 198
Llama 3.1 - 405B, 70B & 8B with multilinguality and long context • Jul 23 • 213
WWDC 24: Running Mistral 7B with Core ML • Jul 22 • 55
How we leveraged distilabel to create an Argilla 2.0 Chatbot • Jul 16 • 32
Welcome Gemma 2 - Google's new open LLM • Jun 27 • 123
Welcome Llama 3 - Meta's new open LLM • Apr 18 • 278
CodeGemma - an official Google release for code LLMs • Apr 9 • 99
🪆 Introduction to Matryoshka Embedding Models • Feb 23 • 54
Welcome Gemma - Google's new open LLM • Feb 21 • 16
Constitutional AI with Open LLMs • Feb 1 • 12
Preference Tuning LLMs with Direct Preference Optimization Methods • Jan 18 • 35
Mixture of Experts Explained • Dec 11, 2023 • 185
Welcome Mixtral - a SOTA Mixture of Experts on Hugging Face • Dec 11, 2023 • 11
Inference for PROs • Sep 22, 2023 • 48
Spread Your Wings: Falcon 180B is here • Sep 6, 2023 • 4
Code Llama: Llama 2 learns to code • Aug 25, 2023 • 8
Results of the Open Source AI Game Jam • Jul 21, 2023
Llama 2 is here - get it on Hugging Face • Jul 18, 2023 • 21
The Falcon has landed in the Hugging Face ecosystem • Jun 5, 2023 • 9
Hugging Face Machine Learning Demos on arXiv • Nov 17, 2022
What's new in Diffusers? 🎨 • Sep 12, 2022
Announcing Evaluation on the Hub • Jun 28, 2022
An Introduction to Deep Reinforcement Learning • May 4, 2022 • 2
Welcome spaCy to the 🤗 Hub • Jul 13, 2021
Sentence Transformers in the 🤗 Hub • Jun 28, 2021
osanseviero's activity
liked a model 5 days ago
InstantX/InstantIR • Image-to-Image • Updated 4 days ago • 39 • 86

liked 2 models 6 days ago
tencent/Hunyuan3D-1 • Updated 3 days ago • 306 • 175
jpgallegoar/F5-Spanish • Updated 6 days ago • 35

liked 2 models 7 days ago
EvanTHU/MotionCLR • Updated 7 days ago • 1
ManzhenWei/MG2 • Updated 6 days ago • 10

liked a space 7 days ago
Running • 44 • 🔥 Vectorsearch Hub Datasets
Add vectors to Hub datasets and do in-memory vector search.

liked a dataset 7 days ago
Spawning/PD12M • Viewer • Updated 11 days ago • 12.4M • 8.35k • 102

liked a model 7 days ago
stabilityai/stable-diffusion-3.5-medium • Text-to-Image • Updated 11 days ago • 45.3k • 323

liked 2 models 9 days ago
amd/AMD-OLMo • Text Generation • Updated 8 days ago • 62
microsoft/deberta-v3-large • Fill-Mask • Updated Mar 19, 2023 • 4.36M • 188

liked 2 models 10 days ago
genmo/mochi-1-preview • Text-to-Video • Updated 11 days ago • 1.57k • 911
Etched/oasis-500m • Updated 7 days ago • 3.65k • 371

liked 3 models 11 days ago
openai/clip-vit-large-patch14 • Zero-Shot Image Classification • Updated Sep 15, 2023 • 23.3M • 1.44k
Qwen/Qwen2-0.5B-Instruct • Text Generation • Updated Aug 21 • 225k • 161
abacusai/Dracarys2-72B-Instruct • Text Generation • Updated 11 days ago • 651 • 58

liked a space 12 days ago
Runtime error • 3 • 🚀 Non Streaming Example

liked a dataset 12 days ago
mistralai/MM-MT-Bench • Viewer • Updated Oct 10 • 92 • 126 • 10

liked a model 12 days ago
amphion/MaskGCT • Text-to-Speech • Updated 17 days ago • 224

liked a model 13 days ago
Qwen/Qwen2.5-72B-Instruct • Text Generation • Updated Sep 25 • 417k • 452

liked a model 14 days ago
NimVideo/cogvideox-2b-img2vid • Image-to-Video • Updated 14 days ago • 1.69k • 47