Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
SimmonsSongHW
/
Llama-3.2-1B-Instruct-GGUF-Imatrix
like
0
GGUF
conversational
License:
apache-2.0
Model card
Files
Files and versions
Community
Deploy
Use this model
README.md exists but content is empty.
Downloads last month
9
GGUF
Model size
1.24B params
Architecture
llama
Chat template
Hardware compatibility
Log In
to view the estimation
2-bit
Q2_K
581 MB
4-bit
Q4_0
773 MB
5-bit
Q5_0
895 MB
6-bit
Q6_K
1.02 GB
8-bit
Q8_0
1.32 GB
View +1 variant
Inference Providers
NEW
This model isn't deployed by any Inference Provider.
๐
Ask for provider support
Collection including
SimmonsSongHW/Llama-3.2-1B-Instruct-GGUF-Imatrix
Llama3-Quants-Imatrix
Collection
3 items
โข
Updated
Mar 20