Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
DisOOM
/
Faro-Yi-9B-200k-GGUF
like
1
Text Generation
Transformers
GGUF
PyTorch
quantized
2-bit
3-bit
4-bit precision
5-bit
6-bit
8-bit precision
fp16
GGUF
yi
conversational
text-generation-inference
License:
apache-2.0
Model card
Files
Files and versions
xet
Community
Train
Deploy
Use this model
The gguf quantization of
Fi-9B
Downloads last month
12
GGUF
Model size
8.83B params
Architecture
llama
Chat template
Hardware compatibility
Log In
to view the estimation
2-bit
Q2_K
3.35 GB
3-bit
Q3_K_M
4.32 GB
4-bit
Q4_K_M
5.33 GB
5-bit
Q5_K_M
6.26 GB
6-bit
Q6_K
7.25 GB
8-bit
Q8_0
9.38 GB
16-bit
F16
17.7 GB
Inference Providers
NEW
Text Generation
This model isn't deployed by any Inference Provider.
๐
Ask for provider support