yichunkuo/Llama-2-7b-hf-gptq
Tags: Text Generation, Transformers, Safetensors, llama, text-generation-inference, Inference Endpoints, 4-bit precision, gptq
License: cc-by-4.0
GPTQ quantized Llama-2-7b-hf
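| Branch | Bits | GS | Act Order | Damp % | GPTQ Dataset | Seq Len | Size | ExLlama | Desc |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| main | 4 | None | No | 0.01 | c4 | 4096 | -- | No | 4-bit, without Act Order and with no group size. |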
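
The `main` branch row above can be read as a set of GPTQ quantization settings. The following is a minimal sketch, not the author's actual quantization script: it maps the row onto field names in the style of AutoGPTQ's `quantize_config.json` (`bits`, `group_size`, `desc_act`, `damp_percent`), assuming that "GS: None" corresponds to `group_size = -1` and "Act Order: No" to `desc_act = False`. The helper names `row_to_quantize_config` and `load_quantized` are hypothetical, and the loading function assumes that `transformers`, `accelerate`, and a GPTQ backend are installed and that the repository ships tokenizer files.

```python
import json

# Values copied from the "main" branch row of the table.
table_row = {
    "Bits": "4",
    "GS": "None",
    "Act Order": "No",
    "Damp %": "0.01",
    "GPTQ Dataset": "c4",
    "Seq Len": "4096",
}


def row_to_quantize_config(row):
    """Translate the table row into AutoGPTQ-style settings (assumed mapping)."""
    return {
        "bits": int(row["Bits"]),
        # Assumption: "None" means no grouping, conventionally written as -1.
        "group_size": -1 if row["GS"] == "None" else int(row["GS"]),
        # Assumption: "Act Order" is the desc_act flag.
        "desc_act": row["Act Order"] == "Yes",
        "damp_percent": float(row["Damp %"]),
    }


def load_quantized(repo_id="yichunkuo/Llama-2-7b-hf-gptq"):
    """Load the quantized checkpoint with transformers (untested sketch)."""
    # Imported lazily so the rest of this example runs without transformers.
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(repo_id)
    model = AutoModelForCausalLM.from_pretrained(repo_id, device_map="auto")
    return tokenizer, model


quantize_config = row_to_quantize_config(table_row)
print(json.dumps(quantize_config, indent=2))
```

Calling `load_quantized()` downloads the weights and typically needs a GPU, so it is left uncalled here; the printed dictionary is only a readable restatement of the table row.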
Safetensors
Model size: 1.08B params
Tensor type: I32, FP16