Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
rirv938
/
GPTQ-LLaMa-30B-4bit-triton-g128
like
0
Text Generation
Transformers
llama
text-generation-inference
Inference Endpoints
Model card
Files
Files and versions
Community
Train
Deploy
Use this model
main
GPTQ-LLaMa-30B-4bit-triton-g128
2 contributors
History:
2 commits
robert
add files
51ad9f1
over 1 year ago
.gitattributes
1.48 kB
initial commit
over 1 year ago
LLaMa-30B-GPTQ-4bit-g128.safetensors
17.5 GB
LFS
add files
over 1 year ago
config.json
503 Bytes
add files
over 1 year ago
generation_config.json
137 Bytes
add files
over 1 year ago
special_tokens_map.json
411 Bytes
add files
over 1 year ago
tokenizer.json
1.84 MB
add files
over 1 year ago
tokenizer_config.json
700 Bytes
add files
over 1 year ago