akshathmangudi/llama3.1-8b-quantized

Tags: Text Generation, text-generation-inference, 4-bit precision


Model Card for llama3.1-8b-quantized

This is a 4-bit quantized version of Llama 3.1 8B, produced with bitsandbytes. More quantized LLMs are coming soon.
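The card does not include usage instructions, so the following is only a minimal sketch of how a pre-quantized bitsandbytes checkpoint like this one can typically be loaded with the transformers library. It assumes that transformers, accelerate, and bitsandbytes are installed, that a CUDA GPU is available, that the quantization settings are stored in the checkpoint's config, and that this repository ships tokenizer files (if it does not, the tokenizer would have to come from the base model repository). The helper names load_quantized and generate are hypothetical and not part of the card.

```python
# Repository name as shown on this page.
MODEL_ID = "akshathmangudi/llama3.1-8b-quantized"


def load_quantized(model_id: str = MODEL_ID):
    """Load the tokenizer and the pre-quantized model (hypothetical helper)."""
    # Imported lazily so this module can be imported without the heavy dependencies.
    from transformers import AutoModelForCausalLM, AutoTokenizer

    # Assumption: tokenizer files are present in this repository.
    tokenizer = AutoTokenizer.from_pretrained(model_id)
    # Assumption: the quantization settings are saved in the checkpoint config,
    # so no explicit quantization config is passed here.
    # device_map="auto" relies on accelerate to place the weights on the GPU.
    model = AutoModelForCausalLM.from_pretrained(model_id, device_map="auto")
    return tokenizer, model


def generate(prompt: str, max_new_tokens: int = 32) -> str:
    """Generate a continuation for a single prompt (hypothetical helper)."""
    tokenizer, model = load_quantized()
    inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
    output_ids = model.generate(**inputs, max_new_tokens=max_new_tokens)
    return tokenizer.decode(output_ids[0], skip_special_tokens=True)


# Example call (downloads the weights and needs a GPU):
# print(generate("The capital of France is"))
```

Loading happens inside the function rather than at import time, so the download and GPU allocation only occur when generate is actually called.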

Model Description

Developed by: Meta
Quantized by: Akshath Mangudi
My GitHub: https://github.com/akshathmangudi
My LinkedIn: https://www.linkedin.com/in/akshathmangudi/
License: llama3.1

Model Source

Base model repository: https://huggingface.co/meta-llama/Meta-Llama-3.1-8B


Safetensors
Model size: 4.65B params
Tensor types: BF16, F32, U8
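The reported size of 4.65B params can look surprising for an 8B model. One plausible explanation, sketched below, is that the count reflects stored tensor elements: 4-bit weights packed two per U8 byte, F32 scaling constants per block, and BF16 tensors left unquantized. The architecture values are assumed from the base model's published config, and the block size of 64, the absence of double quantization, and the unquantized embedding and output layers are assumptions about the bitsandbytes settings, none of which the card states. Small per-tensor metadata is ignored.

```python
# Architecture values for Llama 3.1 8B (assumed from the base model's config).
HIDDEN = 4096          # hidden size
INTERMEDIATE = 14336   # MLP intermediate size
LAYERS = 32            # number of transformer blocks
KV_DIM = 1024          # 8 key/value heads x head size 128
VOCAB = 128256         # vocabulary size
BLOCKSIZE = 64         # assumed bitsandbytes 4-bit block size


def stored_elements() -> dict:
    """Estimate stored tensor elements per dtype for a 4-bit checkpoint."""
    # q_proj and o_proj are HIDDEN x HIDDEN; k_proj and v_proj are HIDDEN x KV_DIM.
    attn = 2 * HIDDEN * HIDDEN + 2 * HIDDEN * KV_DIM
    # gate_proj, up_proj, and down_proj each hold HIDDEN x INTERMEDIATE weights.
    mlp = 3 * HIDDEN * INTERMEDIATE
    linear_weights = LAYERS * (attn + mlp)

    u8 = linear_weights // 2            # two 4-bit values packed per byte
    f32 = linear_weights // BLOCKSIZE   # one scaling constant per block
    norms = LAYERS * 2 * HIDDEN + HIDDEN
    bf16 = 2 * VOCAB * HIDDEN + norms   # embeddings, output head, norm weights

    return {"U8": u8, "F32": f32, "BF16": bf16, "total": u8 + f32 + bf16}


counts = stored_elements()
print(counts)
print(f"{counts['total'] / 1e9:.2f}B stored elements")  # prints "4.65B stored elements"
```

Under these assumptions the three dtypes sum to 4,649,652,224 elements, which matches the 4.65B figure and the BF16, F32, and U8 tensor types listed above.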
