Text Generation
Transformers
Safetensors
llama
sparse
instruct
text-generation-inference
mgoin's picture
Convert model to BFloat16 and shard using SafeTensors
74b9f6d