Text Generation
Transformers
PyTorch
Chinese
llama
text-generation-inference
Inference Endpoints
4-bit precision
gptq

Commit History

Upload tokenizer
40cdb61

q-allen commited on

Upload LlamaForCausalLM
68e40c0

q-allen commited on

initial commit
e2d0b38

q-allen commited on