---
library_name: transformers
license: cc-by-4.0
base_model:
- HKUSTAudio/Llasa-3B
---

## Model Description

This is a 4-bit GPTQ quantization of Llasa-3B, the model from the HKUSTAudio team. I tested it with a script written by GitHub user nivibilla, linked below. I was not able to run this model on my RTX 3090, although the quantized Llasa-1B worked fine, and I have not identified the cause. Please let me know if you get it working.

## Model Sources

- **Repository:** HKUSTAudio/Llasa-3B
- **Paper:** LLaSA: Scaling Train-Time and Inference-Time Compute for LLaMA-based Speech Synthesis (Coming soon)
- **Test Script:** https://github.com/slives-lab/local-llasa-tts_voice/blob/main/llasa_vllm_longtext_inference.ipynb
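
For anyone debugging the RTX 3090 issue, a back-of-the-envelope estimate of the quantized weight size can help rule memory in or out. The sketch below is only an approximation under stated assumptions: the 3e9 parameter count is inferred from the model name, the group size of 128 with 16-bit scales and 4-bit zero points is a common GPTQ setup rather than a confirmed setting of this checkpoint, and the helper name `estimate_gptq_weight_gib` is made up for illustration. It ignores layers left unquantized, the KV cache, activations, and anything else the test script loads.

```python
def estimate_gptq_weight_gib(
    num_params: float,
    bits: int = 4,
    group_size: int = 128,   # assumption: not confirmed for this checkpoint
    scale_bits: int = 16,    # assumption: one fp16 scale per group
    zero_bits: int = 4,      # assumption: one 4-bit zero point per group
) -> float:
    """Rough size of group-quantized weights in GiB (weights only)."""
    # Each weight costs `bits`, plus its share of the per-group metadata.
    bits_per_weight = bits + (scale_bits + zero_bits) / group_size
    return num_params * bits_per_weight / 8 / 2**30


def fp16_weight_gib(num_params: float) -> float:
    """Size of the same weights stored as 16-bit floats, in GiB."""
    return num_params * 16 / 8 / 2**30


if __name__ == "__main__":
    n = 3e9  # assumption: parameter count taken from the "3B" in the model name
    print(f"fp16 weights:       {fp16_weight_gib(n):.2f} GiB")
    print(f"4-bit GPTQ weights: {estimate_gptq_weight_gib(n):.2f} GiB")
```

Under these assumptions the quantized weights come to a small fraction of the 24 GB on an RTX 3090, so weight size alone may not explain the failure. Treat this as a starting point for investigation, not a diagnosis.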