Exllama v2 Quantizations of Tess-v2.5.2-Qwen2-72B
Using turboderp's ExLlamaV2 v0.0.21 for quantization.
Original model: https://huggingface.co/migtissera/Tess-v2.5.2-Qwen2-72B
- Downloads last month
- 1
This model does not have enough activity to be deployed to Inference API (serverless) yet. Increase its social
visibility and check back later, or deploy to Inference Endpoints (dedicated)
instead.