Edit Models filters

Inference Providers

HF Inference API

Misc

Inference Endpoints

text-generation-inference

4-bit precision

8-bit precision

text-embeddings-inference

Mixture of Experts

Carbon Emissions

Models

598

Full-text search

Active filters: AWQ

QuantTrio/KAT-V1-40B-AWQ

Text Generation • 7B • Updated 27 days ago • 24 • 2

QuantTrio/Qwen3-VL-235B-A22B-Instruct-AWQ

Text Generation • Updated 3 days ago • 824 • 1

QuantTrio/Qwen3-VL-235B-A22B-Thinking-AWQ

Text Generation • Updated 3 days ago • 177 • 1

QuantTrio/GLM-4.6-AWQ

Text Generation • Updated about 5 hours ago • 100 • 1

abhinavkulkarni/mosaicml-mpt-7b-instruct-w4-g128-awq

Text Generation • Updated Sep 12, 2023 • 13

abhinavkulkarni/mosaicml-mpt-7b-chat-w4-g128-awq

Text Generation • 1B • Updated Feb 21, 2024 • 22

abhinavkulkarni/VMware-open-llama-7b-open-instruct-w4-g128-awq

Text Generation • Updated Sep 12, 2023 • 12

abhinavkulkarni/VMware-open-llama-13b-open-instruct-w4-g128-awq

Text Generation • Updated Sep 12, 2023 • 13 • 3

abhinavkulkarni/tiiuae-falcon-7b-instruct-w4-g64-awq

Text Generation • Updated Sep 12, 2023 • 6 • 5

abhinavkulkarni/psmathur-orca_mini_v2_7b-w4-g128-awq

Text Generation • Updated Sep 12, 2023 • 10 • 2

abhinavkulkarni/Salesforce-codegen25-7b-multi-w4-g128-awq

Text Generation • Updated Sep 12, 2023 • 11 • 2

abhinavkulkarni/psmathur-orca_mini_v2_13b-w4-g128-awq

Text Generation • Updated Sep 12, 2023 • 6 • 2

abhinavkulkarni/mosaicml-mpt-30b-instruct-w4-g128-awq

Text Generation • Updated Sep 12, 2023 • 14 • 2

abhinavkulkarni/mosaicml-mpt-30b-chat-w4-g128-awq

Text Generation • 4B • Updated Jun 3, 2024 • 19

abhinavkulkarni/VMware-open-llama-7b-v2-open-instruct-w4-g128-awq

Text Generation • Updated Sep 12, 2023 • 9

abhinavkulkarni/tiiuae-falcon-40b-instruct-w4-g128-awq

Text Generation • Updated Sep 12, 2023 • 8 • 2

abhinavkulkarni/Salesforce-codegen25-7b-instruct-w4-g128-awq

Text Generation • Updated Sep 12, 2023 • 7 • 3

abhinavkulkarni/meta-llama-Llama-2-7b-chat-hf-w4-g128-awq

Text Generation • Updated Sep 12, 2023 • 14 • 6

abhinavkulkarni/meta-llama-Llama-2-13b-chat-hf-w4-g128-awq

Text Generation • Updated Sep 12, 2023 • 8 • 1

abhinavkulkarni/stabilityai-StableBeluga-7B-w4-g128-awq

Text Generation • Updated Sep 12, 2023 • 10 • 1

abhinavkulkarni/stabilityai-StableBeluga-13B-w4-g128-awq

Text Generation • Updated Sep 12, 2023 • 7 • 1

abhinavkulkarni/codellama-CodeLlama-7b-Instruct-hf-w4-g128-awq

Text Generation • Updated Sep 12, 2023 • 12

abhinavkulkarni/codellama-CodeLlama-7b-Python-hf-w4-g128-awq

Text Generation • Updated Sep 12, 2023 • 925

abhinavkulkarni/codellama-CodeLlama-13b-Instruct-hf-w4-g128-awq

Text Generation • 2B • Updated Sep 17, 2023 • 6

abhinavkulkarni/codellama-CodeLlama-13b-Python-hf-w4-g128-awq

Text Generation • Updated Sep 12, 2023 • 8

xDAN-AI/xDAN-L1-Chat-RL-v1-awq

Text Generation • 1B • Updated Dec 25, 2023

solidrust/Noromaid-7B-0.4-DPO-AWQ

Text Generation • 1B • Updated Mar 2, 2024 • 6 • 1

solidrust/WestLake-7B-v2-AWQ

Text Generation • 1B • Updated Feb 29, 2024 • 5 • 4

solidrust/WestLake-7B-v2-laser-AWQ

Text Generation • 1B • Updated Feb 27, 2024 • 6 • 1

MaziyarPanahi/Mistral-7B-Instruct-v0.2-AWQ

Text Generation • 1B • Updated Feb 9, 2024 • 18 • 2