-
-
-
-
-
-
Inference Providers
Active filters:
AWQ
QuantTrio/KAT-V1-40B-AWQ
Text Generation
•
7B
•
Updated
•
24
•
2
QuantTrio/Qwen3-VL-235B-A22B-Instruct-AWQ
Text Generation
•
Updated
•
824
•
1
QuantTrio/Qwen3-VL-235B-A22B-Thinking-AWQ
Text Generation
•
Updated
•
177
•
1
QuantTrio/GLM-4.6-AWQ
Text Generation
•
Updated
•
100
•
1
abhinavkulkarni/mosaicml-mpt-7b-instruct-w4-g128-awq
Text Generation
•
Updated
•
13
abhinavkulkarni/mosaicml-mpt-7b-chat-w4-g128-awq
Text Generation
•
1B
•
Updated
•
22
abhinavkulkarni/VMware-open-llama-7b-open-instruct-w4-g128-awq
Text Generation
•
Updated
•
12
abhinavkulkarni/VMware-open-llama-13b-open-instruct-w4-g128-awq
Text Generation
•
Updated
•
13
•
3
abhinavkulkarni/tiiuae-falcon-7b-instruct-w4-g64-awq
Text Generation
•
Updated
•
6
•
5
abhinavkulkarni/psmathur-orca_mini_v2_7b-w4-g128-awq
Text Generation
•
Updated
•
10
•
2
abhinavkulkarni/Salesforce-codegen25-7b-multi-w4-g128-awq
Text Generation
•
Updated
•
11
•
2
abhinavkulkarni/psmathur-orca_mini_v2_13b-w4-g128-awq
Text Generation
•
Updated
•
6
•
2
abhinavkulkarni/mosaicml-mpt-30b-instruct-w4-g128-awq
Text Generation
•
Updated
•
14
•
2
abhinavkulkarni/mosaicml-mpt-30b-chat-w4-g128-awq
Text Generation
•
4B
•
Updated
•
19
abhinavkulkarni/VMware-open-llama-7b-v2-open-instruct-w4-g128-awq
Text Generation
•
Updated
•
9
abhinavkulkarni/tiiuae-falcon-40b-instruct-w4-g128-awq
Text Generation
•
Updated
•
8
•
2
abhinavkulkarni/Salesforce-codegen25-7b-instruct-w4-g128-awq
Text Generation
•
Updated
•
7
•
3
abhinavkulkarni/meta-llama-Llama-2-7b-chat-hf-w4-g128-awq
Text Generation
•
Updated
•
14
•
6
abhinavkulkarni/meta-llama-Llama-2-13b-chat-hf-w4-g128-awq
Text Generation
•
Updated
•
8
•
1
abhinavkulkarni/stabilityai-StableBeluga-7B-w4-g128-awq
Text Generation
•
Updated
•
10
•
1
abhinavkulkarni/stabilityai-StableBeluga-13B-w4-g128-awq
Text Generation
•
Updated
•
7
•
1
abhinavkulkarni/codellama-CodeLlama-7b-Instruct-hf-w4-g128-awq
Text Generation
•
Updated
•
12
abhinavkulkarni/codellama-CodeLlama-7b-Python-hf-w4-g128-awq
Text Generation
•
Updated
•
925
abhinavkulkarni/codellama-CodeLlama-13b-Instruct-hf-w4-g128-awq
Text Generation
•
2B
•
Updated
•
6
abhinavkulkarni/codellama-CodeLlama-13b-Python-hf-w4-g128-awq
Text Generation
•
Updated
•
8
xDAN-AI/xDAN-L1-Chat-RL-v1-awq
Text Generation
•
1B
•
Updated
solidrust/Noromaid-7B-0.4-DPO-AWQ
Text Generation
•
1B
•
Updated
•
6
•
1
solidrust/WestLake-7B-v2-AWQ
Text Generation
•
1B
•
Updated
•
5
•
4
solidrust/WestLake-7B-v2-laser-AWQ
Text Generation
•
1B
•
Updated
•
6
•
1
MaziyarPanahi/Mistral-7B-Instruct-v0.2-AWQ
Text Generation
•
1B
•
Updated
•
18
•
2