Models: 18

Active filters: efficiency

Shahradmz/HyenaDistilledPythia70M

Text Generation • Updated Jan 10, 2024

sapienzanlp/maverick-mes-litbank

Updated Aug 12, 2024 • 3 • 4

1-800-LLMs/Qwen-2.5-14B-Hindi

15B • Updated Feb 13 • 2

1024m/PHI-4-Hindi

15B • Updated Feb 13 • 2 • 1

1024m/PHI-4-Hindi-LoRA

large-traversaal/Mantra-14B

15B • Updated Apr 13 • 10 • 2

DrishtiSharma/qwen-2.5-14b

large-traversaal/Qwen-2.5-14B-Hindi

15B • Updated Mar 3 • 14 • 4

mradermacher/Qwen-2.5-14B-Hindi-GGUF

15B • Updated Jul 31 • 54 • 1

sst12345/CoRe2

Text-to-Image • Updated Mar 18 • 2

mradermacher/Mantra-14B-GGUF

15B • Updated Jul 11 • 48

mradermacher/Mantra-14B-i1-GGUF

15B • Updated Jul 11 • 153

codelion/Qwen3-0.6B-accuracy-recovery-lora

Text Generation • Updated Jul 13 • 16 • 1

GY2233/R2R_router_qwen3-1.7b

Text Classification • Updated Jul 22 • 3

GY2233/R2R_router_qwen3-4b

Text Classification • Updated Jul 23 • 4

GY2233/R2R_router_qwenr1

Text Classification • Updated Jul 24 • 3

lumees/lumees-362m-base

Text Generation • Updated 14 days ago • 46 • 1

5ivatej/tinyllama-1.1b-early-exit

Text Generation • Updated about 11 hours ago