Edit Models filters

Inference Providers

Nebius AI Studio

HF Inference API

Misc

8-bit precision

Inference Endpoints

text-generation-inference

4-bit precision

Mixture of Experts

text-embeddings-inference

Carbon Emissions

Models

26,026

Full-text search

Active filters: 8-bit

kyo-takano/open-calm-7b-8bit

Text Generation • Updated May 28, 2023 • 21 • 10

pythainlp/wangchanglm-7.5B-sft-en-8bit

Text Generation • Updated May 29, 2023 • 19

pythainlp/wangchanglm-7.5B-sft-en-8bit-sharded

Text Generation • Updated May 31, 2023 • 44

legendhasit/gpt-j-8-bit

Text Generation • Updated May 29, 2023 • 18

rycont/kakaobrain__kogpt-6b-8bit

Text Generation • Updated May 30, 2023 • 14 • 2

beomi/polyglot-ko-12.8b-safetensors-8bit

Text Generation • Updated May 31, 2023 • 13 • 2

exbow/TinyStories-wikitrain-33m-ethan-8bit

Text Generation • Updated Jun 1, 2023 • 16

Sandiago21/llama-13b-hf-prompt-answering

Text Generation • Updated Feb 8 • 21 • 1

ichitaka/falcon-40b-instruct-8bit

Text Generation • Updated Jun 29, 2023 • 13 • 6

Sandiago21/llama-7b-hf-prompt-answering

Text Generation • Updated Jun 12, 2023 • 19 • 3

legendhasit/falcon-7b-instruct-8bit

Text Generation • Updated Jun 4, 2023 • 29

zhouning/lora-test

Text Generation • Updated Jun 5, 2023 • 13

exbow/gpt-neo-125m-test

Text Generation • Updated Jun 5, 2023 • 11

legendhasit/pythia-12b-deduped-synthetic-instruct-8bit

Text Generation • Updated Jun 6, 2023 • 14

rs224/bloom-1b7-8bit

Token Classification • Updated Jun 22, 2023 • 8

dotvignesh/raven-3b

Text2Text Generation • Updated Jun 8, 2023 • 5

DanceLab/cheese-llm-v1

Text Generation • Updated Jun 12, 2023 • 11

WHJ1998/Ziya-LLaMA-13B-v1.1-in8

Text Generation • Updated Jun 11, 2023 • 12

cassanof/santacoder-lua

Text Generation • Updated Jun 13, 2023 • 20 • 2

jckuri/FB-DLAI-Instruct-tune-v3

Text Generation • Updated Jun 17, 2023 • 23

kristian-a/bloomz-560m

Text Generation • Updated Jun 13, 2023 • 16

kristian-a/bloomz-560m-v2

Text Generation • Updated Jun 13, 2023 • 18

zhouning/lora-llm

Updated Jun 13, 2023 • 2

rahuldshetty/starchat-beta-8bit

Text Generation • Updated Jun 14, 2023 • 13

madatnlp/nllb-moe-54b-8bit

Text2Text Generation • Updated Jun 14, 2023 • 7 • 1

ngoc26/bloomz-7b1-mt-adapter-merged

Text Generation • Updated Jun 15, 2023 • 10

rahuldshetty/WizardCoder-15B-V1.0-8bit

Text Generation • Updated Jun 15, 2023 • 17 • 1

rere84/nineren

Text Generation • Updated Jun 16, 2023 • 27

rere84/renne2

Text Generation • Updated Jun 16, 2023 • 11

inarikami/falcon-40b-instruct-8bit

Text Generation • Updated Jun 23, 2023 • 11