Edit Models filters

Inference Providers

Nebius AI Studio

HF Inference API

Misc

8-bit precision

Inference Endpoints

text-generation-inference

4-bit precision

Mixture of Experts

text-embeddings-inference

Carbon Emissions

Models

25,988

Full-text search

Active filters: 8-bit

ek826/instruct-gpt-j-fp16-8bit

Text Generation • Updated May 18, 2023 • 9

Ransaka/mbart-large-cc25-8bit

Text2Text Generation • Updated Oct 2, 2023 • 7

alexpaul/QI-Large-v1

Text Generation • Updated May 1, 2023 • 20

rsonavane/flan-t5-xl-alpaca-dolly-lora-peft

Text2Text Generation • Updated May 5, 2023 • 35 • 1

MrNJK/gpt2-xl-sft-int8

Text Generation • Updated May 5, 2023 • 13

s3nh/tiny-gpt2-instruct-polish

Text Generation • Updated May 5, 2023 • 65

dodosconundrum/alpaca_final_8bit

Text Generation • Updated May 6, 2023 • 14

polmeladianos/oasst-sft-4-pythia-12b-epoch-3.5m-8bit

Text Generation • Updated May 6, 2023 • 11

universonic/llama-7b-8bit

Text Generation • Updated Sep 22, 2023 • 11

santhosh97/eleuther-gpt-j-6B

Text Generation • Updated May 10, 2023 • 12

santhosh97/gpt-pythia-6.9b-quantized

Text Generation • Updated May 10, 2023 • 22

santhosh97/gpt-mosaic-int8

Text Generation • Updated May 10, 2023 • 9

santhosh97/gpt-mosaic-7b-int8

Text Generation • Updated May 10, 2023 • 10

santhosh97/gpt-pythia-12b-quantized

Text Generation • Updated May 11, 2023 • 16

ladygaia/alpaca-8bit

Text Generation • Updated May 15, 2023 • 8

jasonmcaffee/flan-t5-large-samsum

Text2Text Generation • Updated May 17, 2023 • 8 • 2

WHJ1998/stablelm-7b-sft-v7-epoch-3-int8

Text Generation • Updated May 17, 2023 • 11

WHJ1998/oasst-sft-4-pythia-12b-epoch-int8

Text Generation • Updated May 17, 2023 • 10

WHJ1998/oasst-sft-4-pythia-12b-epoch-int8-1GB

Text Generation • Updated May 17, 2023 • 10

reasonwang/flan-t5-xl-8bit

Text2Text Generation • Updated May 18, 2023 • 9

devanand7800/pygmalion-1.3b

Text Generation • Updated May 19, 2023 • 17

zh-tw-llm-dv/zh-tw-llm-ta01-pythia-6.9b-ta8000-v1-a_1_embeddings-h100-t01-c5daa1-8bit

Text Generation • Updated May 19, 2023 • 16

zetavg/zh-tw-llm-ta01-pythia-6.9b-ta8000-v1-a_1_embeddings-h100-t01-c5daa1-8bit-2

Text Generation • Updated May 19, 2023 • 14

Chauhanhp10/test2

Text Generation • Updated May 20, 2023 • 22

WHJ1998/Ziya-LLaMA-13B-v1-in8

Text Generation • Updated Jun 11, 2023 • 12

Shoubhik8/bloom-1b7-no_lora-finetuned_v2

Text Generation • Updated May 24, 2023 • 14

Grammonde/dolly-v2-meadow-patient-info-fine-tune

Text Generation • Updated May 26, 2023 • 10

RajuKandasamy/twinsights_3b_alpha_8bit

Text Generation • Updated May 26, 2023 • 10

rockerBOO/stablelm-tuned-alpha-3b-8bit

Text Generation • Updated Sep 6, 2024 • 17 • 3

kyo-takano/open-calm-7b-8bit

Text Generation • Updated May 28, 2023 • 20 • 10