Edit Models filters

Apps

Docker Model Runner

Inference Providers

HF Inference API

Misc

Inference Endpoints

text-generation-inference

4-bit precision

8-bit precision

text-embeddings-inference

Carbon Emissions

Mixture of Experts

Models

504

Full-text search

Active filters: RLHF

mradermacher/JSL-MedMNX-7B-SFT-i1-GGUF

7B • Updated Jan 19 • 367

govindrhf/aaditya-Llama3-OpenBioLLM-70B

Updated Feb 5 • 4

mradermacher/Starling-LM-11B-alpha-GGUF

11B • Updated Feb 9 • 186 • 1

mradermacher/Starling-LM-11B-alpha-i1-GGUF

11B • Updated Feb 10 • 497 • 1

dhruvrnaik/test-openbiollm

Updated Feb 14 • 5

yukiarimo/yuna-ai-v4

Text Generation • 8B • Updated Feb 14 • 136 • 3

yukiarimo/yuna-ai-v4-full

Text Generation • 8B • Updated Feb 14 • 7 • 3

xi0v/tempesthenno-ppo-ckpt40-archive

15B • Updated Mar 4

chucre/Llama3-OpenBioLLM-70B

Updated Mar 6 • 4

estnafinema0/smolLM-variation-dpo

Text Generation • 0.1B • Updated Mar 30 • 5

estnafinema0/smolLM-variation-ppo

Text Generation • 0.1B • Updated Mar 30 • 63

mradermacher/TC-instruct-DPO-GGUF

7B • Updated 15 days ago • 216

tensorblock/tanamettpk_TC-instruct-DPO-GGUF

7B • Updated 16 days ago • 69

tensorblock/CallComply_Starling-LM-11B-alpha-GGUF

11B • Updated 16 days ago • 37

Compumacy/OpenBioLLm-70B

Updated May 12 • 2

ETI-Deploy/DM-BaseModel-4Bit

Text Generation • 37B • Updated Jun 2 • 2

mradermacher/Harmless-RewardModel-GGUF

0.1B • Updated 15 days ago • 753

NiuTrans/GRAM-Qwen3-1.7B-RewardModel

2B • Updated 30 days ago • 185 • 3

NiuTrans/GRAM-Qwen3-14B-RewardModel

15B • Updated 30 days ago • 96 • 3

NiuTrans/GRAM-LLaMA3.2-3B-RewardModel

3B • Updated 30 days ago • 380 • 3

NiuTrans/GRAM-Qwen3-4B-RewardModel

4B • Updated 30 days ago • 10 • 2

NiuTrans/GRAM-Qwen3-8B-RewardModel

8B • Updated 30 days ago • 17 • 3

jg940101/5da3878a-e5c1-41a2-96a9-4f445a29bfd2

8B • Updated 30 days ago • 7

mario-rc/gemma-2-9b-it-emotional-rlaif-dpo

Updated 15 days ago • 44