Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

Edit Models filters

Inference Providers
Fireworks
Cerebras
Together AI
Nscale
Cohere
Nebius AI Studio
Hyperbolic
Novita
fal
SambaNova
Replicate
HF Inference API
Misc
rlhf
Inference Endpoints
text-generation-inference
Merge
Eval Results
4-bit precision
8-bit precision
custom_code
Mixture of Experts

Misc with no match

text-embeddings-inference
Carbon Emissions

Models

336
Full-text search
Active filters: rlhf

mradermacher/beaver-7b-v1.0-GGUF

Reinforcement Learning • Updated Apr 5 • 24

loganlin777/mistral-7b-dpo-adapter

Updated Apr 27

tensorblock/mlabonne_NeuralDaredevil-7B-GGUF

Updated May 1 • 125

BryanADA/Qwen2.5-3B-cot-zh-tw

Text Generation • Updated 14 days ago • 65 • 1

mradermacher/RewardAnything-8B-v1-GGUF

Updated 2 days ago • 164

Pierizvi/infused-reasoning-phi2

Text Generation • Updated 3 days ago
  • Previous
  • 1
  • ...
  • 10
  • 11
  • 12
  • Next
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs