Hugging Face

Models

6 results
Active filters: rlvr

SultanR/SmolTulu-1.7b-Reinforced-GGUF

Text Generation • Updated Dec 17, 2024 • 7 downloads • 1 like

thuml/rt1-world-model-multi-step-rlvr

Updated 14 days ago • 12

thuml/rt1-world-model-single-step-rlvr

Updated 14 days ago • 7

thuml/webarena-world-model-rlvr

Updated 14 days ago • 8

thuml/bytesized32-world-model-rlvr-binary-reward

Updated 14 days ago • 6

thuml/bytesized32-world-model-rlvr-task-specific-reward

Updated 14 days ago • 5