Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
Edit Models filters
Tasks
Libraries
Datasets
Languages
Licenses
Other
1
Inference Providers
Select all
Cohere
Nebius AI Studio
Hyperbolic
Together AI
Cerebras
Novita
fal
Replicate
SambaNova
Fireworks
HF Inference API
Misc
Reset Misc
RL
Inference Endpoints
text-generation-inference
AutoTrain Compatible
Misc with no match
Eval Results
Merge
4-bit precision
8-bit precision
custom_code
text-embeddings-inference
Carbon Emissions
Mixture of Experts
Apply filters
Models
30
Full-text search
Edit filters
Sort: Trending
Active filters:
RL
Clear all
Ihor/Text2Graph-R1-Qwen2.5-0.5b
Text Generation
•
Updated
Jan 30
•
159
•
20
Lyte/QuadConnect2.5-1.5B-v0.1.0b
Text Generation
•
Updated
Feb 28
•
44
•
1
mradermacher/QuadConnect2.5-1.5B-v0.1.0b-GGUF
Updated
Mar 1
•
328
•
1
mradermacher/Magellanic-Qwen-25B-R999-GGUF
Updated
Mar 5
•
128
•
1
mradermacher/Magellanic-Qwen-25B-R999-i1-GGUF
Updated
Mar 5
•
423
•
1
stanfordnlp/SteamSHP-flan-t5-xl
Text2Text Generation
•
Updated
Oct 10, 2023
•
41
•
43
stanfordnlp/SteamSHP-flan-t5-large
Text2Text Generation
•
Updated
Oct 10, 2023
•
38
•
33
SultanR/SmolTulu-1.7b-Reinforced
Text Generation
•
Updated
Dec 17, 2024
•
41
•
5
mradermacher/SmolTulu-1.7b-Reinforced-GGUF
Updated
Dec 18, 2024
•
55
JHuel/Mistral-Nemo-Instruct-2407_DPO_qlora
Reinforcement Learning
•
Updated
Jan 22
JHuel/Mistral-Nemo-Instruct-2407_ORPO
Text2Text Generation
•
Updated
Jan 22
tecosys/Nutaan-RL1
Reinforcement Learning
•
Updated
Feb 7
•
3
mradermacher/Text2Graph-R1-Qwen2.5-0.5b-GGUF
Updated
Feb 9
•
35
mradermacher/Text2Graph-R1-Qwen2.5-0.5b-i1-GGUF
Updated
Feb 9
•
51
mradermacher/QuadConnect2.5-0.5B-v0.0.3b-GGUF
Updated
Feb 22
•
154
mradermacher/QuadConnect2.5-0.5B-v0.0.8b-GGUF
Updated
Feb 26
•
41
Lyte/QuadConnect2.5-0.5B-v0.0.9b
Text Generation
•
Updated
Feb 27
•
76
mradermacher/QuadConnect2.5-0.5B-v0.0.9b-GGUF
Updated
Feb 27
•
187
VaidikML0508/llama3.2-3B-Instruct-DPO-16bits-V1
Text Generation
•
Updated
29 days ago
•
19
TEEN-D/squiral_maze
Reinforcement Learning
•
Updated
17 days ago
TEEN-D/Tabular_RL_For_Multi_Env
Reinforcement Learning
•
Updated
17 days ago
prithivMLmods/Mensa-Beta-14B-Instruct
Text Generation
•
Updated
4 days ago
•
24
mradermacher/Mensa-Beta-14B-Instruct-GGUF
Updated
2 days ago
•
231
mradermacher/Mensa-Beta-14B-Instruct-i1-GGUF
Updated
2 days ago
•
465
prithivMLmods/Venatici-Coder-14B-Y.2
Text Generation
•
Updated
2 days ago
•
9
mradermacher/Venatici-Coder-14B-Y.2-GGUF
Updated
about 7 hours ago
•
131
mradermacher/Venatici-Coder-14B-Y.2-i1-GGUF
Updated
about 7 hours ago
•
350
prithivMLmods/CEERS-2112-14B-Instruct
Text Generation
•
Updated
about 16 hours ago
•
9
mradermacher/CEERS-2112-14B-Instruct-GGUF
Updated
about 7 hours ago
•
19
mradermacher/CEERS-2112-14B-Instruct-i1-GGUF
Updated
about 7 hours ago
•
45