Edit Models filters

Apps

Apps with no match

Inference Providers

Inference Providers with no match

HF Inference API

Misc

Reasoning-Course

Inference Endpoints

text-generation-inference

Misc with no match

4-bit precision

8-bit precision

text-embeddings-inference

Carbon Emissions

Mixture of Experts

Models

29

Full-text search

Active filters: Reasoning-Course

nharshavardhana/SmolGRPO-135M

Text Generation • 0.1B • Updated Mar 4 • 10

Lingyue1/SmolGRPO-135M

Text Generation • 0.1B • Updated Mar 5 • 11

t2190/SmolGRPO-135M

Text Generation • 0.1B • Updated Mar 6 • 14

t2190/GRPO_1

Text Generation • 0.5B • Updated Mar 12 • 12

kaweizhenpi/SmolGRPO-135M

Text Generation • 0.1B • Updated Mar 7 • 12

Shumatsurontek/SmolGRPO-135M

Text Generation • 0.1B • Updated Mar 9 • 10

skyimple/SmolGRPO-135M

Text Generation • 0.1B • Updated Mar 12 • 12 • 1

abdulsamad/SmolGRPO-135M

Text Generation • 0.1B • Updated Apr 6 • 51

tobrun/SmolLM2-135M-GRPO

Text Generation • 0.1B • Updated Mar 15 • 11

TharunSivamani/SmolGRPO-135M

Text Generation • 0.1B • Updated Mar 16 • 12

frascuchon/SmolGRPO-135M

Text Generation • 0.1B • Updated Mar 17 • 9

bhaveshgoel07/SmolGRPO-135M

Arushhh/SmolGRPO-135M

Text Generation • 0.1B • Updated Mar 24 • 12

czuo03/SmolGRPO-135M

Text Generation • 0.1B • Updated Mar 28 • 13

opria123/SmolGRPO-135M

Text Generation • 0.1B • Updated Apr 6 • 15

alonsosilva/SmolGRPO-135M

Text Generation • 0.1B • Updated Apr 8 • 11

alfredcs/gemma-3-12b-grpo-firstaid

garethpaul/SmolGRPO-135M

Text Generation • 0.1B • Updated May 8 • 6

Thabet/SmolGRPO-135M-learning

Text Generation • 0.1B • Updated May 10 • 10

jcollado/SmolGRPO-135M

Text Generation • 0.1B • Updated May 14 • 36

Brianpuz/SmolGRPO-135M

Text Generation • 0.1B • Updated May 19 • 8

yigitkucuk/tint-interact-sft-grpo

Text Generation • 0.4B • Updated May 19 • 7

koochikoo25/SmolGRPO-135M

Text Generation • 0.1B • Updated May 20 • 22

jackle33/SmolGRPO-135M

Text Generation • 0.1B • Updated May 22 • 7

alfredcs/torchrun-gemma-3-12b-grpo-icd10pcs-merged

Text Generation • 8B • Updated 24 days ago • 20

alfredcs/gemma-3-27b-grpo-med-merged

Image-Text-to-Text • Updated 12 days ago • 36

alfredcs/gemma-3-27b-firstaid-icd10-merged

Image-Text-to-Text • Updated 10 days ago • 23

mradermacher/gemma-3-27b-firstaid-icd10-merged-GGUF

28B • Updated 8 days ago • 194

jinlovespho/SmolGRPO-135M

Text Generation • 0.1B • Updated 4 days ago • 2