Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Mark's picture
1 4

Mark

Makrrr
·

AI & ML interests

NLP, RLHF, IR

Recent Activity

liked a model 5 days ago
Makrrr/Qwen3-1.7B-GSM8K-GRPO-verl
updated a model 5 days ago
Makrrr/Qwen3-1.7B-GSM8K-GRPO-verl
published a model 5 days ago
Makrrr/Qwen3-1.7B-GSM8K-GRPO-verl
View all activity

Organizations

None yet

models 11

Makrrr/Qwen3-1.7B-GSM8K-GRPO-verl

Reinforcement Learning • 2B • Updated 5 days ago • 7 • 2

Makrrr/a2c-PandaReachDense-v3

Reinforcement Learning • Updated May 31 • 9

Makrrr/Pyramids

Reinforcement Learning • Updated May 30 • 13

Makrrr/ppo-SnowballTarget

Reinforcement Learning • Updated May 30 • 16

Makrrr/Pixelcopter-PLE-v0

Reinforcement Learning • Updated May 29

Makrrr/Cartpole-v1

Reinforcement Learning • Updated May 29

Makrrr/dqn-SpaceInvadersNoFrameskip-v4

Reinforcement Learning • Updated May 28 • 12

Makrrr/QTable-Taxi-V3

Reinforcement Learning • Updated May 28

Makrrr/q-FrozenLake-v1-4x4-noSlippery

Reinforcement Learning • Updated May 28

Makrrr/ppo-Huggy

Reinforcement Learning • Updated May 27 • 35
View 11 models

datasets 0

None public yet
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs