Mark
Makrrr
0 followers · 1 following
AI & ML interests
NLP, RLHF, IR
Recent Activity
liked a model 5 days ago: Makrrr/Qwen3-1.7B-GSM8K-GRPO-verl
updated a model 5 days ago: Makrrr/Qwen3-1.7B-GSM8K-GRPO-verl
published a model 5 days ago: Makrrr/Qwen3-1.7B-GSM8K-GRPO-verl
Organizations
None yet
Models (11)
Makrrr/Qwen3-1.7B-GSM8K-GRPO-verl • Reinforcement Learning • 2B • Updated 5 days ago • 7 • 2
Makrrr/a2c-PandaReachDense-v3 • Reinforcement Learning • Updated May 31 • 9
Makrrr/Pyramids • Reinforcement Learning • Updated May 30 • 13
Makrrr/ppo-SnowballTarget • Reinforcement Learning • Updated May 30 • 16
Makrrr/Pixelcopter-PLE-v0 • Reinforcement Learning • Updated May 29
Makrrr/Cartpole-v1 • Reinforcement Learning • Updated May 29
Makrrr/dqn-SpaceInvadersNoFrameskip-v4 • Reinforcement Learning • Updated May 28 • 12
Makrrr/QTable-Taxi-V3 • Reinforcement Learning • Updated May 28
Makrrr/q-FrozenLake-v1-4x4-noSlippery • Reinforcement Learning • Updated May 28
Makrrr/ppo-Huggy • Reinforcement Learning • Updated May 27 • 35
Datasets (0)
None public yet