Hugging Face
Models: 60,325
Sort: Trending
Active filters: reinforcement-learning
bguan/a2c-AntBulletEnv-v0 • Reinforcement Learning • Updated Nov 18, 2022 • 1
bguan/a2c-HalfCheetahBulletEnv-v0 • Reinforcement Learning • Updated Nov 18, 2022 • 2
LidoHon/ppo-LunarLander-v2 • Reinforcement Learning • Updated Nov 18, 2022 • 2 • 1
OSalem99/a2c-AntBulletEnv-v0 • Reinforcement Learning • Updated Nov 18, 2022 • 2
LidoHon/q-FrozenLake-v1-4x4-noSlippery • Reinforcement Learning • Updated Nov 19, 2022
Harrier/dqn-SpaceInvadersNoFrameskip-v4 • Reinforcement Learning • Updated Nov 19, 2022 • 3
yizhangliu/ppo-LunarLander-v2 • Reinforcement Learning • Updated Jan 10, 2023 • 3 • 1
Harrier/Reinforce-CartPole-0 • Reinforcement Learning • Updated Nov 20, 2022
Harrier/Reinforce-Pixelcopter-0 • Reinforcement Learning • Updated Nov 20, 2022
xaeroq/q-FrozenLake-v1-4x4-noSlippery • Reinforcement Learning • Updated Nov 20, 2022
xaeroq/q-Taxi-v3 • Reinforcement Learning • Updated Nov 20, 2022
bsmith0430/ppo-LunarLander-v2 • Reinforcement Learning • Updated Dec 22, 2022 • 1
Harrier/a2c-AntBulletEnv-v0 • Reinforcement Learning • Updated Nov 21, 2022 • 10
TUMxudashuai/ppo-LunarLander-v2 • Reinforcement Learning • Updated Nov 22, 2022 • 1
BeeBeaver/q-FrozenLake-v1-4x4-noSlippery • Reinforcement Learning • Updated Nov 22, 2022
SweepCake/LunarLander-v2-PPO-HFcourse • Reinforcement Learning • Updated Nov 22, 2022 • 1
motmono/Modified-Reinforce-PixelCopter • Reinforcement Learning • Updated Nov 22, 2022
juansebashr/ppo-LunarLander-v2 • Reinforcement Learning • Updated Mar 12, 2023 • 2
Chayo/ppo-LunarLander-v2 • Reinforcement Learning • Updated Nov 23, 2022 • 1
popolin52/q-FrozenLake-v1-4x4-noSlippery • Reinforcement Learning • Updated Nov 23, 2022
xaeroq/dqn-Qbert-v5 • Reinforcement Learning • Updated Nov 23, 2022 • 2
kontogiorgos/q-FrozenLake-v1-4x4-noSlippery • Reinforcement Learning • Updated Nov 23, 2022
phtutgo/Reinforce01 • Reinforcement Learning • Updated Nov 23, 2022
kontogiorgos/q-Taxi-v3 • Reinforcement Learning • Updated Nov 23, 2022
TUMxudashuai/DQN-LunarLander-v2 • Reinforcement Learning • Updated Nov 23, 2022 • 3
xaeroq/MLAgents-Pyramids • Reinforcement Learning • Updated Nov 23, 2022 • 3
xaeroq/ppo-MsPacman-v5 • Reinforcement Learning • Updated Nov 24, 2022 • 4
aspectcisco/ppo-LunarLander-v2 • Reinforcement Learning • Updated Nov 24, 2022 • 4
morgansoftware/LunarLander-v2 • Reinforcement Learning • Updated Nov 24, 2022 • 2
Galeros/a2c-AntBulletEnv-v0 • Reinforcement Learning • Updated Nov 24, 2022 • 2