Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
1
1
Cly
Akikaaa
Follow
AI & ML interests
None yet
Recent Activity
updated
a model
about 1 month ago
Akikaaa/ppo-SnowballTarget
published
a model
about 1 month ago
Akikaaa/ppo-SnowballTarget
updated
a model
about 1 month ago
Akikaaa/Reinforce-CartPole-v1
View all activity
Organizations
None yet
Akikaaa
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
Articles
updated
a model
about 1 month ago
Akikaaa/ppo-SnowballTarget
Reinforcement Learning
•
Updated
Feb 3
•
25
published
a model
about 1 month ago
Akikaaa/ppo-SnowballTarget
Reinforcement Learning
•
Updated
Feb 3
•
25
updated
a model
about 1 month ago
Akikaaa/Reinforce-CartPole-v1
Reinforcement Learning
•
Updated
Feb 2
published
a model
about 1 month ago
Akikaaa/Reinforce-CartPole-v1
Reinforcement Learning
•
Updated
Feb 2
updated
a model
about 1 month ago
Akikaaa/dqn-SpaceInvadersNoFrameskip-v4
Reinforcement Learning
•
Updated
Jan 31
•
8
published
a model
about 1 month ago
Akikaaa/dqn-SpaceInvadersNoFrameskip-v4
Reinforcement Learning
•
Updated
Jan 31
•
8
updated
a model
about 1 month ago
Akikaaa/Taxi-v3
Reinforcement Learning
•
Updated
Jan 30
published
a model
about 1 month ago
Akikaaa/Taxi-v3
Reinforcement Learning
•
Updated
Jan 30
updated
a model
about 1 month ago
Akikaaa/q-FrozenLake-v1-4x4-noSlippery
Reinforcement Learning
•
Updated
Jan 30
published
a model
about 1 month ago
Akikaaa/q-FrozenLake-v1-4x4-noSlippery
Reinforcement Learning
•
Updated
Jan 30
updated
a model
2 months ago
Akikaaa/ppo-LunarLander-v2
Reinforcement Learning
•
Updated
Jan 4
•
6
New activity in
Qwen/Qwen2.5-0.5B-Instruct
2 months ago
Does this model apply SFT or SFT+RL during post-training?
1
#8 opened 2 months ago by
Akikaaa
liked
a model
4 months ago
mlabonne/Meta-Llama-3-8B
Text Generation
•
Updated
May 2, 2024
•
87
•
1