Cly
Akikaaa
AI & ML interests
None yet
Recent Activity
updated
a model
about 1 month ago
Akikaaa/ppo-SnowballTarget
published
a model
about 1 month ago
Akikaaa/ppo-SnowballTarget
updated
a model
about 1 month ago
Akikaaa/Reinforce-CartPole-v1
Organizations
None yet
Akikaaa's activity
Does this model apply SFT or SFT+RL during post-training?
1
#8 opened 2 months ago
by
Akikaaa