Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
2
3
Daniil Tiapkin
dtiapkin
Follow
eliebak's profile picture
tgritsaev's profile picture
BayesianMonster's profile picture
6 followers
·
6 following
https://d-tiapkin.github.io/
dtiapkin
d-tiapkin
daniil-tiapkin-049714240
dtiapkin.bsky.social
AI & ML interests
Reinforcement learning enjoyer
Recent Activity
authored
a paper
10 days ago
Accelerating Nash Learning from Human Feedback via Mirror Prox
upvoted
a
paper
10 days ago
Accelerating Nash Learning from Human Feedback via Mirror Prox
commented
on
a paper
10 days ago
Accelerating Nash Learning from Human Feedback via Mirror Prox
View all activity
Organizations
None yet
Papers
4
arxiv:
2505.19731
arxiv:
2502.02671
arxiv:
2310.17303
arxiv:
2303.08059
models
3
Sort: Recently updated
dtiapkin/RL-Course-ppo-LunarLander-v2
Reinforcement Learning
•
Updated
Feb 21, 2023
•
1
dtiapkin/ppo-LunarLander-v2-try2
Updated
May 10, 2022
dtiapkin/ppo-LunalLander-v2
Reinforcement Learning
•
Updated
May 10, 2022
•
1
datasets
0
None public yet