Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Daniil Tiapkin's picture
2 3

Daniil Tiapkin

dtiapkin
eliebak's profile picture tgritsaev's profile picture BayesianMonster's profile picture
·
https://d-tiapkin.github.io/
  • dtiapkin
  • d-tiapkin
  • daniil-tiapkin-049714240
  • dtiapkin.bsky.social

AI & ML interests

Reinforcement learning enjoyer

Recent Activity

authored a paper 10 days ago
Accelerating Nash Learning from Human Feedback via Mirror Prox
upvoted a paper 10 days ago
Accelerating Nash Learning from Human Feedback via Mirror Prox
commented on a paper 10 days ago
Accelerating Nash Learning from Human Feedback via Mirror Prox
View all activity

Organizations

None yet

Papers 4

arxiv:2505.19731
arxiv:2502.02671
arxiv:2310.17303
arxiv:2303.08059

models 3

dtiapkin/RL-Course-ppo-LunarLander-v2

Reinforcement Learning • Updated Feb 21, 2023 • 1

dtiapkin/ppo-LunarLander-v2-try2

Updated May 10, 2022

dtiapkin/ppo-LunalLander-v2

Reinforcement Learning • Updated May 10, 2022 • 1

datasets 0

None public yet
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs