Hugging Face
Rostislav Golubev

mika5883

AI & ML interests

None yet

Organizations

None yet

Collections 1

interesting
  • DPO Meets PPO: Reinforced Token Optimization for RLHF

    Paper • 2404.18922 • Published Apr 29, 2024 • 1

models 53

mika5883/qwen3-14b_rugec_v2

Updated Jun 24

mika5883/qwen3-14b_rugec

Updated Jun 1

mika5883/qwen3-4b_rugec

Updated May 27

mika5883/gec_t5_dpo_A_v2

0.2B • Updated May 27 • 4

mika5883/rugec_A_comet_v3

0.2B • Updated May 25 • 2

mika5883/gec_t5_dpo_A_v1

0.2B • Updated May 24 • 3

mika5883/gec_t5_dpo

0.2B • Updated May 23 • 2

mika5883/gec_Ae_yanArt

0.2B • Updated May 19 • 3

mika5883/t5_gec_test

Updated May 14

mika5883/MT5_large_A_art

1B • Updated Apr 20 • 3

datasets 0

None public yet