vwxyzjn's Collections

RLOO / PPOv2 TL;DR summarize checkpoints

updated Jun 11, 2024

  • vwxyzjn/ppo_tldr

    Text Generation • Updated May 24, 2024 • 16 • 1

  • vwxyzjn/ppo_tldr_6.9b

    Text Generation • Updated Jun 7, 2024 • 13

  • vwxyzjn/rloo_tldr

    Text Generation • Updated Jun 11, 2024 • 12

  • vwxyzjn/rloo_tldr_6.9b

    Text Generation • Updated Jun 7, 2024 • 11