Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
thuml 's Collections
RLVR-World
Time Series Foundation Models
iVideoGPT

RLVR-World

updated May 26
Upvote
1

  • RLVR-World: Training World Models with Reinforcement Learning

    Paper • 2505.13934 • Published May 20 • 14

  • thuml/rt1-frame-tokenizer

    Updated May 22 • 14

  • thuml/rt1-world-model-single-step-base

    0.1B • Updated May 22 • 4

  • thuml/rt1-world-model-single-step-rlvr

    Updated May 26 • 6

  • thuml/rt1-compressive-tokenizer

    Updated May 22 • 7

  • thuml/rt1-world-model-multi-step-base

    0.1B • Updated May 22 • 4

  • thuml/rt1-world-model-multi-step-rlvr

    0.1B • Updated May 26 • 3

  • thuml/webarena-world-model-cot

    Viewer • Updated May 26 • 6.41k • 47

  • thuml/webarena-world-model-sft

    2B • Updated May 26 • 5

  • thuml/webarena-world-model-rlvr

    2B • Updated May 26 • 3

  • thuml/bytesized32-world-model-cot

    Viewer • Updated May 26 • 304k • 24

  • thuml/bytesized32-world-model-sft

    2B • Updated May 26 • 4

  • thuml/bytesized32-world-model-rlvr-binary-reward

    2B • Updated May 26 • 3

  • thuml/bytesized32-world-model-rlvr-task-specific-reward

    2B • Updated May 26 • 2
Upvote
1
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs