Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
TsinghuaC3I 's Collections
SSRL
UltraMedical

SSRL

updated 7 days ago
Upvote
2

  • TsinghuaC3I/SSRL

    Preview • Updated 19 days ago • 40 • 2

  • TsinghuaC3I/Llama-3.1-8B-Instruct-SSRL

    Text Generation • 8B • Updated 20 days ago • 26

  • TsinghuaC3I/Llama-3.2-3B-Instruct-SSRL

    Text Generation • 4B • Updated 20 days ago • 10

  • TsinghuaC3I/Qwen2.5-7B-Instruct-SSRL

    Text Generation • 8B • Updated 20 days ago • 9

  • TsinghuaC3I/Qwen2.5-3B-Instruct-SSRL

    Text Generation • 3B • Updated 20 days ago • 8

  • SSRL: Self-Search Reinforcement Learning

    Paper • 2508.10874 • Published 10 days ago • 88
Upvote
2
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs