Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
TsinghuaC3I
's Collections
SSRL
UltraMedical
SSRL
updated
7 days ago
Upvote
2
TsinghuaC3I/SSRL
Preview
•
Updated
19 days ago
•
40
•
2
TsinghuaC3I/Llama-3.1-8B-Instruct-SSRL
Text Generation
•
8B
•
Updated
20 days ago
•
26
TsinghuaC3I/Llama-3.2-3B-Instruct-SSRL
Text Generation
•
4B
•
Updated
20 days ago
•
10
TsinghuaC3I/Qwen2.5-7B-Instruct-SSRL
Text Generation
•
8B
•
Updated
20 days ago
•
9
TsinghuaC3I/Qwen2.5-3B-Instruct-SSRL
Text Generation
•
3B
•
Updated
20 days ago
•
8
SSRL: Self-Search Reinforcement Learning
Paper
•
2508.10874
•
Published
10 days ago
•
88
Upvote
2
Share collection
View history
Collection guide
Browse collections