Hugging Face
Alex Su
ssmmzz
AI & ML interests
None yet
Recent Activity
Upvoted a collection about 1 month ago: Pixel-Reasoner
Upvoted a paper about 2 months ago: Pixel Reasoner: Incentivizing Pixel-Space Reasoning with Curiosity-Driven Reinforcement Learning
Upvoted a paper 3 months ago: VL-Rethinker: Incentivizing Self-Reflection of Vision-Language Models with Reinforcement Learning
Organizations
None yet
models
10
ssmmzz/DeepSeek-R1-Distill-Qwen-1.5B-GRPO • Updated Mar 2
ssmmzz/axolotl-lora-checkpoint-181-correctness-only-merged • Updated Aug 31, 2024
ssmmzz/axolotl-lora-checkpoint-724-correctness-only • Updated Aug 30, 2024
ssmmzz/axolotl-lora-checkpoint-543-correctness-only • Updated Aug 30, 2024
ssmmzz/axolotl-lora-checkpoint-362-correctness-only • Updated Aug 30, 2024
ssmmzz/axolotl-lora-checkpoint-181-correctness-only • Updated Aug 30, 2024
ssmmzz/axolotl-lora-2000-complete • Updated Aug 30, 2024
ssmmzz/axolotl-lora-1500-complete • Updated Aug 30, 2024
ssmmzz/axolotl-lora-1000-complete • Updated Aug 30, 2024
ssmmzz/axolotl-lora-500-complete • Updated Aug 30, 2024
datasets
51
ssmmzz/carla25k • Updated Jan 9 • 2
ssmmzz/fuckyou • Viewer • Updated Sep 15, 2024 • 256 • 23
ssmmzz/label_Sim • Viewer • Updated Sep 5, 2024 • 279 • 25
ssmmzz/label_Coh • Viewer • Updated Sep 5, 2024 • 391 • 29
ssmmzz/label_Suc • Viewer • Updated Sep 5, 2024 • 511 • 27
ssmmzz/label_Cor • Viewer • Updated Sep 5, 2024 • 640 • 26
ssmmzz/label • Viewer • Updated Sep 2, 2024 • 18.2k • 42
ssmmzz/mixed_helsteer_combined_format • Viewer • Updated Sep 2, 2024 • 18.7k • 20
ssmmzz/label-only-correctness • Viewer • Updated Sep 2, 2024 • 6.55k • 22
ssmmzz/error_samples_only_test • Viewer • Updated Sep 1, 2024 • 1.74k • 28