Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Alex Su's picture
1 3

Alex Su

ssmmzz

AI & ML interests

None yet

Recent Activity

upvoted a collection about 1 month ago
Pixel-Reasoner
upvoted a paper about 2 months ago
Pixel Reasoner: Incentivizing Pixel-Space Reasoning with Curiosity-Driven Reinforcement Learning
upvoted a paper 3 months ago
VL-Rethinker: Incentivizing Self-Reflection of Vision-Language Models with Reinforcement Learning
View all activity

Organizations

None yet

ssmmzz 's models 10

ssmmzz/DeepSeek-R1-Distill-Qwen-1.5B-GRPO

Updated Mar 2

ssmmzz/axolotl-lora-checkpoint-181-correctness-only-merged

Updated Aug 31, 2024

ssmmzz/axolotl-lora-checkpoint-724-correctness-only

Updated Aug 30, 2024

ssmmzz/axolotl-lora-checkpoint-543-correctness-only

Updated Aug 30, 2024

ssmmzz/axolotl-lora-checkpoint-362-correctness-only

Updated Aug 30, 2024

ssmmzz/axolotl-lora-checkpoint-181-correctness-only

Updated Aug 30, 2024

ssmmzz/axolotl-lora-2000-complete

Updated Aug 30, 2024

ssmmzz/axolotl-lora-1500-complete

Updated Aug 30, 2024

ssmmzz/axolotl-lora-1000-complete

Updated Aug 30, 2024

ssmmzz/axolotl-lora-500-complete

Updated Aug 30, 2024
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs