Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
minju's picture
1 17

minju

iaminju
saytes's profile picture jiongdao's profile picture Sangsang's profile picture
·

AI & ML interests

None yet

Organizations

prometheus-vision's profile picture multi-subject's profile picture cot_encyclopedia_human_eval's profile picture

iaminju 's models 13

iaminju/rlpvr_pref_only

2B • Updated Mar 28 • 3

iaminju/rlpvr_math_only

2B • Updated Mar 28 • 13

iaminju/DeepSeek-R1-Distill-Qwen-1.5B-GRPO_sample_83k_3

Updated Feb 28

iaminju/DeepSeek-R1-Distill-Qwen-1.5B-GRPO_sample_83k_2

2B • Updated Feb 28 • 3

iaminju/DeepSeek-R1-Distill-Qwen-1.5B-GRPO_sample_83k

2B • Updated Feb 27 • 2

iaminju/DeepSeek-R1-Distill-Qwen-1.5B-GRPO_sample_10k

Text Generation • 2B • Updated Feb 26 • 3

iaminju/DeepSeek-R1-Distill-Qwen-1.5B-GRPO_sample_1k

Text Generation • 2B • Updated Feb 26 • 3

iaminju/DeepSeek-R1-Distill-Qwen-1.5B-GRPO_nq_s_pref

Text Generation • 2B • Updated Feb 25 • 3

iaminju/DeepSeek-R1-Distill-Qwen-1.5B-GRPO_pref

Text Generation • 2B • Updated Feb 25 • 4

iaminju/DeepSeek-R1-Distill-Qwen-1.5B-GRPO_math_nq_s

Updated Feb 25

iaminju/DeepSeek-R1-Distill-Qwen-1.5B-GRPO_math_m

Text Generation • 2B • Updated Feb 25 • 3

iaminju/DeepSeek-R1-Distill-Qwen-1.5B-GRPO_nq_s

Updated Feb 24

iaminju/DeepSeek-R1-Distill-Qwen-1.5B-GRPO

Updated Feb 24
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs