Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
4
2
26
HyunseokLee
hyunseoki
Follow
ikhyeon's profile picture
Minnyeong's profile picture
kimtaey's profile picture
6 followers
·
4 following
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
29 days ago
Robot-R1: Reinforcement Learning for Enhanced Embodied Reasoning in Robotics
liked
a dataset
about 1 month ago
psiyum/winemag-feb-2019
upvoted
a
paper
about 1 month ago
Reasoning Model is Stubborn: Diagnosing Instruction Overriding in Reasoning Models
View all activity
Organizations
hyunseoki
's models
25
Sort: Recently updated
hyunseoki/qwen2.5-vl-7b-rft-crop
Image-Text-to-Text
•
8B
•
Updated
May 12
•
1.75k
hyunseoki/qwen2.5-vl-rft-crop0.3
Image-Text-to-Text
•
4B
•
Updated
May 12
•
12
•
1
hyunseoki/qwen2.5-vl-grpo-continue
Image-Text-to-Text
•
4B
•
Updated
Apr 22
•
12
hyunseoki/qwen2.5-vl-grpo
Image-Text-to-Text
•
4B
•
Updated
Apr 21
•
7
hyunseoki/finetune-llama-3.1-8b-gsm8k
Text Generation
•
8B
•
Updated
Mar 31
•
12
hyunseoki/Qwen2.5-1.5B-Open-R1-GRPO-test-new
Updated
Feb 10
hyunseoki/llama3.2-1b-Open-R1-GRPO-test0
Text Generation
•
1B
•
Updated
Feb 10
•
17
•
1
hyunseoki/Qwen2.5-1.5B-Open-R1-GRPO-metamath-test
2B
•
Updated
Feb 8
•
11
hyunseoki/llama3.2-1b-Open-R1-GRPO-test5
1B
•
Updated
Feb 7
•
17
hyunseoki/Qwen2.5-1.5B-Instruct-Open-R1-GRPO-test5
Updated
Feb 7
hyunseoki/Qwen2.5-1.5B-Instruct-Open-R1-GRPO-test4
Text Generation
•
2B
•
Updated
Feb 6
•
10
hyunseoki/Qwen2.5-1.5B-Open-R1-GRPO-test5
Text Generation
•
2B
•
Updated
Feb 6
•
6
hyunseoki/Qwen2.5-1.5B-Open-R1-GRPO-test3
Updated
Feb 6
hyunseoki/Qwen2.5-1.5B-Open-R1-GRPO-test2
2B
•
Updated
Feb 6
•
10
hyunseoki/Qwen2.5-1.5B-Instruct-Open-R1-GRPO-test2
Updated
Feb 6
hyunseoki/Qwen2.5-1.5B-Instruct-Open-R1-GRPO-test
2B
•
Updated
Feb 6
•
10
hyunseoki/Qwen2.5-1.5B-Open-R1-GRPO-test
2B
•
Updated
Feb 6
•
9
hyunseoki/Qwen2.5-1.5B-Open-R1-GRPO
2B
•
Updated
Feb 5
•
10
hyunseoki/Qwen2.5-1.5B-Open-GRPO-test
Updated
Feb 4
hyunseoki/llama-3.1-8B-thesis-aligned
Text Generation
•
8B
•
Updated
Nov 4, 2024
•
1.7k
hyunseoki/llama-3.1-8B-thesis-sft
Text Generation
•
8B
•
Updated
Oct 28, 2024
•
15
hyunseoki/ReMoDetect-deberta
0.4B
•
Updated
Sep 26, 2024
•
173
•
1
hyunseoki/ko-ref-llama2-7b
Text Generation
•
Updated
Oct 4, 2023
•
4.02k
•
3
hyunseoki/ko-ref-llama2-13b
Text Generation
•
Updated
Oct 4, 2023
•
4.01k
•
1
hyunseoki/ko-en-llama2-13b
Text Generation
•
Updated
Oct 3, 2023
•
4.12k
•
27