HyunseokLee's picture

4 2 26

HyunseokLee

hyunseoki

·

AI & ML interests

None yet

Recent Activity

upvoted a paper 29 days ago

Robot-R1: Reinforcement Learning for Enhanced Embodied Reasoning in Robotics

liked a dataset about 1 month ago

psiyum/winemag-feb-2019

upvoted a paper about 1 month ago

Reasoning Model is Stubborn: Diagnosing Instruction Overriding in Reasoning Models

View all activity

Organizations

hyunseoki 's models 25

hyunseoki/qwen2.5-vl-7b-rft-crop

Image-Text-to-Text • 8B • Updated May 12 • 1.75k

hyunseoki/qwen2.5-vl-rft-crop0.3

Image-Text-to-Text • 4B • Updated May 12 • 12 • 1

hyunseoki/qwen2.5-vl-grpo-continue

Image-Text-to-Text • 4B • Updated Apr 22 • 12

hyunseoki/qwen2.5-vl-grpo

Image-Text-to-Text • 4B • Updated Apr 21 • 7

hyunseoki/finetune-llama-3.1-8b-gsm8k

Text Generation • 8B • Updated Mar 31 • 12

hyunseoki/Qwen2.5-1.5B-Open-R1-GRPO-test-new

hyunseoki/llama3.2-1b-Open-R1-GRPO-test0

Text Generation • 1B • Updated Feb 10 • 17 • 1

hyunseoki/Qwen2.5-1.5B-Open-R1-GRPO-metamath-test

2B • Updated Feb 8 • 11

hyunseoki/llama3.2-1b-Open-R1-GRPO-test5

1B • Updated Feb 7 • 17

hyunseoki/Qwen2.5-1.5B-Instruct-Open-R1-GRPO-test5

hyunseoki/Qwen2.5-1.5B-Instruct-Open-R1-GRPO-test4

Text Generation • 2B • Updated Feb 6 • 10

hyunseoki/Qwen2.5-1.5B-Open-R1-GRPO-test5

Text Generation • 2B • Updated Feb 6 • 6

hyunseoki/Qwen2.5-1.5B-Open-R1-GRPO-test3

hyunseoki/Qwen2.5-1.5B-Open-R1-GRPO-test2

2B • Updated Feb 6 • 10

hyunseoki/Qwen2.5-1.5B-Instruct-Open-R1-GRPO-test2

hyunseoki/Qwen2.5-1.5B-Instruct-Open-R1-GRPO-test

2B • Updated Feb 6 • 10

hyunseoki/Qwen2.5-1.5B-Open-R1-GRPO-test

2B • Updated Feb 6 • 9

hyunseoki/Qwen2.5-1.5B-Open-R1-GRPO

2B • Updated Feb 5 • 10

hyunseoki/Qwen2.5-1.5B-Open-GRPO-test

hyunseoki/llama-3.1-8B-thesis-aligned

Text Generation • 8B • Updated Nov 4, 2024 • 1.7k

hyunseoki/llama-3.1-8B-thesis-sft

Text Generation • 8B • Updated Oct 28, 2024 • 15

hyunseoki/ReMoDetect-deberta

0.4B • Updated Sep 26, 2024 • 173 • 1

hyunseoki/ko-ref-llama2-7b

Text Generation • Updated Oct 4, 2023 • 4.02k • 3

hyunseoki/ko-ref-llama2-13b

Text Generation • Updated Oct 4, 2023 • 4.01k • 1

hyunseoki/ko-en-llama2-13b

Text Generation • Updated Oct 3, 2023 • 4.12k • 27