chansung park's picture

chansung park PRO

chansung

·

AI & ML interests

None yet

Recent Activity

liked a model 6 days ago

google/diffusiongemma-26B-A4B-it

upvoted an article 13 days ago

AutoResearch on Diffusers' Pipeline for 10 Rounds on JarvisLabs

published an article 13 days ago

AutoResearch on Diffusers' Pipeline for 10 Rounds on JarvisLabs

View all activity

Organizations

Posts 21

Post

4865

YAML engineering becomes more and more important than ever from infra provisioning to model training (recipes).

Here, I built a simple editor first for @dstackai , and I will share the live endpoint this week. Let me know what you think about this approach.

Based on this approach, if people think this is useful, I am going to do the same thing for the LLM training recipes for popular frameworks such as Hugging Face open-r1, Axolotl, and so on. Let me hear.

Articles 8

Article

3

AutoResearch on Diffusers' Pipeline for 10 Rounds on JarvisLabs

View all Articles

Papers 3

arxiv:2602.15449

arxiv:2412.06071

arxiv:2408.13467

spaces 54

Paper Q&A

Explore papers with auto generated Q&As

Llama2 With Gradio Chat

Zero2Story

Create a custom story with characters and plot

Co Write With Llama2

LLMs As Chatbot

Open Model Playground

Generate text and images with customizable AI prompts

models 184

chansung/diffusion-dpo

chansung/Qwen2.5-7B-CCRL-CUR-BINARY2-ONLY-1E

Text Generation • 8B • Updated Nov 23, 2025 • 1

chansung/Qwen2.5-1.5B-CCRL-CUR-BINARY2-ONLY-1E

Text Generation • 2B • Updated Nov 23, 2025 • 2

chansung/Qwen2.5-7B-CCRL-CUR-BINARY-ONLY-1E

Text Generation • 8B • Updated Nov 22, 2025 • 1

chansung/Qwen2.5-1.5B-CCRL-CUR-BINARY-ONLY-1E

Text Generation • 2B • Updated Nov 21, 2025 • 1

chansung/Qwen2.5-Coder-7B-CCRL-CUR-BINARY-ONLY-1E

Updated Nov 20, 2025

chansung/Qwen2.5-Coder-7B-UCRL

8B • Updated Nov 9, 2025 • 3

chansung/Qwen2.5-1.5B-Open-R1-Code-GRPO

Text Generation • 2B • Updated Nov 4, 2025 • 92 •

chansung/Qwen3-4B-CCRL-CUR-VAR-ASCE-NORMAL-1E-LOG

4B • Updated Sep 23, 2025

chansung/Qwen3-4B-CCRL-CUR-VAR-ASCE-NORMAL-2E

Text Generation • 4B • Updated Sep 16, 2025 • 1

View 184 models

datasets 59

chansung/verifiable-coding-problems-python-v2

Viewer • Updated Apr 21, 2025 • 15.5k • 7

chansung/verifiable-coding-problems-python

Viewer • Updated Mar 29, 2025 • 949 • 22

chansung/openthoughts-coding-llama-factory

Viewer • Updated Mar 12, 2025 • 19.9k • 3

chansung/cqa_synth_ds

Viewer • Updated Jun 3, 2024 • 111k • 25

chansung/coding_synth_ds

Viewer • Updated Jun 3, 2024 • 116k • 17 • 1

chansung/classification_synth_ds

Viewer • Updated Jun 2, 2024 • 92.3k • 27

chansung/classification_synth_ds2

Viewer • Updated Jun 1, 2024 • 424 • 23

chansung/aaa3

Updated Jun 1, 2024 • 5

chansung/aaa2

Updated Jun 1, 2024 • 5

chansung/synth_summarize_dataset

Viewer • Updated May 31, 2024 • 880k • 348 • 2

View 59 datasets