11 12 25

Shengyi Costa Huang

vwxyzjn

http://costa.sh

AI & ML interests

None yet

Recent Activity

liked a model about 1 month ago

deepseek-ai/DeepSeek-V3.1-Base

liked a model 4 months ago

deepseek-ai/DeepSeek-R1-0528

updated a dataset 5 months ago

vwxyzjn/the-algorithm-python

View all activity

Organizations

Articles 9

Article

122

How NuminaMath Won the 1st AIMO Progress Prize

Article

models 393

vwxyzjn/ppo_async

Updated Feb 5 • 3

vwxyzjn/ppo_sync

Updated Feb 5 • 2

vwxyzjn/online_dpo_sync

Updated Feb 5 • 3

vwxyzjn/online_dpo_async

Updated Feb 5 • 2

vwxyzjn/rm_zephyr_new

Text Classification • 7B • Updated Sep 26, 2024 • 4

vwxyzjn/online_dpo_vllm_thread_beta_0.03__allenai_open_instruct_dev

Updated Sep 11, 2024

vwxyzjn/reward_modeling__EleutherAI_pythia-14m

Updated Aug 24, 2024 • 4

vwxyzjn/online_dpo_vllm__vwxyzjn_btulu

Updated Aug 23, 2024 • 4

vwxyzjn/online_dpo_vllm__allenai_llama-3-tulu-2-8b

Updated Aug 19, 2024 • 3

vwxyzjn/btulu

Text Generation • 8B • Updated Aug 19, 2024 • 5

View 393 models

datasets 295

vwxyzjn/the-algorithm-python

Viewer • Updated May 5 • 608 • 24

vwxyzjn/rlvr_acecoder

Viewer • Updated May 5 • 87.1k • 29

vwxyzjn/rlvr_orz_math_72k_collection_extended

Viewer • Updated Apr 28 • 56.9k • 27

vwxyzjn/rlvr_orz_math_13k_collection_hard

Viewer • Updated Apr 28 • 56.9k • 29

vwxyzjn/rlvr_orz_math_57k_collected

Viewer • Updated Apr 28 • 56.9k • 28

vwxyzjn/acecoder_sft_gpt4o_test_cases_then_impl1

Viewer • Updated Apr 11 • 79.1k • 26

vwxyzjn/acecoder_sft_gpt4o_test_cases_then_impl_no_system_message

Viewer • Updated Apr 11 • 41.6k • 19 • 1

vwxyzjn/acecoder_sft_gpt4o_test_cases_then_impl

Viewer • Updated Apr 10 • 41.6k • 35

vwxyzjn/the-algorithm-python-debug

Viewer • Updated Apr 2 • 11 • 30

vwxyzjn/multiplication_train_1000_2x2-gsm8k-verifier

Viewer • Updated Mar 10 • 1k • 22

View 295 datasets

Shengyi Costa Huang

AI & ML interests

Recent Activity

Organizations

Articles 9

How NuminaMath Won the 1st AIMO Progress Prize

NuminaMath 是如何荣膺首届 AIMO 进步奖的？

Collections 4

Papers 10

spaces 4 Sort: Recently updated

Test

Aim

Vwxyzjn Testyes4

Pyserini Wikipedia Kilt Doc

models 393 Sort: Recently updated

datasets 295 Sort: Recently updated

spaces 4

models 393

datasets 295