Wei Xiong
weqweasdas
19 followers · 20 following
https://weixiongust.github.io/WeiXiongUST/index.html
AI & ML interests
Machine learning, RLHF
Recent Activity
upvoted a paper 3 days ago
Reinforce-Ada: An Adaptive Sampling Framework for Reinforce-Style LLM Training
commented on a paper 3 days ago
Reinforce-Ada: An Adaptive Sampling Framework for Reinforce-Style LLM Training
updated a dataset 6 days ago
weqweasdas/ultrafeedback_binarized_processed
weqweasdas's datasets (260)
weqweasdas/kumar_minvervalsecond • Updated Apr 29 • 272 • 4
weqweasdas/self_rewardingppo_minvervalsecond • Updated Apr 28 • 272 • 5
weqweasdas/self_rewardingppo_minverval • Updated Apr 28 • 272 • 5
weqweasdas/single_turn_minverval • Updated Apr 28 • 272 • 5
weqweasdas/kmr_07_step120_one_turn • Updated Apr 28 • 500 • 13
weqweasdas/ift_ppo_07_one_turn_conssitent_rm • Updated Apr 28 • 500 • 2
weqweasdas/ift_ppo_07_one_turn • Updated Apr 28 • 500 • 4
weqweasdas/kmr_07_step120 • Updated Apr 28 • 500 • 5
weqweasdas/kmr_05 • Updated Apr 28 • 500 • 4
weqweasdas/kmr_07 • Updated Apr 28 • 500 • 4
weqweasdas/cot_raft_07 • Updated Apr 28 • 500 • 4
weqweasdas/ift_07_one_turn • Updated Apr 28 • 500 • 9
weqweasdas/cot_07_2 • Updated Apr 28 • 500 • 5
weqweasdas/cot_07_1 • Updated Apr 28 • 500 • 5
weqweasdas/ift_ppo_07 • Updated Apr 28 • 500 • 7
weqweasdas/ift_07 • Updated Apr 28 • 500 • 6
weqweasdas/amc23 • Updated Mar 19 • 40 • 6
weqweasdas/minerva_math • Updated Mar 19 • 272 • 187
weqweasdas/olympiadbench • Updated Mar 19 • 675 • 186
weqweasdas/aime24 • Updated Mar 19 • 30 • 7
weqweasdas/math500 • Updated Mar 19 • 500 • 183
weqweasdas/medium • Updated Feb 14 • 10.7k • 7
weqweasdas/numia_hard • Updated Feb 14 • 29.2k • 14
weqweasdas/rs_numia30k • Updated Jan 30 • 30.6k • 5
weqweasdas/rs_math_train • Updated Jan 29 • 7.5k • 8
weqweasdas/rs_math_test • Updated Jan 29 • 5k • 10
weqweasdas/rs_gsm8k_test • Updated Jan 29 • 1.32k • 7
weqweasdas/rs_gsm8k_train • Updated Jan 29 • 7.47k • 9
weqweasdas/ace_processed • Updated Jan 26 • 5.18M • 45
weqweasdas/llama31_70b_chosen_type12_mix • Updated Jan 19 • 21.5k • 8