Wei Xiong

weqweasdas

AI & ML interests

Machine learning, RLHF

Recent Activity

updated a dataset 1 day ago
weqweasdas/qwen7b_prompt_difficult
published a dataset 1 day ago
weqweasdas/qwen7b_prompt_difficult
updated a dataset 2 days ago
weqweasdas/qwen7b_openr1_with_scores_sub

Organizations

reward modeling, raft_study, Directional Preference Alignment, RLHFlow, RRLHF, TIRData, feedbackagent, myselfrew, selfcorrexp, selfcorrexp2, mytestdpo, tmpmodelsave, qwselfcorr, dsrtrain, dsrselfcorr, ptllama, raftstudy, Reinforce