Shihan Dou

Ablustrund

AI & ML interests

Natural Language Processing, Large Language Models

Recent Activity

authored a paper 5 days ago

Loose lips sink ships: Mitigating Length Bias in Reinforcement Learning from Human Feedback

authored a paper 5 days ago

Improving Generalization of Alignment with Human Preferences through Group Invariant Learning

authored a paper 5 days ago

LoRAMoE: Revolutionizing Mixture of Experts for Maintaining World Knowledge in Language Model Alignment

liked 4 datasets over 1 year ago

vikp/evol_instruct_v2_filtered_109k

Updated Aug 29, 2023 • 110k • 11 • 3

mrqa-workshop/mrqa

Updated Jan 24, 2024 • 585k • 480 • 24

lucadiliello/naturalquestionsshortqa

Updated Jun 6, 2023 • 117k • 136 • 3

openbmb/UltraFeedback

Updated Dec 29, 2023 • 64k • 1.84k • 374

liked 2 models about 2 years ago

fnlp/moss-rlhf-sft-model-7B-en

Updated Jul 14, 2023 • 2

Ablustrund/moss-rlhf-reward-model-7B-zh

Updated Jul 13, 2023 • 1 • 23