Shihan Dou
Ablustrund
2 followers · 3 following
AI & ML interests
Natural Language Processing, Large Language Models
Recent Activity
authored a paper 5 days ago
Loose lips sink ships: Mitigating Length Bias in Reinforcement Learning from Human Feedback
authored a paper 5 days ago
Improving Generalization of Alignment with Human Preferences through Group Invariant Learning
authored a paper 5 days ago
LoRAMoE: Revolutionizing Mixture of Experts for Maintaining World Knowledge in Language Model Alignment
Ablustrund's activity
liked 4 datasets over 1 year ago
vikp/evol_instruct_v2_filtered_109k • Updated Aug 29, 2023 • 110k • 11 • 3
mrqa-workshop/mrqa • Updated Jan 24, 2024 • 585k • 480 • 24
lucadiliello/naturalquestionsshortqa • Updated Jun 6, 2023 • 117k • 136 • 3
openbmb/UltraFeedback • Updated Dec 29, 2023 • 64k • 1.84k • 374
liked 2 models about 2 years ago
fnlp/moss-rlhf-sft-model-7B-en • Updated Jul 14, 2023 • 2
Ablustrund/moss-rlhf-reward-model-7B-zh • Updated Jul 13, 2023 • 1 • 23