Cross-lingual Transfer of Reward Models in Multilingual Alignment
Paper
•
2410.18027
•
Published
This is the collection of synthetic preference data and trained reward models in "Cross-lingual Transfer of Reward Models in Multilingual Alignment".