AI-Slop to AI-Polish? Aligning Language Models through Edit-Based Writing Rewards and Test-time Computation
Paper
•
2504.07532
•
Published
https://github.com/salesforce/creativity_eval/tree/main/WritingRewards/training/Llama/data