Predicting the Order of Upcoming Tokens Improves Language Modeling Paper โข 2508.19228 โข Published 18 days ago โข 21