Absolute Zero: Reinforced Self-play Reasoning with Zero Data Paper โข 2505.03335 โข Published May 6 โข 179 โข 9