A collection of models and dataset from the paper "The Hallucination Tax of Reinforcement Finetuning".