Synthetic Unanswerable Math (SUM)

lime-nlp 's Collections

updated Jun 8

A collection of models and dataset from the paper "The Hallucination Tax of Reinforcement Finetuning".