lime-nlp 's Collections

Synthetic Unanswerable Math (SUM)

A collection of models and dataset from the paper "The Hallucination Tax of Reinforcement Finetuning".