A collection of recent papers on NLG evaluation, highly relevant to components of LLM systems.
- Can Large Language Models Be an Alternative to Human Evaluations?
  Paper • 2305.01937 • Published
- Decontextualization: Making Sentences Stand-Alone
  Paper • 2102.05169 • Published
- RARR: Researching and Revising What Language Models Say, Using Language Models
  Paper • 2210.08726 • Published
- SummEval: Re-evaluating Summarization Evaluation
  Paper • 2007.12626 • Published