Collection of LLM hallucination and evaluation papers that I've been exploring and implementing. Some of them have my comments and annotated doodles.
-
Looking for a Needle in a Haystack: A Comprehensive Study of Hallucinations in Neural Machine Translation
Paper β’ 2208.05309 β’ Published β’ 1 -
LLM-Eval: Unified Multi-Dimensional Automatic Evaluation for Open-Domain Conversations with Large Language Models
Paper β’ 2305.13711 β’ Published β’ 2 -
Semantic Uncertainty: Linguistic Invariances for Uncertainty Estimation in Natural Language Generation
Paper β’ 2302.09664 β’ Published β’ 3 -
BARTScore: Evaluating Generated Text as Text Generation
Paper β’ 2106.11520 β’ Published β’ 2