Diffusion vs. Autoregressive Language Models: A Text Embedding Perspective Paper • 2505.15045 • Published 17 days ago • 53
Article Introducing HalluMix: A Task-Agnostic, Multi-Domain Benchmark for Detecting Hallucinations in Real-World Scenarios By quotientai and 3 others • May 2 • 19
ARES: An Automated Evaluation Framework for Retrieval-Augmented Generation Systems Paper • 2311.09476 • Published Nov 16, 2023 • 6
Article Good answers are not necessarily factual answers: an analysis of hallucination in leading LLMs By davidberenstein1957 and 1 other • about 1 month ago • 35