Maintaining MTEB: Towards Long Term Usability and Reproducibility of Embedding Benchmarks Paper • 2506.21182 • Published Jun 26 • 2
Will It Still Be True Tomorrow? Multilingual Evergreen Question Classification to Improve Trustworthy QA Paper • 2505.21115 • Published May 27 • 139
Diagonal Batching Unlocks Parallelism in Recurrent Memory Transformers for Long Contexts Paper • 2506.05229 • Published Jun 5 • 37
Through the Looking Glass: Common Sense Consistency Evaluation of Weird Images Paper • 2505.07704 • Published May 12 • 29
Knowledge Distillation of Russian Language Models with Reduction of Vocabulary Paper • 2205.02340 • Published May 4, 2022
How Much Knowledge Can You Pack into a LoRA Adapter without Harming LLM? Paper • 2502.14502 • Published Feb 20 • 91
Adaptive Retrieval Without Self-Knowledge? Bringing Uncertainty Back Home Paper • 2501.12835 • Published Jan 22 • 4
LLM-Independent Adaptive RAG: Let the Question Speak for Itself Paper • 2505.04253 • Published May 7 • 13
Don't Fight Hallucinations, Use Them: Estimating Image Realism using NLI over Atomic Facts Paper • 2503.15948 • Published Mar 20
Complexity of Symbolic Representation in Working Memory of Transformer Correlates with the Complexity of a Task Paper • 2406.14213 • Published Jun 20, 2024 • 21
Uncertainty Guided Global Memory Improves Multi-Hop Question Answering Paper • 2311.18151 • Published Nov 29, 2023
BABILong: Testing the Limits of LLMs with Long Context Reasoning-in-a-Haystack Paper • 2406.10149 • Published Jun 14, 2024 • 53