Large Language Models Share Representations of Latent Grammatical Concepts Across Typologically Diverse Languages Paper • 2501.06346 • Published Jan 10 • 1
OLMoTrace: Tracing Language Model Outputs Back to Trillions of Training Tokens Paper • 2504.07096 • Published 5 days ago • 65
view article Article From Chunks to Blocks: Accelerating Uploads and Downloads on the Hub Feb 12 • 61
Archetypal SAE: Adaptive and Stable Dictionary Learning for Concept Extraction in Large Vision Models Paper • 2502.12892 • Published Feb 18 • 1
EuroBERT: Scaling Multilingual Encoders for European Languages Paper • 2503.05500 • Published Mar 7 • 76
A Close Look at Decomposition-based XAI-Methods for Transformer Language Models Paper • 2502.15886 • Published Feb 21 • 1
ReAct: Synergizing Reasoning and Acting in Language Models Paper • 2210.03629 • Published Oct 6, 2022 • 25
Building Bridges, Not Walls -- Advancing Interpretability by Unifying Feature, Data, and Model Component Attribution Paper • 2501.18887 • Published Jan 31 • 1
Sparse Autoencoders Trained on the Same Data Learn Different Features Paper • 2501.16615 • Published Jan 28 • 1
AxBench: Steering LLMs? Even Simple Baselines Outperform Sparse Autoencoders Paper • 2501.17148 • Published Jan 28 • 1