LineRetriever: Planning-Aware Observation Reduction for Web Agents Paper • 2507.00210 • Published 27 days ago • 6
Article MIEB: The Benchmark That Stress-Tests Image-Text Embeddings Like Never Before By isaacchung and 2 others • Apr 24 • 14
Analyze Feature Flow to Enhance Interpretation and Steering in Language Models Paper • 2502.03032 • Published Feb 5 • 61
Mechanistic Permutability: Match Features Across Layers Paper • 2410.07656 • Published Oct 10, 2024 • 20
Extending the Massive Text Embedding Benchmark to French Paper • 2405.20468 • Published May 30, 2024 • 2