LeGrad: An Explainability Method for Vision Transformers via Feature Formation Sensitivity. Paper • 2404.03214 • Published Apr 4, 2024
Grounding Everything: Emerging Localization Properties in Vision-Language Transformers. Paper • 2312.00878 • Published Dec 1, 2023