Gated Slot Attention for Efficient Linear-Time Sequence Modeling Paper • 2409.07146 • Published Sep 11, 2024
Parallelizing Linear Transformers with the Delta Rule over Sequence Length Paper • 2406.06484 • Published Jun 10, 2024
Non-autoregressive Text Editing with Copy-aware Latent Alignments Paper • 2310.07821 • Published Oct 11, 2023
Semantic Role Labeling as Dependency Parsing: Exploring Latent Tree Structures Inside Arguments Paper • 2110.06865 • Published Oct 13, 2021
Efficient Second-Order TreeCRF for Neural Dependency Parsing Paper • 2005.00975 • Published May 3, 2020