fla-hub/gsa-1.3B-100B
Text Generation
Safetensors
cerebras/SlimPajama-627B
English
fla
gsa
arxiv: 2409.07146
License: mit
Link model to paper #1
by nielsr (HF staff), opened Sep 21
base: refs/heads/main ← from: refs/pr/1
Files changed (1): +1 -0
README.md ADDED (+1 -0)
@@ -0,0 +1 @@
+Model of the paper [Gated Slot Attention for Efficient Linear-Time Sequence Modeling](https://huggingface.co/papers/2409.07146).