Article: A failed experiment: Infini-Attention, and why we should keep trying? • By neuralink and 2 others • Aug 14, 2024
Article: Bamba: Inference-Efficient Hybrid Mamba2 Model • By rganti and 28 others • Dec 18, 2024
Space: FineWeb: decanting the web for the finest text data at scale 🍷 • Generate high-quality web text data for LLM training
Paper: Transformers are SSMs: Generalized Models and Efficient Algorithms Through Structured State Space Duality • arXiv:2405.21060 • Published May 31, 2024
Paper: Caduceus: Bi-Directional Equivariant Long-Range DNA Sequence Modeling • arXiv:2403.03234 • Published Mar 5, 2024
Paper: Griffin: Mixing Gated Linear Recurrences with Local Attention for Efficient Language Models • arXiv:2402.19427 • Published Feb 29, 2024
Paper: Mamba: Linear-Time Sequence Modeling with Selective State Spaces • arXiv:2312.00752 • Published Dec 1, 2023