leegao19 (Lee Gao) – Community Activity

commented 2 papers 12 months ago

Entropy-Guided Attention for Private LLMs

Paper • 2501.03489 • Published Jan 7, 2025 • 14 •

8

Entropy-Guided Attention for Private LLMs

Paper • 2501.03489 • Published Jan 7, 2025 • 14 •

8

commented 6 papers about 1 year ago

Fourier Position Embedding: Enhancing Attention's Periodic Extension for Length Generalization

Paper • 2412.17739 • Published Dec 23, 2024 • 41 •

26

Fourier Position Embedding: Enhancing Attention's Periodic Extension for Length Generalization

Paper • 2412.17739 • Published Dec 23, 2024 • 41 •

26

Breaking the Stage Barrier: A Novel Single-Stage Approach to Long Context Extension for Large Language Models

Paper • 2412.07171 • Published Dec 10, 2024 • 1 •

1

Rethinking Addressing in Language Models via Contexualized Equivariant Positional Encoding

Paper • 2501.00712 • Published Jan 1, 2025 • 6 •

4

Deliberation in Latent Space via Differentiable Cache Augmentation

Paper • 2412.17747 • Published Dec 23, 2024 • 32 •

5

Fourier Position Embedding: Enhancing Attention's Periodic Extension for Length Generalization

Paper • 2412.17739 • Published Dec 23, 2024 • 41 •

26

commented a paper over 1 year ago

Emu3: Next-Token Prediction is All You Need

Paper • 2409.18869 • Published Sep 27, 2024 • 96 •

9

commented 11 papers almost 2 years ago

Resonance RoPE: Improving Context Length Generalization of Large Language Models

Paper • 2403.00071 • Published Feb 29, 2024 • 24 •

2

Lee Gao

AI & ML interests

Organizations

Entropy-Guided Attention for Private LLMs

Entropy-Guided Attention for Private LLMs

Fourier Position Embedding: Enhancing Attention's Periodic Extension for Length Generalization

Fourier Position Embedding: Enhancing Attention's Periodic Extension for Length Generalization

Breaking the Stage Barrier: A Novel Single-Stage Approach to Long Context Extension for Large Language Models

Rethinking Addressing in Language Models via Contexualized Equivariant Positional Encoding

Deliberation in Latent Space via Differentiable Cache Augmentation

Fourier Position Embedding: Enhancing Attention's Periodic Extension for Length Generalization

Emu3: Next-Token Prediction is All You Need

MoAI: Mixture of All Intelligence for Large Language and Vision Models

Simple linear attention language models balance the recall-throughput tradeoff

MoAI: Mixture of All Intelligence for Large Language and Vision Models

Simple linear attention language models balance the recall-throughput tradeoff

Simple linear attention language models balance the recall-throughput tradeoff

Simple linear attention language models balance the recall-throughput tradeoff

GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection

GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection

Yi: Open Foundation Models by 01.AI

GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection

Resonance RoPE: Improving Context Length Generalization of Large Language Models

Lee Gao

AI & ML interests

Organizations

leegao19's activity