-
Attention Is All You Need
Paper • 1706.03762 • Published • 85 -
Language Models are Few-Shot Learners
Paper • 2005.14165 • Published • 16 -
GQA: Training Generalized Multi-Query Transformer Models from Multi-Head Checkpoints
Paper • 2305.13245 • Published • 6 -
Llama 2: Open Foundation and Fine-Tuned Chat Models
Paper • 2307.09288 • Published • 245
Eli Chen
elichen3051
AI & ML interests
Learning Algorithm, Reinforcement Learning, Data Synthesize, Benchmarking
Recent Activity
upvoted
an
article
9 days ago
SmolLM3: smol, multilingual, long-context reasoner
upvoted
a
paper
17 days ago
The Landscape of Agentic Reinforcement Learning for LLMs: A Survey
liked
a model
about 2 months ago
openai/gpt-oss-120b