Franca: Nested Matryoshka Clustering for Scalable Visual Representation Learning Paper • 2507.14137 • Published 5 days ago • 22
A Survey of Context Engineering for Large Language Models Paper • 2507.13334 • Published 6 days ago • 170
view article Article Three Mighty Alerts Supporting Hugging Face’s Production Infrastructure By jcudit • 15 days ago • 8
Energy-Based Transformers are Scalable Learners and Thinkers Paper • 2507.02092 • Published 21 days ago • 55
Self-Correction Bench: Revealing and Addressing the Self-Correction Blind Spot in LLMs Paper • 2507.02778 • Published 20 days ago • 9
view article Article How to generate text: using different decoding methods for language generation with Transformers By patrickvonplaten • Mar 1, 2020 • 225
Geopolitical biases in LLMs: what are the "good" and the "bad" countries according to contemporary language models Paper • 2506.06751 • Published Jun 7 • 72
Pre-trained Large Language Models Learn Hidden Markov Models In-context Paper • 2506.07298 • Published Jun 8 • 26
Falcon-H1 Collection Falcon-H1 Family of Hybrid-Head Language Models (Transformer-SSM), including 0.5B, 1.5B, 1.5B-Deep, 3B, 7B, and 34B (pretrained & instruction-tuned). • 37 items • Updated about 1 hour ago • 47
view article Article Building an Open Ecosystem for Time Series Forecasting: Introducing TimesFM in Hugging Face By Nutanix and 1 other • May 19 • 18
view article Article The N Implementation Details of RLHF with PPO By vwxyzjn and 2 others • Oct 24, 2023 • 62
Falcon Edge series Collection A series of powerful, universal and fine-tunable small Language Models • 7 items • Updated about 1 hour ago • 22
Learning Dynamics in Continual Pre-Training for Large Language Models Paper • 2505.07796 • Published May 12 • 20
Generating Physically Stable and Buildable LEGO Designs from Text Paper • 2505.05469 • Published May 8 • 28
LLMs for Engineering: Teaching Models to Design High Powered Rockets Paper • 2504.19394 • Published Apr 27 • 14
Self-Generated In-Context Examples Improve LLM Agents for Sequential Decision-Making Tasks Paper • 2505.00234 • Published May 1 • 26