Kimi k1.5: Scaling Reinforcement Learning with LLMs Paper • 2501.12599 • Published about 1 month ago • 96
view article Article Fine-tune ModernBERT for text classification using synthetic data By davidberenstein1957 • Dec 30, 2024 • 33
Assisting in Writing Wikipedia-like Articles From Scratch with Large Language Models Paper • 2402.14207 • Published Feb 22, 2024 • 8
view article Article 🚀 Deploying OLMo-7B with Text Generation Inference (TGI) on Hugging Face Spaces By ariG23498 • 19 days ago • 5
LLM Reasoning Papers Collection Papers to improve reasoning capabilities of LLMs • 20 items • Updated Jan 15 • 116
Reasoning Datasets Collection Distilled synthetic Reasoning datasets • 7 items • Updated 19 days ago • 53
SANA 1.5: Efficient Scaling of Training-Time and Inference-Time Compute in Linear Diffusion Transformer Paper • 2501.18427 • Published 22 days ago • 16
🧠 Reasoning datasets Collection Datasets with reasoning traces for math and code released by the community • 12 items • Updated 1 day ago • 76
Large Language Models Think Too Fast To Explore Effectively Paper • 2501.18009 • Published 23 days ago • 23
view article Article Mini-R1: Reproduce Deepseek R1 „aha moment“ a RL tutorial By open-r1 • 21 days ago • 35