rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking Paper • 2501.04519 • Published 10 days ago • 230
Monolith: Real Time Recommendation System With Collisionless Embedding Table Paper • 2209.07663 • Published Sep 16, 2022 • 1
Human-Timescale Adaptation in an Open-Ended Task Space Paper • 2301.07608 • Published Jan 18, 2023 • 1
Offline Reinforcement Learning for LLM Multi-Step Reasoning Paper • 2412.16145 • Published 29 days ago • 38
Progressive Multimodal Reasoning via Active Retrieval Paper • 2412.14835 • Published 30 days ago • 73
Smarter, Better, Faster, Longer: A Modern Bidirectional Encoder for Fast, Memory Efficient, and Long Context Finetuning and Inference Paper • 2412.13663 • Published Dec 18, 2024 • 123
Byte Latent Transformer: Patches Scale Better Than Tokens Paper • 2412.09871 • Published Dec 13, 2024 • 88
LLM Pruning and Distillation in Practice: The Minitron Approach Paper • 2408.11796 • Published Aug 21, 2024 • 58
Transformers Can Navigate Mazes With Multi-Step Prediction Paper • 2412.05117 • Published Dec 6, 2024 • 5
Mooncake: A KVCache-centric Disaggregated Architecture for LLM Serving Paper • 2407.00079 • Published Jun 24, 2024 • 5
Critical Tokens Matter: Token-Level Contrastive Estimation Enhances LLM's Reasoning Capability Paper • 2411.19943 • Published Nov 29, 2024 • 57
Agent Skill Acquisition for Large Language Models via CycleQD Paper • 2410.14735 • Published Oct 16, 2024 • 2
BiomedParse: a biomedical foundation model for image parsing of everything everywhere all at once Paper • 2405.12971 • Published May 21, 2024 • 2
Use Models from the Hugging Face Hub in LM Studio Article • By yagilb • Nov 28, 2024 • 132