SmolVLA: A Vision-Language-Action Model for Affordable and Efficient Robotics Paper • 2506.01844 • Published 1 day ago • 50
Article Interactive Tools for machine learning, deep learning, and math By Suzana • 9 days ago • 40
Insights into DeepSeek-V3: Scaling Challenges and Reflections on Hardware for AI Architectures Paper • 2505.09343 • Published 21 days ago • 62
Article Falcon-Edge: A series of powerful, universal, fine-tunable 1.58bit language models. By tiiuae and 9 others • 20 days ago • 32
RobustDexGrasp: Robust Dexterous Grasping of General Objects from Single-view Perception Paper • 2504.05287 • Published Apr 7 • 6
Article How to Build an MCP Server with Gradio By abidlabs and 1 other • Apr 30 • 147
70% Size, 100% Accuracy: Lossless LLM Compression for Efficient GPU Inference via Dynamic-Length Float Paper • 2504.11651 • Published Apr 15 • 28
Article Tiny Agents: a MCP-powered agent in 50 lines of code By julien-c • Apr 25 • 267
Chain-of-Thought Prompting Elicits Reasoning in Large Language Models Paper • 2201.11903 • Published Jan 28, 2022 • 13
Article An Introduction to AI Model Optimization Techniques By PrunaAI and 1 other • Apr 18 • 28
Universal Language Model Fine-tuning for Text Classification Paper • 1801.06146 • Published Jan 18, 2018 • 7
Sparse Autoencoders Find Highly Interpretable Features in Language Models Paper • 2309.08600 • Published Sep 15, 2023 • 15
Article Train 400x faster Static Embedding Models with Sentence Transformers By tomaarsen • Jan 15 • 185
Article ❤️ a love letter to the Open AI inference client By burtenshaw • Feb 28 • 9
Article Remote VAEs for decoding with HF endpoints 🤗 By hlky and 1 other • Feb 24 • 39
Native Sparse Attention: Hardware-Aligned and Natively Trainable Sparse Attention Paper • 2502.11089 • Published Feb 16 • 159