The Common Pile v0.1
By
and 2 others
•
•
33Holo1: New family of GUI automation VLMs powering GUI agent Surfer-H
By
and 1 other
•
•
62Explore, Build, and Innovate AI Reasoning with NVIDIA’s Open Models and Recipes
By
and 2 others
•
•
19Announcing the Common Pile and Comma v0.1
By
•
•
13xLSTM-based time series model TiRex significantly outperforms competing models in forecasting accuracy
By
•
•
12What if Your AI Conversations Become Public?
By
•
•
10*Context Is Gold to Find the Gold Passage*: Evaluating and Training Contextual Document Embeddings
By
and 1 other
•
•
23Uncensor any LLM with abliteration
By
•
•
608Code a simple RAG from scratch
By
•
•
91DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge
By
•
•
151Daily Robotics June #1 - SmolVLA discovery and thoughts
By
•
•
9MCP is at a Tipping Point: Here's Why You Should Care
By
•
•
6Nemotron-Personas: Improve AI Training With the First Synthetic Personas Dataset Aligned to Real-World Distributions
By
and 1 other
•
•
6KV Caching Explained: Optimizing Transformer Inference Efficiency
By
•
•
75Tensors
By
•
•
5The Large Language Model Course
By
•
•
187Fine-Tuning Your First Large Language Model (LLM) with PyTorch and Hugging Face
By
•
•
41Small Language Models (SLM): A Comprehensive Overview
By
•
•
34🦸🏻#14: What Is MCP, and Why Is Everyone – Suddenly!– Talking About It?
By
•
•
284Google Opensources Deep Research Agents using Gemini 2.5 & LangGraph, Let's Take a Look
By
•
•
6