Blog, Articles, and discussions

AI Policy: 🤗 Response to the White House AI Action Plan RFI

By March 19, 2025 • 26

Community Articles

view all

Holo1: New family of GUI automation VLMs powering GUI agent Surfer-H

and 1 other •

2 days ago

• 59

Context Is Gold to Find the Gold Passage: Evaluating and Training Contextual Document Embeddings

and 1 other •

4 days ago

• 22

xLSTM-based time series model TiRex significantly outperforms competing models in forecasting accuracy

•

1 day ago

• 12

🦸🏻#14: What Is MCP, and Why Is Everyone – Suddenly!– Talking About It?

•

Mar 17

• 282

Interactive Tools for machine learning, deep learning, and math

•

10 days ago

• 40

System Prompt Learning: Teaching LLMs to Learn Problem-Solving Strategies from Experience

•

4 days ago

• 9

Daily Robotics June #1 - SmolVLA discovery and thoughts

•

2 days ago

• 9

Bigger isn't always better: how to choose the most efficient model for context-specific tasks 🌱🧑🏼‍💻

•

9 days ago

• 19

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

•

Feb 7

• 148

🌙 Introducing Moon: Storytelling Generator Model

and 1 other •

7 days ago

• 6

Google Opensources Deep Research Agents using Gemini 2.5 & LangGraph, Let's Take a Look

•

2 days ago

• 6

Common AI Model Formats

•

Feb 27

• 42

Fine-tune Llama 3.1 Ultra-Efficiently with Unsloth

•

Jul 29, 2024

• 328

Code a simple RAG from scratch

•

Oct 29, 2024

• 85

Decoding Strategies in Large Language Models

•

Oct 29, 2024

• 66

KV Caching Explained: Optimizing Transformer Inference Efficiency

•

Jan 30

• 72

PipelineRL

and 3 others •

Apr 25

• 26

Ethics and Society Newsletter #2: Let's talk about bias!

By December 15, 2022 • 1

Evaluating Language Model Bias with 🤗 Evaluate

By October 24, 2022 • 5

Ethics and Society Newsletter #1

By September 22, 2022

AI Policy @🤗: Comments on U.S. National AI Research Resource Interim Report

By August 1, 2022

Community Articles

Holo1: New family of GUI automation VLMs powering GUI agent Surfer-H

and 1 other •

2 days ago

• 59

Context Is Gold to Find the Gold Passage: Evaluating and Training Contextual Document Embeddings

and 1 other •

4 days ago

• 22

Explore, Build, and Innovate AI Reasoning with NVIDIA’s Open Models and Recipes

and 2 others •

1 day ago

• 16

Uncensor any LLM with abliteration

•

Jun 13, 2024

• 604

AI Policy @🤗: Response to the 2025 National AI R&D Strategic Plan

and 2 others •

3 days ago

• 12

xLSTM-based time series model TiRex significantly outperforms competing models in forecasting accuracy

•

1 day ago

• 12

🦸🏻#14: What Is MCP, and Why Is Everyone – Suddenly!– Talking About It?

•

Mar 17

• 282

Interactive Tools for machine learning, deep learning, and math

•

10 days ago

• 40

System Prompt Learning: Teaching LLMs to Learn Problem-Solving Strategies from Experience

•

4 days ago

• 9

Daily Robotics June #1 - SmolVLA discovery and thoughts

•

2 days ago

• 9

Bigger isn't always better: how to choose the most efficient model for context-specific tasks 🌱🧑🏼‍💻

•

9 days ago

• 19

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

•

Feb 7

• 148

🌙 Introducing Moon: Storytelling Generator Model

and 1 other •

7 days ago

• 6

Google Opensources Deep Research Agents using Gemini 2.5 & LangGraph, Let's Take a Look

•

2 days ago

• 6

Common AI Model Formats

•

Feb 27

• 42

Fine-tune Llama 3.1 Ultra-Efficiently with Unsloth

•

Jul 29, 2024

• 328

Code a simple RAG from scratch

•

Oct 29, 2024

• 85

Decoding Strategies in Large Language Models

•

Oct 29, 2024

• 66

KV Caching Explained: Optimizing Transformer Inference Efficiency

•

Jan 30

• 72

PipelineRL

and 3 others •

Apr 25

• 26

View all

Blog, Articles, and discussions

AI Policy: 🤗 Response to the White House AI Action Plan RFI

Holo1: New family of GUI automation VLMs powering GUI agent Surfer-H

*Context Is Gold to Find the Gold Passage*: Evaluating and Training Contextual Document Embeddings

Explore, Build, and Innovate AI Reasoning with NVIDIA’s Open Models and Recipes

Uncensor any LLM with abliteration

AI Policy @🤗: Response to the 2025 National AI R&D Strategic Plan

xLSTM-based time series model TiRex significantly outperforms competing models in forecasting accuracy

🦸🏻#14: What Is MCP, and Why Is Everyone – Suddenly!– Talking About It?

Interactive Tools for machine learning, deep learning, and math

System Prompt Learning: Teaching LLMs to Learn Problem-Solving Strategies from Experience

Daily Robotics June #1 - SmolVLA discovery and thoughts

Bigger isn't always better: how to choose the most efficient model for context-specific tasks 🌱🧑🏼‍💻

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

🌙 Introducing **Moon**: Storytelling Generator Model

Google Opensources Deep Research Agents using Gemini 2.5 & LangGraph, Let's Take a Look

Common AI Model Formats

Fine-tune Llama 3.1 Ultra-Efficiently with Unsloth

Code a simple RAG from scratch

Decoding Strategies in Large Language Models

KV Caching Explained: Optimizing Transformer Inference Efficiency

PipelineRL

Ethics and Society Newsletter #2: Let's talk about bias!

Evaluating Language Model Bias with 🤗 Evaluate

Ethics and Society Newsletter #1

AI Policy @🤗: Comments on U.S. National AI Research Resource Interim Report

Holo1: New family of GUI automation VLMs powering GUI agent Surfer-H

*Context Is Gold to Find the Gold Passage*: Evaluating and Training Contextual Document Embeddings

Explore, Build, and Innovate AI Reasoning with NVIDIA’s Open Models and Recipes

Uncensor any LLM with abliteration

AI Policy @🤗: Response to the 2025 National AI R&D Strategic Plan

xLSTM-based time series model TiRex significantly outperforms competing models in forecasting accuracy

🦸🏻#14: What Is MCP, and Why Is Everyone – Suddenly!– Talking About It?

Interactive Tools for machine learning, deep learning, and math

System Prompt Learning: Teaching LLMs to Learn Problem-Solving Strategies from Experience

Daily Robotics June #1 - SmolVLA discovery and thoughts

Bigger isn't always better: how to choose the most efficient model for context-specific tasks 🌱🧑🏼‍💻

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

🌙 Introducing **Moon**: Storytelling Generator Model

Google Opensources Deep Research Agents using Gemini 2.5 & LangGraph, Let's Take a Look

Common AI Model Formats

Fine-tune Llama 3.1 Ultra-Efficiently with Unsloth

Code a simple RAG from scratch

Decoding Strategies in Large Language Models

KV Caching Explained: Optimizing Transformer Inference Efficiency

PipelineRL

Context Is Gold to Find the Gold Passage: Evaluating and Training Contextual Document Embeddings

🌙 Introducing Moon: Storytelling Generator Model

Context Is Gold to Find the Gold Passage: Evaluating and Training Contextual Document Embeddings

🌙 Introducing Moon: Storytelling Generator Model