Blog, Articles, and discussions
Community Articles
Topic 28: What is Mixture-of-Mamba?
Mimicking Consciousness in LLMs: Ascending the Dimensions of Thought with Recurrent Processing
Exploring MLA Improvements and Gains with DeepSeek Code
Preference Optimization Techniques for Large Models: DPO and Its Variants
Grok 3 AI: Best AI Model Now!
Argunauts Training Phase II: Selfplay Finetuning Line-By-Line
Synthetic Face Embeddings: Research Notes and Methodology
How to use Sentient’s Dobby-70B
Mahjong: Where Grandmas Beat The Best LLMs
Mixture of Tunable Experts - Behavior Modification of DeepSeek-R1 at Inference Time
Argunauts Training Phase I: Continual Pretraining on Synthetic Data
Best AI Setups for Multi-Agent Workflows in KaibanJS
🌁#88: Can DeepSeek Inspire Global Collaboration?
Adapting Artificial Intelligence to Creole
Open-sourcing the Plain English to SQL Pipeline
Fine-tuning SmolLM with Group Relative Policy Optimization (GRPO) by following the Methodologies
Nexus Shift: AI Generated Short Story
WTF is Fine-Tuning? (intro4devs) | [2025]
🦸🏻#10: Does Present-Day GenAI Actually Reason?
Blazing-Fast Code Editing via Multi-Layer Speculation