Maxime Labonne PRO

mlabonne

nara-simba's profile picture

iandennismiller's profile picture

nandhakumarg's profile picture

https://mlabonne.github.io/blog

maximelabonne
mlabonne
maxime-labonne

AI & ML interests

Post-training, model editing, quantization

Recent Activity

repliedto their post 1 day ago

Big update to llm-datasets, my curated list of datasets and tools for post-training LLMs. > Added many new datasets > New "thinking" column > Refreshed recommended tools. Thanks to everyone who told me they used it for their research at ICLR, you motivated this update!

posted an update 1 day ago

liked a dataset 4 days ago

aladinDJ/ultramix-DPO-annotated

View all activity

Organizations

mlabonne 's collections 12

💧 LFM2 & LFM2.5

Models made at Liquid AI

LiquidAI/LFM2.5-350M

Text Generation • 0.4B • Updated 27 days ago • 58.3k • 285
LiquidAI/LFM2-24B-A2B

Text Generation • 24B • Updated 30 days ago • 32.1k • 315
LiquidAI/LFM2.5-1.2B-Thinking

Text Generation • 1B • Updated 30 days ago • 28.9k • 335
LiquidAI/LFM2.5-1.2B-Instruct

Text Generation • 1B • Updated 30 days ago • 425k • 574

✂️ Abliteration

Uncensored models using abliteration. See this article for more information: huggingface.co/blog/mlabonne/abliteration

mlabonne/gemma-3-27b-it-abliterated

Image-Text-to-Text • Updated Mar 21, 2025 • 350k • • 317
mlabonne/gemma-3-27b-it-abliterated-GGUF

Image-Text-to-Text • 27B • Updated Apr 1, 2025 • 14.6k • 247
mlabonne/gemma-3-12b-it-abliterated-v2

Image-Text-to-Text • 12B • Updated May 29, 2025 • 622 • 20
mlabonne/gemma-3-4b-it-abliterated-v2

Image-Text-to-Text • 4B • Updated May 29, 2025 • 464 • 11

👑 Monarch

Family of 7B models that combine excellent reasoning and conversational abilities.

mlabonne/AlphaMonarch-7B

Text Generation • 7B • Updated Mar 28, 2024 • 12.9k • • 148
Sleeping

Agents

27

AlphaMonarch-7B

👑

27

Generate text responses to user queries
mlabonne/NeuralMonarch-7B

Text Generation • 7B • Updated Mar 4, 2024 • 29.1k • • 12
Runtime error

6

NeuralMonarch 7B GGUF Chat

👑

6

Chat with NeuralMonarch-7B

🔀 Phixtral

The first Mixture of Experts with phi-2 models.

mlabonne/phixtral-2x2_8

Text Generation • 4B • Updated Jan 14, 2024 • 84 • 149
Runtime error

Agents

77

Phixtral Chat

🔀

77
mlabonne/phixtral-4x2_8

Text Generation • Updated Jan 15, 2024 • 136 • 209

🧠 NeuralHermes-2.5

Models and code related to the DPO fine-tuned OpenHermes-2.5-Mistral-7B

mlabonne/NeuralHermes-2.5-Mistral-7B

Text Generation • 7B • Updated Apr 8, 2024 • 149 • 153
Runtime error

4

NeuralHermes 2.5 Mistral 7B GGUF Chat

🧠

4
mlabonne/chatml_dpo_pairs

Viewer • Updated Apr 11, 2024 • 12.9k • 24 • 55
TheBloke/NeuralHermes-2.5-Mistral-7B-GGUF

7B • Updated Nov 30, 2023 • 598 • 52

💻 CodeLlama

Llama and CodeLlama models trained to improve the performance in terms of code generation.

mlabonne/PyLlama-7b

Text Generation • 7B • Updated Feb 1, 2025 • 12 • 8
mlabonne/EvolCodeLlama-7b

Text Generation • 7B • Updated May 27, 2025 • 17 • 6
mlabonne/codellama-2-7b

Updated Dec 16, 2024 • 14 • 5
mlabonne/EvolCodeLlama-7b-GGUF

7B • Updated Aug 28, 2023 • 28 • 2

📙 LLM Engineer's Handbook

Models and datasets from my book. All the code is freely available at https://github.com/PacktPublishing/LLM-Engineers-Handbook

mlabonne/TwinLlama-3.1-8B

Text Generation • 8B • Updated Oct 6, 2024 • 25 • 30
mlabonne/TwinLlama-3.1-8B-GGUF

8B • Updated Oct 6, 2024 • 34 • 4
mlabonne/TwinLlama-3.1-8B-DPO

Text Generation • 8B • Updated Oct 6, 2024 • 22 • 22
mlabonne/TwinLlama-3.1-8B-DPO-GGUF

8B • Updated Oct 6, 2024 • 10 • 2

👿 Daredevil-8B

Fine-tuned abliterated merge of the best Llama 3 8B model. Highest MMLU score in its category.

mlabonne/NeuralDaredevil-8B-abliterated

Text Generation • 8B • Updated Jan 23 • 15.6k • • 268
mlabonne/Daredevil-8B-abliterated

Text Generation • 8B • Updated Jan 22 • 8.14k • • 59
mlabonne/Daredevil-8B

Text Generation • 8B • Updated Jan 7, 2025 • 527 • • 43
mlabonne/NeuralLlama-3-8B-Instruct-abliterated

Text Generation • 8B • Updated May 27, 2024 • 103 • • 10

🔮 Mixture of Experts

MoE done using mergekit and LazyMergekit: https://colab.research.google.com/drive/1obulZ1ROXHjYLn6PPZJwRR6GzgQogxxb#scrollTo=d5mYzDo1q96y

mlabonne/Beyonder-4x7B-v3

Text Generation • 24B • Updated Mar 28, 2024 • 7.86k • 60
Runtime error

Agents

7

Beyonder 4x7B V3

🔮

7

Generate text responses to user prompts
mlabonne/Beyonder-4x7B-v2

Text Generation • 24B • Updated Mar 4, 2024 • 284 • 128
Runtime error

17

Beyonder 4x7B V2 GGUF Chat

🔮

17

Chat with a helpful assistant

🐶 Beagle

Merges done using mergekit and LazyMergekit: https://colab.research.google.com/drive/1obulZ1ROXHjYLn6PPZJwRR6GzgQogxxb#scrollTo=d5mYzDo1q96y

mlabonne/NeuralBeagle14-7B

Text Generation • 7B • Updated Mar 4, 2024 • 183 • 157
Paused

10

NeuralBeagle14 7B GGUF Chat

🐶

10
mlabonne/Beagle14-7B

Text Generation • 7B • Updated Mar 4, 2024 • 467 • 15
mlabonne/NeuralDaredevil-7B

Text Generation • 7B • Updated Jul 14, 2024 • 25 • • 41

🥼 DrMistral

Mistral and Llama models trained on a corpus of French and English data to act as a medical chatbot and ace exams.

mlabonne/drmistral-7b

Text Generation • 7B • Updated Nov 25, 2023 • 8 • 3
mlabonne/drllama-7b

Text Generation • 7B • Updated Apr 4, 2025 • 15 • 2
mlabonne/bactrian-fr

Viewer • Updated Nov 24, 2023 • 50k • 20 • 2
mlabonne/medical-cases-fr

Viewer • Updated Sep 9, 2023 • 8.5k • 40 • 7

🦙 Llama 2 Guanaco

Set of models fine-tuned using QLoRA on Google Colab with the Guanaco dataset.

mlabonne/llama-2-7b-guanaco

Text Generation • 7B • Updated Dec 29, 2024 • 21 • 17
mlabonne/llama-2-13b-guanaco

Text Generation • Updated Jul 30, 2023 • 15 • 3
mlabonne/llama-2-7b-miniguanaco

Text Generation • 7B • Updated Nov 16, 2023 • 17 • 7
mlabonne/llama-2-13b-miniguanaco

Text Generation • Updated Jul 30, 2023 • 13 • 2

💧 LFM2 & LFM2.5

Models made at Liquid AI

LiquidAI/LFM2.5-350M

Text Generation • 0.4B • Updated 27 days ago • 58.3k • 285
LiquidAI/LFM2-24B-A2B

Text Generation • 24B • Updated 30 days ago • 32.1k • 315
LiquidAI/LFM2.5-1.2B-Thinking

Text Generation • 1B • Updated 30 days ago • 28.9k • 335
LiquidAI/LFM2.5-1.2B-Instruct

Text Generation • 1B • Updated 30 days ago • 425k • 574

📙 LLM Engineer's Handbook

Models and datasets from my book. All the code is freely available at https://github.com/PacktPublishing/LLM-Engineers-Handbook

mlabonne/TwinLlama-3.1-8B

Text Generation • 8B • Updated Oct 6, 2024 • 25 • 30
mlabonne/TwinLlama-3.1-8B-GGUF

8B • Updated Oct 6, 2024 • 34 • 4
mlabonne/TwinLlama-3.1-8B-DPO

Text Generation • 8B • Updated Oct 6, 2024 • 22 • 22
mlabonne/TwinLlama-3.1-8B-DPO-GGUF

8B • Updated Oct 6, 2024 • 10 • 2

✂️ Abliteration

Uncensored models using abliteration. See this article for more information: huggingface.co/blog/mlabonne/abliteration

mlabonne/gemma-3-27b-it-abliterated

Image-Text-to-Text • Updated Mar 21, 2025 • 350k • • 317
mlabonne/gemma-3-27b-it-abliterated-GGUF

Image-Text-to-Text • 27B • Updated Apr 1, 2025 • 14.6k • 247
mlabonne/gemma-3-12b-it-abliterated-v2

Image-Text-to-Text • 12B • Updated May 29, 2025 • 622 • 20
mlabonne/gemma-3-4b-it-abliterated-v2

Image-Text-to-Text • 4B • Updated May 29, 2025 • 464 • 11

👿 Daredevil-8B

Fine-tuned abliterated merge of the best Llama 3 8B model. Highest MMLU score in its category.

mlabonne/NeuralDaredevil-8B-abliterated

Text Generation • 8B • Updated Jan 23 • 15.6k • • 268
mlabonne/Daredevil-8B-abliterated

Text Generation • 8B • Updated Jan 22 • 8.14k • • 59
mlabonne/Daredevil-8B

Text Generation • 8B • Updated Jan 7, 2025 • 527 • • 43
mlabonne/NeuralLlama-3-8B-Instruct-abliterated

Text Generation • 8B • Updated May 27, 2024 • 103 • • 10

👑 Monarch

Family of 7B models that combine excellent reasoning and conversational abilities.

mlabonne/AlphaMonarch-7B

Text Generation • 7B • Updated Mar 28, 2024 • 12.9k • • 148
Sleeping

Agents

27

AlphaMonarch-7B

👑

27

Generate text responses to user queries
mlabonne/NeuralMonarch-7B

Text Generation • 7B • Updated Mar 4, 2024 • 29.1k • • 12
Runtime error

6

NeuralMonarch 7B GGUF Chat

👑

6

Chat with NeuralMonarch-7B

🔮 Mixture of Experts

MoE done using mergekit and LazyMergekit: https://colab.research.google.com/drive/1obulZ1ROXHjYLn6PPZJwRR6GzgQogxxb#scrollTo=d5mYzDo1q96y

mlabonne/Beyonder-4x7B-v3

Text Generation • 24B • Updated Mar 28, 2024 • 7.86k • 60
Runtime error

Agents

7

Beyonder 4x7B V3

🔮

7

Generate text responses to user prompts
mlabonne/Beyonder-4x7B-v2

Text Generation • 24B • Updated Mar 4, 2024 • 284 • 128
Runtime error

17

Beyonder 4x7B V2 GGUF Chat

🔮

17

Chat with a helpful assistant

🔀 Phixtral

The first Mixture of Experts with phi-2 models.

mlabonne/phixtral-2x2_8

Text Generation • 4B • Updated Jan 14, 2024 • 84 • 149
Runtime error

Agents

77

Phixtral Chat

🔀

77
mlabonne/phixtral-4x2_8

Text Generation • Updated Jan 15, 2024 • 136 • 209

🐶 Beagle

Merges done using mergekit and LazyMergekit: https://colab.research.google.com/drive/1obulZ1ROXHjYLn6PPZJwRR6GzgQogxxb#scrollTo=d5mYzDo1q96y

mlabonne/NeuralBeagle14-7B

Text Generation • 7B • Updated Mar 4, 2024 • 183 • 157
Paused

10

NeuralBeagle14 7B GGUF Chat

🐶

10
mlabonne/Beagle14-7B

Text Generation • 7B • Updated Mar 4, 2024 • 467 • 15
mlabonne/NeuralDaredevil-7B

Text Generation • 7B • Updated Jul 14, 2024 • 25 • • 41

🧠 NeuralHermes-2.5

Models and code related to the DPO fine-tuned OpenHermes-2.5-Mistral-7B

mlabonne/NeuralHermes-2.5-Mistral-7B

Text Generation • 7B • Updated Apr 8, 2024 • 149 • 153
Runtime error

4

NeuralHermes 2.5 Mistral 7B GGUF Chat

🧠

4
mlabonne/chatml_dpo_pairs

Viewer • Updated Apr 11, 2024 • 12.9k • 24 • 55
TheBloke/NeuralHermes-2.5-Mistral-7B-GGUF

7B • Updated Nov 30, 2023 • 598 • 52

🥼 DrMistral

Mistral and Llama models trained on a corpus of French and English data to act as a medical chatbot and ace exams.

mlabonne/drmistral-7b

Text Generation • 7B • Updated Nov 25, 2023 • 8 • 3
mlabonne/drllama-7b

Text Generation • 7B • Updated Apr 4, 2025 • 15 • 2
mlabonne/bactrian-fr

Viewer • Updated Nov 24, 2023 • 50k • 20 • 2
mlabonne/medical-cases-fr

Viewer • Updated Sep 9, 2023 • 8.5k • 40 • 7

💻 CodeLlama

Llama and CodeLlama models trained to improve the performance in terms of code generation.

mlabonne/PyLlama-7b

Text Generation • 7B • Updated Feb 1, 2025 • 12 • 8
mlabonne/EvolCodeLlama-7b

Text Generation • 7B • Updated May 27, 2025 • 17 • 6
mlabonne/codellama-2-7b

Updated Dec 16, 2024 • 14 • 5
mlabonne/EvolCodeLlama-7b-GGUF

7B • Updated Aug 28, 2023 • 28 • 2

🦙 Llama 2 Guanaco

Set of models fine-tuned using QLoRA on Google Colab with the Guanaco dataset.

mlabonne/llama-2-7b-guanaco

Text Generation • 7B • Updated Dec 29, 2024 • 21 • 17
mlabonne/llama-2-13b-guanaco

Text Generation • Updated Jul 30, 2023 • 15 • 3
mlabonne/llama-2-7b-miniguanaco

Text Generation • 7B • Updated Nov 16, 2023 • 17 • 7
mlabonne/llama-2-13b-miniguanaco

Text Generation • Updated Jul 30, 2023 • 13 • 2

Maxime Labonne PRO

AI & ML interests

Recent Activity

Organizations

mlabonne 's collections 12

AlphaMonarch-7B

NeuralMonarch 7B GGUF Chat

Phixtral Chat

NeuralHermes 2.5 Mistral 7B GGUF Chat

Beyonder 4x7B V3

Beyonder 4x7B V2 GGUF Chat

NeuralBeagle14 7B GGUF Chat

AlphaMonarch-7B

NeuralMonarch 7B GGUF Chat

Beyonder 4x7B V3

Beyonder 4x7B V2 GGUF Chat

Phixtral Chat

NeuralBeagle14 7B GGUF Chat

NeuralHermes 2.5 Mistral 7B GGUF Chat