Models and datasets from my book. All the code is freely available at https://github.com/PacktPublishing/LLM-Engineers-Handbook
Maxime Labonne PRO
mlabonne
AI & ML interests
Post-training, model editing, quantization
Recent Activity
updated
a model
about 13 hours ago
LiquidAI/LFM2-350M
updated
a model
about 13 hours ago
LiquidAI/LFM2-700M
updated
a model
about 13 hours ago
LiquidAI/LFM2-1.2B
Organizations
👿 Daredevil-8B
Fine-tuned abliterated merge of the best Llama 3 8B model. Highest MMLU score in its category.
-
mlabonne/NeuralDaredevil-8B-abliterated
Text Generation • 8B • Updated • 15.2k • • 220 -
mlabonne/Daredevil-8B-abliterated
Text Generation • 8B • Updated • 11.5k • • 50 -
mlabonne/Daredevil-8B
Text Generation • 8B • Updated • 116 • 41 -
mlabonne/NeuralLlama-3-8B-Instruct-abliterated
Text Generation • 8B • Updated • 2.71k • • 9
🔮 Mixture of Experts
MoE done using mergekit and LazyMergekit: https://colab.research.google.com/drive/1obulZ1ROXHjYLn6PPZJwRR6GzgQogxxb#scrollTo=d5mYzDo1q96y
🐶 Beagle
Merges done using mergekit and LazyMergekit: https://colab.research.google.com/drive/1obulZ1ROXHjYLn6PPZJwRR6GzgQogxxb#scrollTo=d5mYzDo1q96y
🥼 DrMistral
Mistral and Llama models trained on a corpus of French and English data to act as a medical chatbot and ace exams.
🦙 Llama 2 Guanaco
Set of models fine-tuned using QLoRA on Google Colab with the Guanaco dataset.
✂️ Abliteration
Uncensored models using abliteration. See this article for more information: huggingface.co/blog/mlabonne/abliteration
-
mlabonne/gemma-3-27b-it-abliterated
Image-Text-to-Text • 27B • Updated • 3.99k • • 177 -
mlabonne/gemma-3-27b-it-abliterated-GGUF
Image-Text-to-Text • 27B • Updated • 17.8k • 117 -
mlabonne/gemma-3-12b-it-abliterated-v2
Image-Text-to-Text • 12B • Updated • 2.19k • 6 -
mlabonne/gemma-3-4b-it-abliterated-v2
Image-Text-to-Text • 4B • Updated • 3.17k • 8
👑 Monarch
Family of 7B models that combine excellent reasoning and conversational abilities.
-
mlabonne/AlphaMonarch-7B
Text Generation • 7B • Updated • 12.9k • • 148 -
Running on Zero2727
AlphaMonarch-7B
👑Generate text based on user messages and a chat history
-
mlabonne/NeuralMonarch-7B
Text Generation • 7B • Updated • 29.7k • • 12 -
Running66
NeuralMonarch 7B GGUF Chat
👑Chat with NeuralMonarch-7B
🔀 Phixtral
The first Mixture of Experts with phi-2 models.
🧠 NeuralHermes-2.5
Models and code related to the DPO fine-tuned OpenHermes-2.5-Mistral-7B
💻 CodeLlama
Llama and CodeLlama models trained to improve the performance in terms of code generation.
📙 LLM Engineer's Handbook
Models and datasets from my book. All the code is freely available at https://github.com/PacktPublishing/LLM-Engineers-Handbook
✂️ Abliteration
Uncensored models using abliteration. See this article for more information: huggingface.co/blog/mlabonne/abliteration
-
mlabonne/gemma-3-27b-it-abliterated
Image-Text-to-Text • 27B • Updated • 3.99k • • 177 -
mlabonne/gemma-3-27b-it-abliterated-GGUF
Image-Text-to-Text • 27B • Updated • 17.8k • 117 -
mlabonne/gemma-3-12b-it-abliterated-v2
Image-Text-to-Text • 12B • Updated • 2.19k • 6 -
mlabonne/gemma-3-4b-it-abliterated-v2
Image-Text-to-Text • 4B • Updated • 3.17k • 8
👿 Daredevil-8B
Fine-tuned abliterated merge of the best Llama 3 8B model. Highest MMLU score in its category.
-
mlabonne/NeuralDaredevil-8B-abliterated
Text Generation • 8B • Updated • 15.2k • • 220 -
mlabonne/Daredevil-8B-abliterated
Text Generation • 8B • Updated • 11.5k • • 50 -
mlabonne/Daredevil-8B
Text Generation • 8B • Updated • 116 • 41 -
mlabonne/NeuralLlama-3-8B-Instruct-abliterated
Text Generation • 8B • Updated • 2.71k • • 9
👑 Monarch
Family of 7B models that combine excellent reasoning and conversational abilities.
-
mlabonne/AlphaMonarch-7B
Text Generation • 7B • Updated • 12.9k • • 148 -
Running on Zero2727
AlphaMonarch-7B
👑Generate text based on user messages and a chat history
-
mlabonne/NeuralMonarch-7B
Text Generation • 7B • Updated • 29.7k • • 12 -
Running66
NeuralMonarch 7B GGUF Chat
👑Chat with NeuralMonarch-7B
🔮 Mixture of Experts
MoE done using mergekit and LazyMergekit: https://colab.research.google.com/drive/1obulZ1ROXHjYLn6PPZJwRR6GzgQogxxb#scrollTo=d5mYzDo1q96y
🔀 Phixtral
The first Mixture of Experts with phi-2 models.
🐶 Beagle
Merges done using mergekit and LazyMergekit: https://colab.research.google.com/drive/1obulZ1ROXHjYLn6PPZJwRR6GzgQogxxb#scrollTo=d5mYzDo1q96y
🧠 NeuralHermes-2.5
Models and code related to the DPO fine-tuned OpenHermes-2.5-Mistral-7B
🥼 DrMistral
Mistral and Llama models trained on a corpus of French and English data to act as a medical chatbot and ace exams.
💻 CodeLlama
Llama and CodeLlama models trained to improve the performance in terms of code generation.
🦙 Llama 2 Guanaco
Set of models fine-tuned using QLoRA on Google Colab with the Guanaco dataset.