Merge Crew

Activity Feed Request to join this org

AI & ML interests

Merging models

Recent Activity

KennethEnevoldsen authored a paper 16 days ago

Dynaword: From One-shot to Continuously Developed Datasets

KennethEnevoldsen authored a paper 4 months ago

MIEB: Massive Image Embedding Benchmark

birgermoell authored a paper 5 months ago

Medical Reasoning in LLMs: An In-Depth Analysis of DeepSeek R1

View all activity

mlabonne

posted an update 9 days ago

Post

4419

Liquid just released two 450M and 1.6B param VLMs!

They're super fast and leverage SigLIP2 NaFlex encoders to handle native resolutions without distortion. It's ideal for on-device deployment in constrained environments like phones.

It's available today on Hugging Face, with an inference and a fine-tuning Colab notebooks.

LiquidAI/LFM2-VL-450M
LiquidAI/LFM2-VL-1.6B

KennethEnevoldsen

authored a paper 16 days ago

Dynaword: From One-shot to Continuously Developed Datasets

Paper • 2508.02271 • Published 18 days ago • 13

mlabonne

posted an update about 1 month ago

Post

5436

LiquidAI open-sources a new generation of edge LLMs! 🥳

Based on a new hybrid architecture, these 350M, 700M, and 1.2B models are both fast and performant, ideal for on-device deployment.

I recommend fine-tuning them to power your next edge application. We already provide Colab notebooks to guide you. More to come soon!

📝 Blog post: https://www.liquid.ai/blog/liquid-foundation-models-v2-our-second-series-of-generative-ai-models
🤗 Models: LiquidAI/lfm2-686d721927015b2ad73eaa38

1 reply

KennethEnevoldsen

authored a paper 4 months ago

MIEB: Massive Image Embedding Benchmark

Paper • 2504.10471 • Published Apr 14 • 18

birgermoell

authored 3 papers 5 months ago

Medical Reasoning in LLMs: An In-Depth Analysis of DeepSeek R1

Paper • 2504.00016 • Published Mar 27 • 1

The order in speech disorder: a scoping review of state of the art machine learning methods for clinical speech classification

Paper • 2503.04802 • Published Mar 3

Artificial Humans

Paper • 2503.16502 • Published Mar 12

mlabonne

posted an update 5 months ago

Post

17252

✂️ AutoAbliteration

I made a Colab notebook to automatically abliterate models.

It's quite general, so you can do interesting stuff like blocking a given language in the model outputs.

💻 Colab: https://colab.research.google.com/drive/1RmLv-pCMBBsQGXQIM8yF-OdCNyoylUR1?usp=sharing

1 reply

mlabonne

posted an update 5 months ago

Post

6409

✂️ Gemma 3 Abliterated

I noticed that Gemma 3 was much more resilient to refusal removal than other models like Qwen 2.5.

I experimented with different recipes and improved the abliteration technique I wrote about last year.

It's still experimental but the refusal rate is super low in my tests. Enjoy!

mlabonne/gemma-3-4b-it-abliterated
mlabonne/gemma-3-12b-it-abliterated
mlabonne/gemma-3-27b-it-abliterated

4 replies

birgermoell

authored a paper 6 months ago

Voice Cloning for Dysarthric Speech Synthesis: Addressing Data Scarcity in Speech-Language Pathology

Paper • 2503.01266 • Published Mar 3

KennethEnevoldsen

authored 2 papers 6 months ago

TextDescriptives: A Python package for calculating a large variety of metrics from text

Paper • 2301.02057 • Published Jan 5, 2023

MMTEB: Massive Multilingual Text Embedding Benchmark

Paper • 2502.13595 • Published Feb 19 • 38

birgermoell

authored 2 papers 6 months ago

Language Complexity Measurement as a Noisy Zero-Shot Proxy for Evaluating LLM Performance

Paper • 2502.11578 • Published Feb 17

Large Language Models and Mathematical Reasoning Failures

Paper • 2502.11574 • Published Feb 17 • 3

mlabonne

posted an update 7 months ago

Post

6820

🆕 LLM Course 2025 edition!

I updated the LLM Scientist roadmap and added a ton of new information and references. It covers training, datasets, evaluation, quantization, and new trends like test-time compute scaling.

The LLM Course has been incredibly popular (41.3k stars!) and I've been touched to receive many, many messages about how it helped people in their careers.

I know how difficult this stuff can be, so I'm super proud of the impact it had. I want to keep updating it in 2025, especially with the LLM Engineer roadmap.

Thanks everyone, hope you'll enjoy it!

💻 LLM Course: https://huggingface.co/blog/mlabonne/llm-course