帖子、文章和讨论

Community Articles

We’re open-sourcing our text-to-image model and the process behind it

Text-to-image Architectural Experiments

Apriel-H1: The Surprising Key to Distilling Efficient Reasoning Models

Introducing Cogito v2.1

Projected Abliteration

AI Model Optimization More Flexible Than Ever

ViDoRe V3: a comprehensive evaluation of retrieval for enterprise use-cases

The Heterogeneous Feature of RoPE-based Attention in Long-Context LLMs

Uncensor any LLM with abliteration

The Pharmome Map: a comprehensive public dataset for drug-target interaction modeling

Norm-Preserving Biprojected Abliteration

KV Caching Explained: Optimizing Transformer Inference Efficiency

Granite 4.0 Nano: Just how small can you go?

Why Did MiniMax M2 End Up as a Full Attention Model?

The 1 Billion Token Challenge: Finding the Perfect Pre-training Mix

🧠 SQaLe: Enabling new Text-to-SQL models with our massive dataset

To Think or Not to Think: A Router for Hybrid LLMs

PEFT: Parameter-Efficient Fine-Tuning Methods for LLMs

Visualizing How VLMs Work

Fine-Tuning Your First Large Language Model (LLM) with PyTorch and Hugging Face

multimodalon-devicellm

现在 Llama 具备视觉能力并可以在你的设备上运行 - 欢迎使用 Llama 3.2

+3

2024年9月25日

videodatasetsmultimodal

揭秘 FineVideo 数据集构建的背后的秘密

+2

2024年9月23日

researchcommunity

Hugging Face 论文平台 Daily Papers 功能全解析

2024年9月23日

inteloptimumquantization

使用 Optimum-Intel 和 OpenVINO GenAI 优化和部署模型

+3

2024年9月20日

nlpresearchcommunity

Fine-tuning LLMs to 1.58bit: extreme quantization made easy

+2

2024年9月18日

datasetssqlduckdb

为数据集而生的 SQL 控制台

2024年9月17日

Accelerate 1.0.0

2024年9月13日

paddingpackingFlash Attention 2

通过打包 Flash Attention 来提升 Hugging Face 训练效率

+2

2024年8月21日

long-contextinfini-attentionmemory-compression

一次失败的实验——无限注意力，我们为什么坚持实验

2024年8月14日

guidecommunityggml

ggml 简介

2024年8月13日

nlpcommunityresearch

Falcon Mamba: 首个高效的无注意力机制 7B 模型

+2

2024年8月12日

LLMnlpcommunity

对 LLM 工具使用进行统一

2024年8月12日

announcemententerprisehub

XetHub 加入 Hugging Face!

2024年8月8日

nlpcommunityresearch

Google 最新发布： Gemma 2 2B, ShieldGemma 和 Gemma Scope

2024年7月31日

Community Articles

We’re open-sourcing our text-to-image model and the process behind it

Text-to-image Architectural Experiments

Apriel-H1: The Surprising Key to Distilling Efficient Reasoning Models

Introducing Cogito v2.1

Projected Abliteration

AI Model Optimization More Flexible Than Ever

ViDoRe V3: a comprehensive evaluation of retrieval for enterprise use-cases

The Heterogeneous Feature of RoPE-based Attention in Long-Context LLMs

Uncensor any LLM with abliteration

The Pharmome Map: a comprehensive public dataset for drug-target interaction modeling

Norm-Preserving Biprojected Abliteration

KV Caching Explained: Optimizing Transformer Inference Efficiency

Granite 4.0 Nano: Just how small can you go?

Why Did MiniMax M2 End Up as a Full Attention Model?

The 1 Billion Token Challenge: Finding the Perfect Pre-training Mix

🧠 SQaLe: Enabling new Text-to-SQL models with our massive dataset

To Think or Not to Think: A Router for Hybrid LLMs

PEFT: Parameter-Efficient Fine-Tuning Methods for LLMs

Visualizing How VLMs Work

Fine-Tuning Your First Large Language Model (LLM) with PyTorch and Hugging Face

View all articles