Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
帖子、文章和讨论
New Article
分析和解读
社区动态
教程
开源协作
合作伙伴
科研相关
NLP
Audio
CV
RL
AI 伦理
扩散模型
游戏开发
Community Articles
view all
We’re open-sourcing our text-to-image model and the process behind it
9 days ago
•
67
Text-to-image Architectural Experiments
8 days ago
•
33
Apriel-H1: The Surprising Key to Distilling Efficient Reasoning Models
2 days ago
•
18
Introducing Cogito v2.1
1 day ago
•
16
Projected Abliteration
27 days ago
•
26
AI Model Optimization More Flexible Than Ever
4 days ago
•
12
ViDoRe V3: a comprehensive evaluation of retrieval for enterprise use-cases
16 days ago
•
49
The Heterogeneous Feature of RoPE-based Attention in Long-Context LLMs
6 days ago
•
11
Uncensor any LLM with abliteration
Jun 13, 2024
•
722
The Pharmome Map: a comprehensive public dataset for drug-target interaction modeling
3 days ago
•
9
Norm-Preserving Biprojected Abliteration
14 days ago
•
14
KV Caching Explained: Optimizing Transformer Inference Efficiency
Jan 30
•
175
Granite 4.0 Nano: Just how small can you go?
24 days ago
•
119
Why Did MiniMax M2 End Up as a Full Attention Model?
22 days ago
•
65
The 1 Billion Token Challenge: Finding the Perfect Pre-training Mix
18 days ago
•
42
🧠 SQaLe: Enabling new Text-to-SQL models with our massive dataset
2 days ago
•
6
To Think or Not to Think: A Router for Hybrid LLMs
5 days ago
•
6
PEFT: Parameter-Efficient Fine-Tuning Methods for LLMs
Jan 24
•
49
Visualizing How VLMs Work
Oct 7
•
45
Fine-Tuning Your First Large Language Model (LLM) with PyTorch and Hugging Face
Feb 11
•
87
multimodal
on-device
llm
现在 Llama 具备视觉能力并可以在你的设备上运行 - 欢迎使用 Llama 3.2
+3
2024年9月25日
video
datasets
multimodal
揭秘 FineVideo 数据集构建的背后的秘密
+2
2
2024年9月23日
research
community
Hugging Face 论文平台 Daily Papers 功能全解析
4
2024年9月23日
intel
optimum
quantization
使用 Optimum-Intel 和 OpenVINO GenAI 优化和部署模型
+3
2024年9月20日
nlp
research
community
Fine-tuning LLMs to 1.58bit: extreme quantization made easy
+2
2
2024年9月18日
datasets
sql
duckdb
为数据集而生的 SQL 控制台
2024年9月17日
guide
Accelerate 1.0.0
2024年9月13日
padding
packing
Flash Attention 2
通过打包 Flash Attention 来提升 Hugging Face 训练效率
+2
1
2024年8月21日
long-context
infini-attention
memory-compression
一次失败的实验——无限注意力,我们为什么坚持实验
2024年8月14日
guide
community
ggml
ggml 简介
4
2024年8月13日
nlp
community
research
Falcon Mamba: 首个高效的无注意力机制 7B 模型
+2
1
2024年8月12日
LLM
nlp
community
对 LLM 工具使用进行统一
2024年8月12日
announcement
enterprise
hub
XetHub 加入 Hugging Face!
2024年8月8日
nlp
community
research
Google 最新发布: Gemma 2 2B, ShieldGemma 和 Gemma Scope
2024年7月31日
上一页
1
2
3
4
5
...
16
下一页
Community Articles
Sort: Trending
We’re open-sourcing our text-to-image model and the process behind it
9 days ago
•
67
Text-to-image Architectural Experiments
8 days ago
•
33
Apriel-H1: The Surprising Key to Distilling Efficient Reasoning Models
2 days ago
•
18
Introducing Cogito v2.1
1 day ago
•
16
Projected Abliteration
27 days ago
•
26
AI Model Optimization More Flexible Than Ever
4 days ago
•
12
ViDoRe V3: a comprehensive evaluation of retrieval for enterprise use-cases
16 days ago
•
49
The Heterogeneous Feature of RoPE-based Attention in Long-Context LLMs
6 days ago
•
11
Uncensor any LLM with abliteration
Jun 13, 2024
•
722
The Pharmome Map: a comprehensive public dataset for drug-target interaction modeling
3 days ago
•
9
Norm-Preserving Biprojected Abliteration
14 days ago
•
14
KV Caching Explained: Optimizing Transformer Inference Efficiency
Jan 30
•
175
Granite 4.0 Nano: Just how small can you go?
24 days ago
•
119
Why Did MiniMax M2 End Up as a Full Attention Model?
22 days ago
•
65
The 1 Billion Token Challenge: Finding the Perfect Pre-training Mix
18 days ago
•
42
🧠 SQaLe: Enabling new Text-to-SQL models with our massive dataset
2 days ago
•
6
To Think or Not to Think: A Router for Hybrid LLMs
5 days ago
•
6
PEFT: Parameter-Efficient Fine-Tuning Methods for LLMs
Jan 24
•
49
Visualizing How VLMs Work
Oct 7
•
45
Fine-Tuning Your First Large Language Model (LLM) with PyTorch and Hugging Face
Feb 11
•
87
View all articles