Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
帖子、文章和讨论
New Article
分析和解读
社区动态
教程
开源协作
合作伙伴
科研相关
NLP
Audio
CV
RL
AI 伦理
扩散模型
游戏开发
Community Articles
view all
We’re open-sourcing our text-to-image model and the process behind it
12 days ago
•
69
Apriel-H1: The Surprising Key to Distilling Efficient Reasoning Models
5 days ago
•
21
Introducing Cogito v2.1
5 days ago
•
17
Text-to-image Architectural Experiments
11 days ago
•
34
AI Model Optimization More Flexible Than Ever
7 days ago
•
12
How to make NeuTTS-air generate over 200 seconds of audio in a single second.
3 days ago
•
11
Projected Abliteration
30 days ago
•
28
The Heterogeneous Feature of RoPE-based Attention in Long-Context LLMs
9 days ago
•
11
ViDoRe V3: a comprehensive evaluation of retrieval for enterprise use-cases
19 days ago
•
50
The Pharmome Map: a comprehensive public dataset for drug-target interaction modeling
6 days ago
•
9
Norm-Preserving Biprojected Abliteration
17 days ago
•
15
Uncensor any LLM with abliteration
Jun 13, 2024
•
722
KV Caching Explained: Optimizing Transformer Inference Efficiency
Jan 30
•
177
The 1 Billion Token Challenge: Finding the Perfect Pre-training Mix
21 days ago
•
42
🧠 SQaLe: Enabling new Text-to-SQL models with our massive dataset
5 days ago
•
6
Prefill and Decode for Concurrent Requests - Optimizing LLM Performance
Apr 16
•
55
Granite 4.0 Nano: Just how small can you go?
27 days ago
•
119
Why Did MiniMax M2 End Up as a Full Attention Model?
25 days ago
•
65
DeLERP: Decomposed Linear Interpolation for Model Merging
4 days ago
•
4
The Epstein Files: Democratizing Access to Public Records
1 day ago
•
4
diffusers
guide
diffusion-transformers
基于 Quanto 和 Diffusers 的内存高效 transformer 扩散模型
1
2024年7月30日
community
evaluation
synthetic-data
LAVE:使用 LLM 对 Docmatix 进行零样本 VQA 评估 - 我们还需要微调吗?
2024年7月25日
nlp
community
research
Llama 3.1:405B/70B/8B 模型的多语言与长上下文能力解析
+4
2
2024年7月23日
community
datasets
synthetic-data
Docmatix - 超大文档视觉问答数据集
2024年7月18日
nlp
tgi
LLM
TGI 多-LoRA:部署一次,搞定 30 个模型的推理服务
1
2024年7月18日
llm
nlp
synthetic-data
SmolLM:一个超快速、超高性能的小模型集合
2
2024年7月16日
ai4math
nlp
community
NuminaMath 是如何荣膺首届 AIMO 进步奖的?
+4
2024年7月11日
datasets
pii
在 Hub 上使用 Presidio 进行自动 PII 检测实验
2024年7月10日
vlm
multimodal
trl
为视觉语言多模态模型进行偏好优化
2024年7月10日
partnerships
intel
llm
在英特尔 Gaudi 2 上加速蛋白质语言模型 ProtST
+3
2024年7月3日
agents
nlp
community
Transformers 代码智能体成功刷榜 GAIA
2024年7月1日
nlp
community
research
Google 发布最新开放大语言模型 Gemma 2,现已登陆 Hugging Face Hub
+2
2024年6月27日
collaboration
community
open-source
微调 Florence-2 - 微软的尖端视觉语言模型
2024年6月24日
open-source
guide
research
从 DeepSpeed 到 FSDP,再回到 Hugging Face Accelerate
2024年6月13日
上一页
1
2
3
4
5
6
...
16
下一页
Community Articles
Sort: Trending
We’re open-sourcing our text-to-image model and the process behind it
12 days ago
•
69
Apriel-H1: The Surprising Key to Distilling Efficient Reasoning Models
5 days ago
•
21
Introducing Cogito v2.1
5 days ago
•
17
Text-to-image Architectural Experiments
11 days ago
•
34
AI Model Optimization More Flexible Than Ever
7 days ago
•
12
How to make NeuTTS-air generate over 200 seconds of audio in a single second.
3 days ago
•
11
Projected Abliteration
30 days ago
•
28
The Heterogeneous Feature of RoPE-based Attention in Long-Context LLMs
9 days ago
•
11
ViDoRe V3: a comprehensive evaluation of retrieval for enterprise use-cases
19 days ago
•
50
The Pharmome Map: a comprehensive public dataset for drug-target interaction modeling
6 days ago
•
9
Norm-Preserving Biprojected Abliteration
17 days ago
•
15
Uncensor any LLM with abliteration
Jun 13, 2024
•
722
KV Caching Explained: Optimizing Transformer Inference Efficiency
Jan 30
•
177
The 1 Billion Token Challenge: Finding the Perfect Pre-training Mix
21 days ago
•
42
🧠 SQaLe: Enabling new Text-to-SQL models with our massive dataset
5 days ago
•
6
Prefill and Decode for Concurrent Requests - Optimizing LLM Performance
Apr 16
•
55
Granite 4.0 Nano: Just how small can you go?
27 days ago
•
119
Why Did MiniMax M2 End Up as a Full Attention Model?
25 days ago
•
65
DeLERP: Decomposed Linear Interpolation for Model Merging
4 days ago
•
4
The Epstein Files: Democratizing Access to Public Records
1 day ago
•
4
View all articles