MElHuseyni's Collections

LLMs Inference

updated Nov 12, 2024

  • DeepSpeed-FastGen: High-throughput Text Generation for LLMs via MII and DeepSpeed-Inference

    Paper • 2401.08671 • Published Jan 9, 2024 • 15

  • NanoFlow: Towards Optimal Large Language Model Serving Throughput

    Paper • 2408.12757 • Published Aug 22, 2024 • 18

  • richard-park/llama3-deepspeed-v1.0

    Text Generation • Updated Jul 4, 2024 • 1.43k • 1