Hugging Face
28.8 TFLOPS
Victor Mustar
PRO
victor
3270 followers · 1450 following
victormustar
AI & ML interests
Building the UX of this website
Recent Activity
liked a Space about 1 hour ago: teapotai/teapotchat
liked a model about 1 hour ago: teapotai/teapotllm
reacted to csabakecskemeti's post with an emoji about 2 hours ago
I'm collecting llama-bench results for inference with Llama 3.1 8B q4 and q8 reference models on various GPUs. The results are the average of 5 executions. The systems vary (different motherboard and CPU, but that probably has little effect on inference performance). https://devquasar.com/gpu-gguf-inference-comparison/ The exact models used are listed on the page. I'd welcome results from other GPUs if you have access to any; everything you need is in the post. Hopefully this is useful information for everyone.
Organizations
victor's activity
published an article over 1 year ago
Article
Inference for PROs
By osanseviero and 2 others • Sep 22, 2023 • 53