Matt Cool
mbcool
AI & ML interests
Open Source, local and offline.
Recent Activity
commented on an article 5 days ago
KV Caching Explained: Optimizing Transformer Inference Efficiency
upvoted an article 5 days ago
KV Caching Explained: Optimizing Transformer Inference Efficiency
liked a model 11 months ago
Mozilla/Meta-Llama-3.1-8B-Instruct-llamafile