Experts for GPU-Poors - a phi0112358 Collection

phi0112358 's Collections

updated Aug 25, 2024

GGUFs, conventional and k-quants – both without imatrix. This should be faster for CPU inference. Right now DeepSee MoEs (Mixture of Experts)