EXL2 70B for 24GB VRAM Collection 70B LLMs that fit in 24GB VRAM with over 16k context (with exl2 Q4 cache) • 11 items • Updated Jul 14 • 1