24GB VRAM Optimal Quants Collection When asked what I use locally on a 24GB card, this is what I point to. I favor exl2s for long context, GGUF for very short context. • 12 items • Updated about 1 month ago • 2