24 GB VRAM
Collection
Quants that run fast on single 3090/4090 card with 24GB of VRAM and 4096 context length
•
18 items
•
Updated
•
6
exllamav2 quant for tuantran1632001/Psyfighter2-Orca2-13B-ties
Runs smoothly on single 3090 in webui with context length set to 4096, ExLlamav2_HF loader and cache_8bit=True
All comments are greatly appreciated, download, test and if you appreciate my work, consider buying me my fuel: