<23GB quant for 24GB GPUs? IQ2_XS or IQ2_XXS?

#4
by kerrmetric - opened

Could I request a <23GB quant for 24GB GPUs? Q2_XS or IQ2_XXS should work great.

@kerrmetric just added some more

Also can try out @legraphista , he does good work :)

Thanks! Will do

Sign up or log in to comment