Qwen2.5-3B-Instruct
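| Quant | Size (MB) | PPL | Size (%) | Accuracy (%) | PPL error rate |
| --- | --- | --- | --- | --- | --- |
| IQ1_S | 755 | 112.0612 | | | 0.97138 |
| IQ1_M | 811 | 42.7456 | | | 0.34718 |
| IQ2_XXS | 905 | 25.2117 | | | 0.20222 |
| IQ2_XS | 984 | 15.9149 | | | 0.11965 |
| IQ2_S | 1013 | 14.5975 | | | 0.10820 |
| IQ2_M | 1088 | 12.8779 | | | 0.09436 |
| Q2_K_S | 1143 | 13.0878 | | | 0.09636 |
| Q2_K | 1216 | 11.8001 | | | 0.08674 |
| IQ3_XXS | 1224 | 10.6049 | | | 0.07572 |
| IQ3_XS | 1328 | 10.0306 | | | 0.06975 |
| Q3_K_S | 1387 | 15.5457 | | | 0.11941 |
| IQ3_S | 1390 | 9.9591 | | | 0.06984 |
| IQ3_M | 1420 | 9.9957 | | | 0.06962 |
| Q3_K_M | 1517 | 14.0989 | | | 0.10568 |
| Q3_K_L | 1629 | 13.8579 | | | 0.10372 |
| IQ4_XS | 1659 | 9.2935 | | | 0.06517 |
| IQ4_NL | 1741 | 9.2824 | | | 0.06503 |
| Q4_0 | 1744 | 9.4850 | | | 0.06626 |
| Q4_K_S | 1750 | 9.2573 | | | 0.06485 |
| Q4_K_M | 1841 | 9.2305 | | | 0.06475 |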
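
The table is not monotonic: some larger quants report a higher perplexity than smaller ones (for example, Q3_K_S against IQ3_XS). A minimal sketch of how to list the quants that are not beaten on both size and PPL by another row is given below. The `rows` data is copied from the table; the helper name `pareto_frontier` is hypothetical, and the comparison ignores the PPL error rate column, so near-ties such as IQ3_S and IQ3_M may fall within measurement noise.

```python
# (quant, size in MB, perplexity), copied from the table above.
rows = [
    ("IQ1_S", 755, 112.0612), ("IQ1_M", 811, 42.7456),
    ("IQ2_XXS", 905, 25.2117), ("IQ2_XS", 984, 15.9149),
    ("IQ2_S", 1013, 14.5975), ("IQ2_M", 1088, 12.8779),
    ("Q2_K_S", 1143, 13.0878), ("Q2_K", 1216, 11.8001),
    ("IQ3_XXS", 1224, 10.6049), ("IQ3_XS", 1328, 10.0306),
    ("Q3_K_S", 1387, 15.5457), ("IQ3_S", 1390, 9.9591),
    ("IQ3_M", 1420, 9.9957), ("Q3_K_M", 1517, 14.0989),
    ("Q3_K_L", 1629, 13.8579), ("IQ4_XS", 1659, 9.2935),
    ("IQ4_NL", 1741, 9.2824), ("Q4_0", 1744, 9.4850),
    ("Q4_K_S", 1750, 9.2573), ("Q4_K_M", 1841, 9.2305),
]


def pareto_frontier(rows):
    """Return quant names whose PPL is lower than that of every smaller quant.

    Hypothetical helper: walks the rows in order of increasing size and keeps
    a row only if it improves on the best perplexity seen so far.
    """
    frontier = []
    best_ppl = float("inf")
    for name, size_mb, ppl in sorted(rows, key=lambda r: (r[1], r[2])):
        if ppl < best_ppl:
            frontier.append(name)
            best_ppl = ppl
    return frontier


frontier = pareto_frontier(rows)
# Quants that some smaller quant matches or beats on perplexity.
dominated = [name for name, _, _ in rows if name not in frontier]

print("Frontier: ", ", ".join(frontier))
print("Dominated:", ", ".join(dominated))
```

On this data the dominated rows are Q2_K_S, Q3_K_S, IQ3_M, Q3_K_M, Q3_K_L, and Q4_0. This only ranks size against perplexity; it says nothing about inference speed or hardware support, which may still favor one of those quants.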