DavidAU/M-Metaphors-Of-Madness-19.4B-GGUF

DavidAU committed on Oct 20

Commit f617a96 • 1 Parent(s): 7511632

Update README.md

Files changed (1)

README.md +10 -2

@@ -77,9 +77,17 @@ Definitely set a "hard limit" for role play and/or chat.
 Different quants will give you slightly different prose, with higher quants giving strongest level of detail, "there"
 and nuance. Q4+ recommended.
-Q2k: If you use this quant you may need to lower temp (less then 1) and raise rep pen (1.08+) to address quality loss.
-IQ4XS: This might be the quant with the most difference compared to other quants.
+Q2K:
+If you use this quant you may need to lower temp (less than 1) and raise rep pen (1.08+) to address quality loss.
+IQ4XS:
+This might be the quant with the most differences (contrast creativity) compared to other quants.
+ARM QUANTS:
+These quants are for specific systems that support them. If you use them on a regular computer / GPU, tokens per second will be VERY SLOW.
 Enjoy!
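
The Q2K note in this change can be read as a small per-quant sampler override. Below is a minimal sketch of that idea, not something taken from the README: the helper name `sampler_settings`, the dictionary keys, the baseline values in `DEFAULTS`, and the specific temperature of 0.8 are all assumptions chosen for illustration. Only "temp less than 1" and "rep pen 1.08+" come from the text above.

```python
# Hypothetical helper; names and baseline values are illustrative assumptions,
# not settings published in the model card.
DEFAULTS = {"temperature": 1.0, "repeat_penalty": 1.05}  # assumed baseline


def sampler_settings(quant: str) -> dict:
    """Return sampler settings, adjusted for the Q2K quant as the README suggests."""
    settings = dict(DEFAULTS)
    # Normalize spellings such as "Q2_K", "q2k", or "Q2K" to one form.
    name = quant.upper().replace("_", "")
    if name == "Q2K":
        settings["temperature"] = 0.8      # "less than 1"; 0.8 is an assumed example value
        settings["repeat_penalty"] = 1.08  # "1.08+" per the README note
    return settings
```

The returned dictionary could then be mapped onto whatever temperature and repetition-penalty options your inference front end exposes; other quants fall through to the assumed baseline unchanged.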