Text Generation
GGUF
English
creative
creative writing
fiction writing
plot generation
sub-plot generation
story generation
scene continue
storytelling
fiction story
science fiction
romance
all genres
story
writing
vivid prosing
vivid writing
fiction
roleplaying
bfloat16
brainstorm 40x
swearing
rp
horror
mistral
mergekit
Inference Endpoints
conversational
Update README.md
Browse files
README.md
CHANGED
@@ -77,9 +77,17 @@ Definitely set a "hard limit" for role play and/or chat.
|
|
77 |
Different quants will give you slightly different prose, with higher quants giving strongest level of detail, "there"
|
78 |
and nuance. Q4+ recommended.
|
79 |
|
80 |
-
|
81 |
|
82 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
83 |
|
84 |
Enjoy!
|
85 |
|
|
|
77 |
Different quants will give you slightly different prose, with higher quants giving strongest level of detail, "there"
|
78 |
and nuance. Q4+ recommended.
|
79 |
|
80 |
+
Q2K:
|
81 |
|
82 |
+
If you use this quant you may need to lower temp (less then 1) and raise rep pen (1.08+) to address quality loss.
|
83 |
+
|
84 |
+
IQ4XS:
|
85 |
+
|
86 |
+
This might be the quant with the most differences (contrast creavity) compared to other quants.
|
87 |
+
|
88 |
+
ARM QUANTS:
|
89 |
+
|
90 |
+
These quants are for specific systems that support this. If you use on a regular computer / GPU token per second will be VERY SLOW.
|
91 |
|
92 |
Enjoy!
|
93 |
|