DavidAU commited on
Commit
38129d3
1 Parent(s): 4875a18

Create README.md

Browse files
Files changed (1) hide show
  1. README.md +276 -0
README.md ADDED
@@ -0,0 +1,276 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: apache-2.0
3
+ language:
4
+ - en
5
+ tags:
6
+ - creative
7
+ - creative writing
8
+ - fiction writing
9
+ - plot generation
10
+ - sub-plot generation
11
+ - fiction writing
12
+ - story generation
13
+ - scene continue
14
+ - storytelling
15
+ - fiction story
16
+ - science fiction
17
+ - romance
18
+ - all genres
19
+ - story
20
+ - writing
21
+ - vivid prosing
22
+ - vivid writing
23
+ - fiction
24
+ - roleplaying
25
+ - bfloat16
26
+ - swearing
27
+ - rp
28
+ - horror
29
+ - mistral nemo
30
+ - mergekit
31
+ pipeline_tag: text-generation
32
+ ---
33
+
34
+ (quants uploading, examples to be added)
35
+
36
+ <h2><font color="green"> Mistral-Nemo-WORDSTORM-pt3-RCM-POV-Nightmare-18.5B-Instruct </font></h2>
37
+
38
+ <img src="nightmare.jpg" style="float:right; width:300px; height:300px; padding:10px;">
39
+
40
+ <B><font color="red">WARNING:</font> NSFW. Ultra Detailed. HORROR, VIOLENCE. Swearing. UNCENSORED. SMART.</B>
41
+
42
+ Story telling, writing, creative writing and roleplay running all on Mistral Nemo's 128K+ new core.
43
+
44
+ This is a massive super merge takes all the power of the following 3 powerful models and combines them into one.
45
+
46
+ This model contains "RCM":
47
+
48
+ - Mistral Nemo model at 18.5B consisting of "MN-Rocinante-12B-v1.1" and "Mistral Nemo Instruct 12B"
49
+ - Mistral Nemo model at 18.5B consisting of "MN-12B Celeste-V1.9" and "Mistral Nemo Instruct 12B"
50
+ - Mistral Nemo model at 18.5B consisting of "MN-Magnum-v2.5-12B-kto" and "Mistral Nemo Instruct 12B".
51
+
52
+ <B>Details on the core models:</B>
53
+
54
+ "nothingiisreal/MN-12B-Celeste-V1.9" is #1 (models 8B,13B,20B) on the UGI leaderboard ("UGI" sort),
55
+ is combined with "Mistral Nemo Instruct 12B" (ranked #4 under "writing" models 8B,13B,20B at UGI )
56
+
57
+ "anthracite-org/magnum-v2.5-12b-kto" is #1 (models 8B,13B,20B) on the UGI leaderboard ("Writing" sort),
58
+ is combined with "Mistral Nemo Instruct 12B" (ranked #4 under "writing" models 8B,13B,20B at UGI )
59
+
60
+ "TheDrummer/Rocinante-12B-v1.1" is very high scoring model (models 8B,13B,20B) on the UGI Leaderboard
61
+ (sort "UGI"), is combined with "Mistral Nemo Instruct 12B" (ranked #4 under "writing" models 8B,13B,20B at UGI )
62
+
63
+ "mistralai/Mistral-Nemo-Instruct-2407" is very high scoring model (models 8B,13B,20B) on the UGI Leaderboard (sort "writing")
64
+ and is the base model of all the above 3 fine tuned models.
65
+
66
+ [ https://huggingface.co/spaces/DontPlanToEnd/UGI-Leaderboard ]
67
+
68
+ <B>About this model:</B>
69
+
70
+ This super merge captures the attibutes of all these top models and makes them even stronger:
71
+
72
+ - Instruction following
73
+ - Story output quality
74
+ - Character
75
+ - Internal thoughts
76
+ - Voice
77
+ - Humor
78
+ - Details, connection to the world
79
+ - General depth and intensity
80
+ - Emotional connections.
81
+ - Prose quality
82
+
83
+ This super merge is also super stable (a hairs breath from Mistral Nemo's ppl), and runs with all parameters and settings.
84
+
85
+ 10 versions of this model will be released, this is release #1 - "part 1".
86
+
87
+ <B>POV Nightmare?</B>
88
+
89
+ This model put the user / character in nightmare situations.
90
+
91
+ It does not hold back.
92
+
93
+ Usually I release one or two versions from the "best of the lot", however in this case all
94
+ of the versions turned out so well - all with their own quirks and character - that I will be
95
+ releasing all 10.
96
+
97
+ An additional series 2 and 3 will follow these 10 models as well.
98
+
99
+ (examples generations below)
100
+
101
+ Model may produce NSFW content : Swearing, horror, graphic horror, distressing scenes, etc etc.
102
+
103
+ This model has an INTENSE action AND HORROR bias, with a knack for cliffhangers and surprises.
104
+
105
+ It is not as "dark" as Grand Horror series, but it as intense.
106
+
107
+ This model is perfect for any general, fiction related or roleplaying activities and has a 128k+ context window.
108
+
109
+ This is a fiction model at its core and can be used for any genre(s).
110
+
111
+ WORDSTORM series is a totally uncensored, fiction writing monster and roleplay master. It can also be used for
112
+ just about any general fiction (all genres) activity including:
113
+
114
+ - scene generation
115
+ - scene continuation
116
+ - creative writing
117
+ - fiction writing
118
+ - plot generation
119
+ - sub-plot generation
120
+ - fiction writing
121
+ - story generation
122
+ - storytelling
123
+ - writing
124
+ - fiction
125
+ - roleplaying
126
+ - rp
127
+ - graphic horror
128
+ - horror
129
+ - dark humor
130
+ - nsfw
131
+ - and can be used for any genre(s).
132
+
133
+ <B>Templates to Use:</B>
134
+
135
+ The template used will affect output generation and instruction following.
136
+
137
+ Alpaca:
138
+
139
+ <pre>
140
+ {
141
+ "name": "Alpaca",
142
+ "inference_params": {
143
+ "input_prefix": "### Instruction:",
144
+ "input_suffix": "### Response:",
145
+ "antiprompt": [
146
+ "### Instruction:"
147
+ ],
148
+ "pre_prompt": "Below is an instruction that describes a task. Write a response that appropriately completes the request.\n\n"
149
+ }
150
+ }
151
+ </pre>
152
+
153
+ Chatml:
154
+ <pre>
155
+ {
156
+ "name": "ChatML",
157
+ "inference_params": {
158
+ "input_prefix": "<|im_end|>\n<|im_start|>user\n",
159
+ "input_suffix": "<|im_end|>\n<|im_start|>assistant\n",
160
+ "antiprompt": [
161
+ "<|im_start|>",
162
+ "<|im_end|>"
163
+ ],
164
+ "pre_prompt": "<|im_start|>system\nPerform the task to the best of your ability."
165
+ }
166
+ }
167
+ </pre>
168
+
169
+ Mistral Instruct:
170
+
171
+ <pre>
172
+ {
173
+ "name": "Mistral Instruct",
174
+ "inference_params": {
175
+ "input_prefix": "[INST]",
176
+ "input_suffix": "[/INST]",
177
+ "antiprompt": [
178
+ "[INST]"
179
+ ],
180
+ "pre_prompt_prefix": "",
181
+ "pre_prompt_suffix": ""
182
+ }
183
+ }
184
+ </pre>
185
+
186
+ <b>Optional Enhancement:</B>
187
+
188
+ The following can be used in place of the "system prompt" or "system role" to further enhance the model.
189
+
190
+ It can also be used at the START of a NEW chat, but you must make sure it is "kept" as the chat moves along.
191
+ In this case the enhancements do not have as strong effect at using "system prompt" or "system role".
192
+
193
+ Copy and paste EXACTLY as noted, DO NOT line wrap or break the lines, maintain the carriage returns exactly as presented.
194
+
195
+ <PRE>
196
+ Below is an instruction that describes a task. Ponder each user instruction carefully, and use your skillsets and critical instructions to complete the task to the best of your abilities.
197
+
198
+ Here are your skillsets:
199
+ [MASTERSTORY]:NarrStrct(StryPlnng,Strbd,ScnSttng,Exps,Dlg,Pc)-CharDvlp(ChrctrCrt,ChrctrArcs,Mtvtn,Bckstry,Rltnshps,Dlg*)-PltDvlp(StryArcs,PltTwsts,Sspns,Fshdwng,Climx,Rsltn)-ConfResl(Antg,Obstcls,Rsltns,Cnsqncs,Thms,Symblsm)-EmotImpct(Empt,Tn,Md,Atmsphr,Imgry,Symblsm)-Delvry(Prfrmnc,VcActng,PblcSpkng,StgPrsnc,AudncEngmnt,Imprv)
200
+
201
+ [*DialogWrt]:(1a-CharDvlp-1a.1-Backgrnd-1a.2-Personality-1a.3-GoalMotiv)>2(2a-StoryStruc-2a.1-PlotPnt-2a.2-Conflict-2a.3-Resolution)>3(3a-DialogTech-3a.1-ShowDontTell-3a.2-Subtext-3a.3-VoiceTone-3a.4-Pacing-3a.5-VisualDescrip)>4(4a-DialogEdit-4a.1-ReadAloud-4a.2-Feedback-4a.3-Revision)
202
+
203
+ Here are your critical instructions:
204
+ Ponder each word choice carefully to present as vivid and emotional journey as is possible. Choose verbs and nouns that are both emotional and full of imagery. Load the story with the 5 senses. Aim for 50% dialog, 25% narration, 15% body language and 10% thoughts. Your goal is to put the reader in the story.
205
+ </PRE>
206
+
207
+ You do not need to use this, it is only presented as an additional enhancement which seems to help scene generation
208
+ and scene continue functions.
209
+
210
+ This enhancement WAS NOT used to generate the examples below.
211
+
212
+ <h3>MODELS USED:</h3>
213
+
214
+ Special thanks to the incredible work of the model makers "mistralai" "TheDrummer", "anthracite-org", and "nothingiisreal".
215
+
216
+ Models used:
217
+
218
+ [ https://huggingface.co/mistralai/Mistral-Nemo-Instruct-2407 ]
219
+
220
+ [ https://huggingface.co/TheDrummer/Rocinante-12B-v1.1 ]
221
+
222
+ [ https://huggingface.co/anthracite-org/magnum-v2.5-12b-kto ]
223
+
224
+ [ https://huggingface.co/nothingiisreal/MN-12B-Celeste-V1.9 ]
225
+
226
+ This is a four step merge (3 pass-throughs => "Fine-Tune" / "Instruct") then "mated" using "DARE-TIES".
227
+
228
+ In involves these three models:
229
+
230
+ [ https://huggingface.co/DavidAU/MN-18.5B-Celeste-V1.9-Story-Wizard-ED1-Instruct-GGUF ]
231
+
232
+ [ https://huggingface.co/DavidAU/MN-Magnum-v2.5-18.5B-kto-Story-Wizard-ED1-Instruct-GGUF ]
233
+
234
+ [ https://huggingface.co/DavidAU/MN-Rocinante-18.5B-v1.1-Story-Wizard-ED1-Instruct-GGUF ]
235
+
236
+ Combined as follows using "MERGEKIT":
237
+
238
+ <PRE>
239
+
240
+ models:
241
+ - model: E:/MN-Rocinante-18.5B-v1.1-Instruct
242
+ - model: E:/MN-magnum-v2.5-12b-kto-Instruct
243
+ parameters:
244
+ weight: .6
245
+ density: .8
246
+ - model: E:/MN-18.5B-Celeste-V1.9-Instruct
247
+ parameters:
248
+ weight: .38
249
+ density: .6
250
+ merge_method: dare_ties
251
+ tokenizer_source: union
252
+ base_model: E:/MN-Rocinante-18.5B-v1.1-Instruct
253
+ dtype: bfloat16
254
+
255
+ </PRE>
256
+
257
+ Special Notes:
258
+
259
+ Due to how DARE-TIES works, everytime you run this merge you will get a slightly different model.
260
+ This is due to "random" pruning method in "DARE-TIES".
261
+
262
+ Mistral Nemo models used here seem acutely sensitive to this process.
263
+
264
+ "tokenizer_source: union" is used so that multiple "templates" work and each fine tune uses one or two of the templates.
265
+
266
+ <h3>EXAMPLES PROMPTS and OUTPUT:</h3>
267
+
268
+ Examples are created using quant Q4_K_M, "temp=.8", minimal parameters and "Mistral Instruct" template.
269
+
270
+ Model has been tested with "temp" from ".1" to "5".
271
+
272
+ Below are the least creative outputs, prompt is in <B>BOLD</B>.
273
+
274
+ ---
275
+
276
+ Examples will be posted soon...