Text Generation
GGUF
English
Inference Endpoints
conversational
adamo1139 commited on
Commit
4bf173a
1 Parent(s): 830f09a

Create README.md

Browse files
Files changed (1) hide show
  1. README.md +43 -0
README.md ADDED
@@ -0,0 +1,43 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: apache-2.0
3
+ datasets:
4
+ - adamo1139/4chan_archive_ShareGPT_only5
5
+ - adamo1139/HESOYAM_v0.4
6
+ - adamo1139/uninstruct-v1-experimental-chatml
7
+ language:
8
+ - en
9
+ base_model:
10
+ - h2oai/h2o-danube3-4b-base
11
+ pipeline_tag: text-generation
12
+ ---
13
+ # GGUF Quants
14
+
15
+ Quants are available in this repo
16
+
17
+ # Model Details
18
+
19
+ I finetuned Danube3 4B Base on [adamo1139/uninstruct-v1-experimental-chatml](https://huggingface.co/datasets/adamo1139/uninstruct-v1-experimental-chatml) dataset with the goal being making AI assistant slop less likely.
20
+
21
+ Then I did finetuning on [adamo1139/4chan_archive_ShareGPT_only5](https://huggingface.co/datasets/adamo1139/4chan_archive_ShareGPT_only5) which is a filtered collection of 4chan threads from various boards for 1 epoch to introduce 4chan-specific slang.
22
+
23
+ Then I did finetuning on [adamo1139/HESOYAM_v0.4](https://huggingface.co/datasets/adamo1139/HESOYAM_v0.4) for 3 epochs to improve 1-on-1 chat capabilities.
24
+
25
+ This is a resulting model.
26
+
27
+ # Prompt format
28
+
29
+ Use ChatML prompt format.
30
+
31
+ System message should be in the format as below:
32
+
33
+ ```
34
+ A chat on 4chan board /3/
35
+ A chat on 4chan board /g/
36
+ A chat on 4chan board /x/
37
+ A chat on 4chan board /pol/
38
+ ```
39
+
40
+ # Evaluation
41
+
42
+ I am still vibe-checking the model but initial results are good. I might have put in a bit too much reddit style from HESOYAM, not sure.
43
+