adamo1139
/

Danube3-4b-4chan-HESOYAM-2510-GGUF

Text Generation

Inference Endpoints

Model card Files Files and versions Community

adamo1139 commited on Oct 26

Commit

4bf173a

•

1 Parent(s): 830f09a

Create README.md

Files changed (1) hide show

README.md +43 -0

README.md ADDED Viewed

	@@ -0,0 +1,43 @@

+---
+license: apache-2.0
+datasets:
+- adamo1139/4chan_archive_ShareGPT_only5
+- adamo1139/HESOYAM_v0.4
+- adamo1139/uninstruct-v1-experimental-chatml
+language:
+- en
+base_model:
+- h2oai/h2o-danube3-4b-base
+pipeline_tag: text-generation
+---
+# GGUF Quants
+Quants are available in this repo
+# Model Details
+I finetuned Danube3 4B Base on [adamo1139/uninstruct-v1-experimental-chatml](https://huggingface.co/datasets/adamo1139/uninstruct-v1-experimental-chatml) dataset with the goal being making AI assistant slop less likely.
+Then I did finetuning on [adamo1139/4chan_archive_ShareGPT_only5](https://huggingface.co/datasets/adamo1139/4chan_archive_ShareGPT_only5) which is a filtered collection of 4chan threads from various boards for 1 epoch to introduce 4chan-specific slang.
+Then I did finetuning on [adamo1139/HESOYAM_v0.4](https://huggingface.co/datasets/adamo1139/HESOYAM_v0.4) for 3 epochs to improve 1-on-1 chat capabilities.
+This is a resulting model.
+# Prompt format
+Use ChatML prompt format.
+System message should be in the format as below:
+```
+A chat on 4chan board /3/
+A chat on 4chan board /g/
+A chat on 4chan board /x/
+A chat on 4chan board /pol/
+```
+# Evaluation
+I am still vibe-checking the model but initial results are good. I might have put in a bit too much reddit style from HESOYAM, not sure.