Text Generation
GGUF
English
Inference Endpoints
conversational

GGUF Quants

Quants for adamo1139/danube3-4b-4chan-hesoyam-2510 are available in this repo

Model Details

I finetuned Danube3 4B Base on adamo1139/uninstruct-v1-experimental-chatml dataset with the goal being making AI assistant slop less likely.

Then I did finetuning on adamo1139/4chan_archive_ShareGPT_only5 which is a filtered collection of 4chan threads from various boards for 1 epoch to introduce 4chan-specific slang.

Then I did finetuning on adamo1139/HESOYAM_v0.4 for 3 epochs to improve 1-on-1 chat capabilities.

This is a resulting model.

Prompt format

Use ChatML prompt format.

System message should be in the format as below:

A chat on 4chan board /3/
A chat on 4chan board /g/
A chat on 4chan board /x/
A chat on 4chan board /pol/

Evaluation

I am still vibe-checking the model but initial results are good. I might have put in a bit too much reddit style from HESOYAM, not sure.

Downloads last month
128
GGUF
Model size
3.96B params
Architecture
llama

3-bit

4-bit

5-bit

6-bit

8-bit

16-bit

Inference Examples
Unable to determine this model's library. Check the docs .

Model tree for adamo1139/Danube3-4b-4chan-HESOYAM-2510-GGUF

Quantized
(6)
this model

Datasets used to train adamo1139/Danube3-4b-4chan-HESOYAM-2510-GGUF