--- license: cc-by-nc-4.0 tags: - not-for-all-audiences - nsfw --- ## Exl2 version of [Undi95/FlatDolphinMaid-8x7B](https://huggingface.co/Undi95/FlatDolphinMaid-8x7B) ## branch 3.5bh8 : 3.5bpw h8 Using ThePile [0007.parquet](https://huggingface.co/datasets/EleutherAI/the_pile_deduplicated/resolve/refs%2Fconvert%2Fparquet/default/train/0007.parquet) as dataset Quantization settings : ```python convert.py -i models/Undi95_FlatDolphinMaid-8x7B -o FlatDolphinMaid-8x7B-temp -cf FlatDolphinMaid-8x7B-3.5bpw-h8-exl2 -c 0007.parquet -l 8192 -b 3.5 -hb 8 -m FlatDolphinMaid-8x7B-measurement.json -ml 8192``` ### below this line is original readme First experimental merge of Noromaid 8x7b (Instruct) and dolphin 8x7b. The idea behind this is to add a little more IQ to the model, because Noromaid was only trained on RP/ERP data. Dolphin 2.7 is the only real Mixtral finetune I consider "usable", and so the merging quest begin again kek. Merged Dolphin 2.7 with Mixtral Base (Dolphin was at 1.0 weight) to get rid of ChatLM, and then I merged Noromaid 8x7b with the output, SLERP method. This model feel better on the IQ chart and have the ~same average ERP score on ayumi bench' than Noromaid 8x7b, but it's softer and more prude too, it also have the typical Mixtral repeat issue at some point. Choose your poison. ![image/png](https://cdn-uploads.huggingface.co/production/uploads/63ab1241ad514ca8d1430003/uZlU0PEPtKPZPLzXcoqJ_.png) ## Description This repo contains fp16 files of FlatDolphinMaid-8x7B. ## Models used - [mistralai/Mixtral-8x7B-v0.1](https://huggingface.co/mistralai/Mixtral-8x7B-v0.1) - [cognitivecomputations/dolphin-2.7-mixtral-8x7b](https://huggingface.co/cognitivecomputations/dolphin-2.7-mixtral-8x7b) - [NeverSleep/Noromaid-v0.1-mixtral-8x7b-Instruct-v3](https://huggingface.co/NeverSleep/Noromaid-v0.1-mixtral-8x7b-Instruct-v3) ### Custom format: ``` ### Instruction: {system prompt} ### Input: {input} ### Response: {reply} ``` If you want to support me, you can [here](https://ko-fi.com/undiai).