EXL2 version of Undi95/FlatDolphinMaid-8x7B
Branch
- 3.5bh8 : 3.5 bpw, 8-bit head (h8)
Uses The Pile (0007.parquet) as the calibration dataset.
Quantization command: python convert.py -i models/Undi95_FlatDolphinMaid-8x7B -o FlatDolphinMaid-8x7B-temp -cf FlatDolphinMaid-8x7B-3.5bpw-h8-exl2 -c 0007.parquet -l 8192 -b 3.5 -hb 8 -m FlatDolphinMaid-8x7B-measurement.json -ml 8192
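If you want to grab a specific quant branch programmatically, here is a minimal sketch using `huggingface_hub`; the repo id below is a placeholder for this quant repo (not stated above), so swap in the actual name.

```python
from huggingface_hub import snapshot_download

# Placeholder repo id for this EXL2 quant repo; replace with the real one.
snapshot_download(
    repo_id="your-username/FlatDolphinMaid-8x7B-exl2",
    revision="3.5bh8",  # branch holding the 3.5 bpw / h8 files
    local_dir="FlatDolphinMaid-8x7B-3.5bpw-h8-exl2",
)
```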
Below this line is the original README.
First experimental merge of Noromaid 8x7b (Instruct) and Dolphin 8x7b. The idea behind this is to add a little more IQ to the model, because Noromaid was only trained on RP/ERP data. Dolphin 2.7 is the only real Mixtral finetune I consider "usable", and so the merging quest begins again, kek.
I merged Dolphin 2.7 with the Mixtral base (Dolphin at 1.0 weight) to get rid of ChatML, and then merged Noromaid 8x7b with the output using the SLERP method.
This model feels better on the IQ chart and has roughly the same average ERP score on Ayumi's bench as Noromaid 8x7b, but it's also softer and more prudish, and it has the typical Mixtral repetition issue at some point. Choose your poison.
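For the curious, here is a minimal sketch of the SLERP interpolation idea behind such merges. It is illustrative only: the actual merge was presumably done with a merge toolkit, and the interpolation factor `t` below is an assumption, not the value used.

```python
import torch

def slerp(t: float, v0: torch.Tensor, v1: torch.Tensor, eps: float = 1e-8) -> torch.Tensor:
    """Spherical linear interpolation between two weight tensors."""
    v0_f, v1_f = v0.flatten().float(), v1.flatten().float()
    # Angle between the two tensors, treated as high-dimensional vectors
    dot = torch.dot(v0_f / (v0_f.norm() + eps), v1_f / (v1_f.norm() + eps)).clamp(-1.0, 1.0)
    omega = torch.acos(dot)
    if omega.abs() < 1e-4:
        # Nearly colinear: plain linear interpolation is numerically safer
        return (1.0 - t) * v0 + t * v1
    so = torch.sin(omega)
    mixed = (torch.sin((1.0 - t) * omega) / so) * v0_f + (torch.sin(t * omega) / so) * v1_f
    return mixed.reshape(v0.shape).to(v0.dtype)

# e.g. blended = slerp(0.5, dolphin_layer_weight, noromaid_layer_weight)
```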
Description
This repo contains fp16 files of FlatDolphinMaid-8x7B.
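A minimal sketch of loading those fp16 files with transformers (assumes enough VRAM/RAM for an 8x7B model and that `accelerate` is installed; the settings are illustrative, not from the original card):

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "Undi95/FlatDolphinMaid-8x7B"

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.float16,  # load the fp16 files as-is
    device_map="auto",          # spread across available devices
)
```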
Models used
- mistralai/Mixtral-8x7B-v0.1
- cognitivecomputations/dolphin-2.7-mixtral-8x7b
- NeverSleep/Noromaid-v0.1-mixtral-8x7b-Instruct-v3
Custom format:
### Instruction:
{system prompt}
### Input:
{input}
### Response:
{reply}
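A small helper showing one way to assemble that template in code; the blank-line spacing between sections is an assumption, so adjust to taste.

```python
def build_prompt(system_prompt: str, user_input: str) -> str:
    # Alpaca-style format given above; spacing between blocks is assumed
    return (
        f"### Instruction:\n{system_prompt}\n\n"
        f"### Input:\n{user_input}\n\n"
        f"### Response:\n"
    )

prompt = build_prompt(
    "You are a helpful roleplay assistant.",
    "Describe the tavern the party just entered.",
)
```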
If you want to support me, you can here.