Create README.md
Browse files
README.md
ADDED
@@ -0,0 +1,80 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
+
---
|
2 |
+
base_model:
|
3 |
+
- migtissera/Tess-70B-v1.6
|
4 |
+
- 152334H/miqu-1-70b-sf
|
5 |
+
- NeverSleep/MiquMaid-v2-70B
|
6 |
+
- sophosympatheia/Midnight-Miqu-70B-v1.0
|
7 |
+
library_name: transformers
|
8 |
+
tags:
|
9 |
+
- mergekit
|
10 |
+
- merge
|
11 |
+
|
12 |
+
---
|
13 |
+
# Miqu-MS-70B
|
14 |
+
|
15 |
+
This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).
|
16 |
+
|
17 |
+
The new MODEL STOCK merge method was used, see below for more information!
|
18 |
+
|
19 |
+
Feedback on this model is greatly appreciated! I hope this new merge method will be able to fill some hole Miqu have.
|
20 |
+
|
21 |
+
## Merge Details
|
22 |
+
### Merge Method
|
23 |
+
|
24 |
+
This model was merged using the [Model Stock](https://arxiv.org/abs/2403.19522) merge method using [152334H/miqu-1-70b-sf](https://huggingface.co/152334H/miqu-1-70b-sf) as a base.
|
25 |
+
|
26 |
+
### Models Merged
|
27 |
+
|
28 |
+
The following models were included in the merge:
|
29 |
+
* [migtissera/Tess-70B-v1.6](https://huggingface.co/migtissera/Tess-70B-v1.6)
|
30 |
+
* [NeverSleep/MiquMaid-v2-70B](https://huggingface.co/NeverSleep/MiquMaid-v2-70B)
|
31 |
+
* [sophosympatheia/Midnight-Miqu-70B-v1.0](https://huggingface.co/sophosympatheia/Midnight-Miqu-70B-v1.0)
|
32 |
+
|
33 |
+
### Configuration
|
34 |
+
|
35 |
+
The following YAML configuration was used to produce this model:
|
36 |
+
|
37 |
+
```yaml
|
38 |
+
models:
|
39 |
+
- model: NeverSleep/MiquMaid-v2-70B
|
40 |
+
- model: sophosympatheia/Midnight-Miqu-70B-v1.0
|
41 |
+
- model: migtissera/Tess-70B-v1.6
|
42 |
+
- model: 152334H/miqu-1-70b-sf
|
43 |
+
merge_method: model_stock
|
44 |
+
base_model: 152334H/miqu-1-70b-sf
|
45 |
+
dtype: bfloat16
|
46 |
+
```
|
47 |
+
### Prompt format
|
48 |
+
|
49 |
+
Since it was made with model using different prompt format, the following should work.
|
50 |
+
|
51 |
+
## Alpaca
|
52 |
+
|
53 |
+
```
|
54 |
+
### Instruction:
|
55 |
+
{system prompt}
|
56 |
+
|
57 |
+
### Input:
|
58 |
+
{prompt}
|
59 |
+
|
60 |
+
### Response:
|
61 |
+
{output}
|
62 |
+
```
|
63 |
+
|
64 |
+
## Mistral
|
65 |
+
|
66 |
+
```
|
67 |
+
[INST] {prompt} [/INST]
|
68 |
+
```
|
69 |
+
|
70 |
+
## Vicuna
|
71 |
+
|
72 |
+
```
|
73 |
+
SYSTEM: <ANY SYSTEM CONTEXT>
|
74 |
+
USER:
|
75 |
+
ASSISTANT:
|
76 |
+
```
|
77 |
+
|
78 |
+
### Support
|
79 |
+
|
80 |
+
If you want to support me, you can [here](https://ko-fi.com/undiai).
|