---
base_model: lodrick-the-lafted/Grafted-Wind-Elementals-2x70B
language:
- en
library_name: transformers
license: other
quantized_by: mradermacher
tags:
- merge
- moe
---
## About

weighted/imatrix quants of https://huggingface.co/lodrick-the-lafted/Grafted-Wind-Elementals-2x70B

The imatrix training data was reduced to 130k tokens because llama.cpp otherwise corrupts the imatrix.

<!-- provided-files -->
Static quants are available at https://huggingface.co/mradermacher/Grafted-Wind-Elementals-2x70B-GGUF

## Usage

If you are unsure how to use GGUF files, refer to one of [TheBloke's
READMEs](https://huggingface.co/TheBloke/KafkaLM-70B-German-V0.1-GGUF) for
more details, including on how to concatenate multi-part files.

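As a rough illustration of the concatenation step, here is a minimal Python sketch. It assumes the parts are plain byte-wise splits named `<file>.gguf.partKofN` (the naming used in the table in this README) and that all parts sit in the same directory. The helper name `concatenate_parts` is hypothetical and not part of any library.

```python
import re
import shutil
from pathlib import Path

# Matches names such as "model.i1-Q6_K.gguf.part1of3".
PART_RE = re.compile(r"^(?P<base>.+)\.part(?P<idx>\d+)of(?P<total>\d+)$")


def concatenate_parts(any_part, out_path=None):
    """Join all parts of a split file, in order, into a single file.

    `any_part` may be the path of any one part; the sibling parts are
    derived from its name. Assumes the parts are simple byte-wise splits.
    """
    part = Path(any_part)
    match = PART_RE.match(part.name)
    if match is None:
        raise ValueError(f"not a multi-part file name: {part.name}")

    base = match["base"]
    total = int(match["total"])
    parts = [part.with_name(f"{base}.part{i}of{total}") for i in range(1, total + 1)]

    missing = [p.name for p in parts if not p.exists()]
    if missing:
        raise FileNotFoundError(f"missing parts: {', '.join(missing)}")

    target = Path(out_path) if out_path else part.with_name(base)
    with open(target, "wb") as out:
        for p in parts:
            with open(p, "rb") as src:
                # Stream in chunks so large parts never need to fit in memory.
                shutil.copyfileobj(src, out)
    return target
```

For example, `concatenate_parts("Grafted-Wind-Elementals-2x70B.i1-Q6_K.gguf.part1of3")` would write `Grafted-Wind-Elementals-2x70B.i1-Q6_K.gguf` next to the parts, provided all three parts have been downloaded.
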
## Provided Quants

(sorted by size, not necessarily quality. IQ-quants are often preferable to similar-sized non-IQ quants)

| Link | Type | Size/GB | Notes |
|:-----|:-----|--------:|:------|
| [GGUF](https://huggingface.co/mradermacher/Grafted-Wind-Elementals-2x70B-i1-GGUF/resolve/main/Grafted-Wind-Elementals-2x70B.i1-IQ1_S.gguf) | i1-IQ1_S | 26.3 | for the desperate |
| [GGUF](https://huggingface.co/mradermacher/Grafted-Wind-Elementals-2x70B-i1-GGUF/resolve/main/Grafted-Wind-Elementals-2x70B.i1-IQ1_M.gguf) | i1-IQ1_M | 28.4 | mostly desperate |
| [GGUF](https://huggingface.co/mradermacher/Grafted-Wind-Elementals-2x70B-i1-GGUF/resolve/main/Grafted-Wind-Elementals-2x70B.i1-IQ2_XXS.gguf) | i1-IQ2_XXS | 33.4 |  |
| [GGUF](https://huggingface.co/mradermacher/Grafted-Wind-Elementals-2x70B-i1-GGUF/resolve/main/Grafted-Wind-Elementals-2x70B.i1-IQ2_XS.gguf) | i1-IQ2_XS | 37.2 |  |
| [GGUF](https://huggingface.co/mradermacher/Grafted-Wind-Elementals-2x70B-i1-GGUF/resolve/main/Grafted-Wind-Elementals-2x70B.i1-IQ2_S.gguf) | i1-IQ2_S | 38.4 |  |
| [GGUF](https://huggingface.co/mradermacher/Grafted-Wind-Elementals-2x70B-i1-GGUF/resolve/main/Grafted-Wind-Elementals-2x70B.i1-IQ2_M.gguf) | i1-IQ2_M | 42.0 |  |
| [GGUF](https://huggingface.co/mradermacher/Grafted-Wind-Elementals-2x70B-i1-GGUF/resolve/main/Grafted-Wind-Elementals-2x70B.i1-Q2_K.gguf) | i1-Q2_K | 46.3 | IQ3_XXS probably better |
| [GGUF](https://huggingface.co/mradermacher/Grafted-Wind-Elementals-2x70B-i1-GGUF/resolve/main/Grafted-Wind-Elementals-2x70B.i1-IQ3_XXS.gguf) | i1-IQ3_XXS | 48.6 | lower quality |
| [PART 1](https://huggingface.co/mradermacher/Grafted-Wind-Elementals-2x70B-i1-GGUF/resolve/main/Grafted-Wind-Elementals-2x70B.i1-IQ3_XS.gguf.part1of2) [PART 2](https://huggingface.co/mradermacher/Grafted-Wind-Elementals-2x70B-i1-GGUF/resolve/main/Grafted-Wind-Elementals-2x70B.i1-IQ3_XS.gguf.part2of2) | i1-IQ3_XS | 51.3 |  |
| [PART 1](https://huggingface.co/mradermacher/Grafted-Wind-Elementals-2x70B-i1-GGUF/resolve/main/Grafted-Wind-Elementals-2x70B.i1-IQ3_S.gguf.part1of2) [PART 2](https://huggingface.co/mradermacher/Grafted-Wind-Elementals-2x70B-i1-GGUF/resolve/main/Grafted-Wind-Elementals-2x70B.i1-IQ3_S.gguf.part2of2) | i1-IQ3_S | 54.6 | beats Q3_K* |
| [PART 1](https://huggingface.co/mradermacher/Grafted-Wind-Elementals-2x70B-i1-GGUF/resolve/main/Grafted-Wind-Elementals-2x70B.i1-Q3_K_S.gguf.part1of2) [PART 2](https://huggingface.co/mradermacher/Grafted-Wind-Elementals-2x70B-i1-GGUF/resolve/main/Grafted-Wind-Elementals-2x70B.i1-Q3_K_S.gguf.part2of2) | i1-Q3_K_S | 54.6 | IQ3_XS probably better |
| [PART 1](https://huggingface.co/mradermacher/Grafted-Wind-Elementals-2x70B-i1-GGUF/resolve/main/Grafted-Wind-Elementals-2x70B.i1-IQ3_M.gguf.part1of2) [PART 2](https://huggingface.co/mradermacher/Grafted-Wind-Elementals-2x70B-i1-GGUF/resolve/main/Grafted-Wind-Elementals-2x70B.i1-IQ3_M.gguf.part2of2) | i1-IQ3_M | 55.9 |  |
| [PART 1](https://huggingface.co/mradermacher/Grafted-Wind-Elementals-2x70B-i1-GGUF/resolve/main/Grafted-Wind-Elementals-2x70B.i1-Q3_K_M.gguf.part1of2) [PART 2](https://huggingface.co/mradermacher/Grafted-Wind-Elementals-2x70B-i1-GGUF/resolve/main/Grafted-Wind-Elementals-2x70B.i1-Q3_K_M.gguf.part2of2) | i1-Q3_K_M | 60.6 | IQ3_S probably better |
| [PART 1](https://huggingface.co/mradermacher/Grafted-Wind-Elementals-2x70B-i1-GGUF/resolve/main/Grafted-Wind-Elementals-2x70B.i1-Q3_K_L.gguf.part1of2) [PART 2](https://huggingface.co/mradermacher/Grafted-Wind-Elementals-2x70B-i1-GGUF/resolve/main/Grafted-Wind-Elementals-2x70B.i1-Q3_K_L.gguf.part2of2) | i1-Q3_K_L | 65.6 | IQ3_M probably better |
| [PART 1](https://huggingface.co/mradermacher/Grafted-Wind-Elementals-2x70B-i1-GGUF/resolve/main/Grafted-Wind-Elementals-2x70B.i1-IQ4_XS.gguf.part1of2) [PART 2](https://huggingface.co/mradermacher/Grafted-Wind-Elementals-2x70B-i1-GGUF/resolve/main/Grafted-Wind-Elementals-2x70B.i1-IQ4_XS.gguf.part2of2) | i1-IQ4_XS | 67.2 |  |
| [PART 1](https://huggingface.co/mradermacher/Grafted-Wind-Elementals-2x70B-i1-GGUF/resolve/main/Grafted-Wind-Elementals-2x70B.i1-Q4_0.gguf.part1of2) [PART 2](https://huggingface.co/mradermacher/Grafted-Wind-Elementals-2x70B-i1-GGUF/resolve/main/Grafted-Wind-Elementals-2x70B.i1-Q4_0.gguf.part2of2) | i1-Q4_0 | 71.0 | fast, low quality |
| [PART 1](https://huggingface.co/mradermacher/Grafted-Wind-Elementals-2x70B-i1-GGUF/resolve/main/Grafted-Wind-Elementals-2x70B.i1-Q4_K_S.gguf.part1of2) [PART 2](https://huggingface.co/mradermacher/Grafted-Wind-Elementals-2x70B-i1-GGUF/resolve/main/Grafted-Wind-Elementals-2x70B.i1-Q4_K_S.gguf.part2of2) | i1-Q4_K_S | 71.7 | optimal size/speed/quality |
| [PART 1](https://huggingface.co/mradermacher/Grafted-Wind-Elementals-2x70B-i1-GGUF/resolve/main/Grafted-Wind-Elementals-2x70B.i1-Q4_K_M.gguf.part1of2) [PART 2](https://huggingface.co/mradermacher/Grafted-Wind-Elementals-2x70B-i1-GGUF/resolve/main/Grafted-Wind-Elementals-2x70B.i1-Q4_K_M.gguf.part2of2) | i1-Q4_K_M | 76.0 | fast, recommended |
| [PART 1](https://huggingface.co/mradermacher/Grafted-Wind-Elementals-2x70B-i1-GGUF/resolve/main/Grafted-Wind-Elementals-2x70B.i1-Q5_K_S.gguf.part1of2) [PART 2](https://huggingface.co/mradermacher/Grafted-Wind-Elementals-2x70B-i1-GGUF/resolve/main/Grafted-Wind-Elementals-2x70B.i1-Q5_K_S.gguf.part2of2) | i1-Q5_K_S | 86.6 |  |
| [PART 1](https://huggingface.co/mradermacher/Grafted-Wind-Elementals-2x70B-i1-GGUF/resolve/main/Grafted-Wind-Elementals-2x70B.i1-Q5_K_M.gguf.part1of2) [PART 2](https://huggingface.co/mradermacher/Grafted-Wind-Elementals-2x70B-i1-GGUF/resolve/main/Grafted-Wind-Elementals-2x70B.i1-Q5_K_M.gguf.part2of2) | i1-Q5_K_M | 89.2 |  |
| [PART 1](https://huggingface.co/mradermacher/Grafted-Wind-Elementals-2x70B-i1-GGUF/resolve/main/Grafted-Wind-Elementals-2x70B.i1-Q6_K.gguf.part1of3) [PART 2](https://huggingface.co/mradermacher/Grafted-Wind-Elementals-2x70B-i1-GGUF/resolve/main/Grafted-Wind-Elementals-2x70B.i1-Q6_K.gguf.part2of3) [PART 3](https://huggingface.co/mradermacher/Grafted-Wind-Elementals-2x70B-i1-GGUF/resolve/main/Grafted-Wind-Elementals-2x70B.i1-Q6_K.gguf.part3of3) | i1-Q6_K | 103.2 | practically like static Q6_K |

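Since the table is sorted by size, one simple way to shortlist a quant is to take the largest file that fits a given size budget. The sketch below copies the sizes from the table above; the function name `largest_quant_within` and the idea of a single budget in GB are illustrative assumptions. File size is only a lower bound on what you need at runtime, and the Notes column may point to a better choice at a similar size.

```python
# Sizes in GB, copied from the "Provided Quants" table.
QUANT_SIZES_GB = {
    "i1-IQ1_S": 26.3, "i1-IQ1_M": 28.4, "i1-IQ2_XXS": 33.4,
    "i1-IQ2_XS": 37.2, "i1-IQ2_S": 38.4, "i1-IQ2_M": 42.0,
    "i1-Q2_K": 46.3, "i1-IQ3_XXS": 48.6, "i1-IQ3_XS": 51.3,
    "i1-IQ3_S": 54.6, "i1-Q3_K_S": 54.6, "i1-IQ3_M": 55.9,
    "i1-Q3_K_M": 60.6, "i1-Q3_K_L": 65.6, "i1-IQ4_XS": 67.2,
    "i1-Q4_0": 71.0, "i1-Q4_K_S": 71.7, "i1-Q4_K_M": 76.0,
    "i1-Q5_K_S": 86.6, "i1-Q5_K_M": 89.2, "i1-Q6_K": 103.2,
}


def largest_quant_within(budget_gb, sizes=QUANT_SIZES_GB):
    """Return the name of the largest quant whose file size fits the budget.

    Returns None if nothing fits. On equal sizes, the entry listed first
    in `sizes` wins.
    """
    fitting = {name: size for name, size in sizes.items() if size <= budget_gb}
    if not fitting:
        return None
    return max(fitting, key=fitting.get)


print(largest_quant_within(50.0))  # i1-IQ3_XXS
print(largest_quant_within(80.0))  # i1-Q4_K_M
```
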
Here is a handy graph by ikawrakow comparing some lower-quality quant
types (lower is better):

![image.png](https://www.nethype.de/huggingface_embed/quantpplgraph.png)

And here are Artefact2's thoughts on the matter:
https://gist.github.com/Artefact2/b5f810600771265fc1e39442288e8ec9

## FAQ / Model Request

See https://huggingface.co/mradermacher/model_requests for answers to
questions you might have, or to request that another model be quantized.

## Thanks

I thank my company, [nethype GmbH](https://www.nethype.de/), for letting
me use its servers and providing upgrades to my workstation to enable
this work in my free time.

<!-- end -->