lemon07r commited on
Commit
318afe2
1 Parent(s): e730ba8

Upload README.md

Browse files
Files changed (1) hide show
  1. README.md +47 -3
README.md CHANGED
@@ -1,3 +1,47 @@
1
- ---
2
- license: gemma
3
- ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ base_model:
3
+ - lemon07r/Gemma-2-Ataraxy-Advanced-9B
4
+ - nbeerbower/Gemma2-Gutenberg-Doppel-9B
5
+ library_name: transformers
6
+ tags:
7
+ - mergekit
8
+ - merge
9
+
10
+ ---
11
+ # Gemma-2-Ataraxy-v3-Advanced-9B
12
+
13
+ This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).
14
+
15
+ ## Merge Details
16
+ ### Merge Method
17
+
18
+ This model was merged using the SLERP merge method.
19
+
20
+ ### Models Merged
21
+
22
+ The following models were included in the merge:
23
+ * [lemon07r/Gemma-2-Ataraxy-Advanced-9B](https://huggingface.co/lemon07r/Gemma-2-Ataraxy-Advanced-9B)
24
+ * [nbeerbower/Gemma2-Gutenberg-Doppel-9B](https://huggingface.co/nbeerbower/Gemma2-Gutenberg-Doppel-9B)
25
+
26
+ ### Configuration
27
+
28
+ The following YAML configuration was used to produce this model:
29
+
30
+ ```yaml
31
+ base_model: lemon07r/Gemma-2-Ataraxy-Advanced-9B
32
+ dtype: bfloat16
33
+ merge_method: slerp
34
+ parameters:
35
+ t:
36
+ - filter: self_attn
37
+ value: [0.0, 0.5, 0.3, 0.7, 1.0]
38
+ - filter: mlp
39
+ value: [1.0, 0.5, 0.7, 0.3, 0.0]
40
+ - value: 0.5
41
+ slices:
42
+ - sources:
43
+ - layer_range: [0, 42]
44
+ model: nbeerbower/Gemma2-Gutenberg-Doppel-9B
45
+ - layer_range: [0, 42]
46
+ model: lemon07r/Gemma-2-Ataraxy-Advanced-9B
47
+ ```