Triangle104
commited on
Commit
•
b421183
1
Parent(s):
ed97045
Update README.md
Browse files
README.md
CHANGED
@@ -1,5 +1,5 @@
|
|
1 |
---
|
2 |
-
license:
|
3 |
library_name: transformers
|
4 |
base_model: nbeerbower/Mistral-Small-Drummer-22B
|
5 |
datasets:
|
@@ -26,7 +26,8 @@ model-index:
|
|
26 |
value: 63.31
|
27 |
name: strict accuracy
|
28 |
source:
|
29 |
-
url:
|
|
|
30 |
name: Open LLM Leaderboard
|
31 |
- task:
|
32 |
type: text-generation
|
@@ -41,7 +42,8 @@ model-index:
|
|
41 |
value: 40.12
|
42 |
name: normalized accuracy
|
43 |
source:
|
44 |
-
url:
|
|
|
45 |
name: Open LLM Leaderboard
|
46 |
- task:
|
47 |
type: text-generation
|
@@ -56,7 +58,8 @@ model-index:
|
|
56 |
value: 16.69
|
57 |
name: exact match
|
58 |
source:
|
59 |
-
url:
|
|
|
60 |
name: Open LLM Leaderboard
|
61 |
- task:
|
62 |
type: text-generation
|
@@ -71,7 +74,8 @@ model-index:
|
|
71 |
value: 12.42
|
72 |
name: acc_norm
|
73 |
source:
|
74 |
-
url:
|
|
|
75 |
name: Open LLM Leaderboard
|
76 |
- task:
|
77 |
type: text-generation
|
@@ -86,7 +90,8 @@ model-index:
|
|
86 |
value: 9.8
|
87 |
name: acc_norm
|
88 |
source:
|
89 |
-
url:
|
|
|
90 |
name: Open LLM Leaderboard
|
91 |
- task:
|
92 |
type: text-generation
|
@@ -103,7 +108,8 @@ model-index:
|
|
103 |
value: 34.39
|
104 |
name: accuracy
|
105 |
source:
|
106 |
-
url:
|
|
|
107 |
name: Open LLM Leaderboard
|
108 |
---
|
109 |
|
@@ -111,6 +117,28 @@ model-index:
|
|
111 |
This model was converted to GGUF format from [`nbeerbower/Mistral-Small-Drummer-22B`](https://huggingface.co/nbeerbower/Mistral-Small-Drummer-22B) using llama.cpp via the ggml.ai's [GGUF-my-repo](https://huggingface.co/spaces/ggml-org/gguf-my-repo) space.
|
112 |
Refer to the [original model card](https://huggingface.co/nbeerbower/Mistral-Small-Drummer-22B) for more details on the model.
|
113 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
114 |
## Use with llama.cpp
|
115 |
Install llama.cpp through brew (works on Mac and Linux)
|
116 |
|
@@ -149,4 +177,4 @@ Step 3: Run inference through the main binary.
|
|
149 |
or
|
150 |
```
|
151 |
./llama-server --hf-repo Triangle104/Mistral-Small-Drummer-22B-Q8_0-GGUF --hf-file mistral-small-drummer-22b-q8_0.gguf -c 2048
|
152 |
-
```
|
|
|
1 |
---
|
2 |
+
license: apache-2.0
|
3 |
library_name: transformers
|
4 |
base_model: nbeerbower/Mistral-Small-Drummer-22B
|
5 |
datasets:
|
|
|
26 |
value: 63.31
|
27 |
name: strict accuracy
|
28 |
source:
|
29 |
+
url: >-
|
30 |
+
https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=nbeerbower/Mistral-Small-Drummer-22B
|
31 |
name: Open LLM Leaderboard
|
32 |
- task:
|
33 |
type: text-generation
|
|
|
42 |
value: 40.12
|
43 |
name: normalized accuracy
|
44 |
source:
|
45 |
+
url: >-
|
46 |
+
https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=nbeerbower/Mistral-Small-Drummer-22B
|
47 |
name: Open LLM Leaderboard
|
48 |
- task:
|
49 |
type: text-generation
|
|
|
58 |
value: 16.69
|
59 |
name: exact match
|
60 |
source:
|
61 |
+
url: >-
|
62 |
+
https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=nbeerbower/Mistral-Small-Drummer-22B
|
63 |
name: Open LLM Leaderboard
|
64 |
- task:
|
65 |
type: text-generation
|
|
|
74 |
value: 12.42
|
75 |
name: acc_norm
|
76 |
source:
|
77 |
+
url: >-
|
78 |
+
https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=nbeerbower/Mistral-Small-Drummer-22B
|
79 |
name: Open LLM Leaderboard
|
80 |
- task:
|
81 |
type: text-generation
|
|
|
90 |
value: 9.8
|
91 |
name: acc_norm
|
92 |
source:
|
93 |
+
url: >-
|
94 |
+
https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=nbeerbower/Mistral-Small-Drummer-22B
|
95 |
name: Open LLM Leaderboard
|
96 |
- task:
|
97 |
type: text-generation
|
|
|
108 |
value: 34.39
|
109 |
name: accuracy
|
110 |
source:
|
111 |
+
url: >-
|
112 |
+
https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=nbeerbower/Mistral-Small-Drummer-22B
|
113 |
name: Open LLM Leaderboard
|
114 |
---
|
115 |
|
|
|
117 |
This model was converted to GGUF format from [`nbeerbower/Mistral-Small-Drummer-22B`](https://huggingface.co/nbeerbower/Mistral-Small-Drummer-22B) using llama.cpp via the ggml.ai's [GGUF-my-repo](https://huggingface.co/spaces/ggml-org/gguf-my-repo) space.
|
118 |
Refer to the [original model card](https://huggingface.co/nbeerbower/Mistral-Small-Drummer-22B) for more details on the model.
|
119 |
|
120 |
+
---
|
121 |
+
Model details:
|
122 |
+
-
|
123 |
+
mistralai/Mistral-Small-Instruct-2409 finetuned on jondurbin/gutenberg-dpo-v0.1 and nbeerbower/gutenberg2-dpo.
|
124 |
+
|
125 |
+
Method
|
126 |
+
ORPO tuned with 2xA40 on RunPod for 1 epoch.
|
127 |
+
|
128 |
+
learning_rate=4e-6,
|
129 |
+
lr_scheduler_type="linear",
|
130 |
+
beta=0.1,
|
131 |
+
per_device_train_batch_size=4,
|
132 |
+
per_device_eval_batch_size=4,
|
133 |
+
gradient_accumulation_steps=8,
|
134 |
+
optim="paged_adamw_8bit",
|
135 |
+
num_train_epochs=1,
|
136 |
+
|
137 |
+
Dataset was prepared using Mistral-Small Instruct format.
|
138 |
+
|
139 |
+
Fine-tune Llama 3 with ORPO
|
140 |
+
|
141 |
+
---
|
142 |
## Use with llama.cpp
|
143 |
Install llama.cpp through brew (works on Mac and Linux)
|
144 |
|
|
|
177 |
or
|
178 |
```
|
179 |
./llama-server --hf-repo Triangle104/Mistral-Small-Drummer-22B-Q8_0-GGUF --hf-file mistral-small-drummer-22b-q8_0.gguf -c 2048
|
180 |
+
```
|