Triangle104 committed on
Commit
b421183
1 Parent(s): ed97045

Update README.md

Files changed (1)
  1. README.md +36 -8
README.md CHANGED
@@ -1,5 +1,5 @@
  ---
- license: other
  library_name: transformers
  base_model: nbeerbower/Mistral-Small-Drummer-22B
  datasets:
@@ -26,7 +26,8 @@ model-index:
  value: 63.31
  name: strict accuracy
  source:
- url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=nbeerbower/Mistral-Small-Drummer-22B
  name: Open LLM Leaderboard
  - task:
  type: text-generation
@@ -41,7 +42,8 @@ model-index:
  value: 40.12
  name: normalized accuracy
  source:
- url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=nbeerbower/Mistral-Small-Drummer-22B
  name: Open LLM Leaderboard
  - task:
  type: text-generation
@@ -56,7 +58,8 @@ model-index:
  value: 16.69
  name: exact match
  source:
- url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=nbeerbower/Mistral-Small-Drummer-22B
  name: Open LLM Leaderboard
  - task:
  type: text-generation
@@ -71,7 +74,8 @@ model-index:
  value: 12.42
  name: acc_norm
  source:
- url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=nbeerbower/Mistral-Small-Drummer-22B
  name: Open LLM Leaderboard
  - task:
  type: text-generation
@@ -86,7 +90,8 @@ model-index:
  value: 9.8
  name: acc_norm
  source:
- url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=nbeerbower/Mistral-Small-Drummer-22B
  name: Open LLM Leaderboard
  - task:
  type: text-generation
@@ -103,7 +108,8 @@ model-index:
  value: 34.39
  name: accuracy
  source:
- url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=nbeerbower/Mistral-Small-Drummer-22B
  name: Open LLM Leaderboard
  ---

@@ -111,6 +117,28 @@ model-index:
  This model was converted to GGUF format from [`nbeerbower/Mistral-Small-Drummer-22B`](https://huggingface.co/nbeerbower/Mistral-Small-Drummer-22B) using llama.cpp via ggml.ai's [GGUF-my-repo](https://huggingface.co/spaces/ggml-org/gguf-my-repo) space.
  Refer to the [original model card](https://huggingface.co/nbeerbower/Mistral-Small-Drummer-22B) for more details on the model.

  ## Use with llama.cpp
  Install llama.cpp through brew (works on Mac and Linux)

@@ -149,4 +177,4 @@ Step 3: Run inference through the main binary.
  or
  ```
  ./llama-server --hf-repo Triangle104/Mistral-Small-Drummer-22B-Q8_0-GGUF --hf-file mistral-small-drummer-22b-q8_0.gguf -c 2048
- ```
 
  ---
+ license: apache-2.0
  library_name: transformers
  base_model: nbeerbower/Mistral-Small-Drummer-22B
  datasets:
 
  value: 63.31
  name: strict accuracy
  source:
+ url: >-
+ https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=nbeerbower/Mistral-Small-Drummer-22B
  name: Open LLM Leaderboard
  - task:
  type: text-generation
 
  value: 40.12
  name: normalized accuracy
  source:
+ url: >-
+ https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=nbeerbower/Mistral-Small-Drummer-22B
  name: Open LLM Leaderboard
  - task:
  type: text-generation
 
  value: 16.69
  name: exact match
  source:
+ url: >-
+ https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=nbeerbower/Mistral-Small-Drummer-22B
  name: Open LLM Leaderboard
  - task:
  type: text-generation
 
  value: 12.42
  name: acc_norm
  source:
+ url: >-
+ https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=nbeerbower/Mistral-Small-Drummer-22B
  name: Open LLM Leaderboard
  - task:
  type: text-generation
 
  value: 9.8
  name: acc_norm
  source:
+ url: >-
+ https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=nbeerbower/Mistral-Small-Drummer-22B
  name: Open LLM Leaderboard
  - task:
  type: text-generation
 
  value: 34.39
  name: accuracy
  source:
+ url: >-
+ https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=nbeerbower/Mistral-Small-Drummer-22B
  name: Open LLM Leaderboard
  ---

 
  This model was converted to GGUF format from [`nbeerbower/Mistral-Small-Drummer-22B`](https://huggingface.co/nbeerbower/Mistral-Small-Drummer-22B) using llama.cpp via ggml.ai's [GGUF-my-repo](https://huggingface.co/spaces/ggml-org/gguf-my-repo) space.
  Refer to the [original model card](https://huggingface.co/nbeerbower/Mistral-Small-Drummer-22B) for more details on the model.

+ ---
+ Model details:
+ -
+ mistralai/Mistral-Small-Instruct-2409 finetuned on jondurbin/gutenberg-dpo-v0.1 and nbeerbower/gutenberg2-dpo.
+
+ Method
+ ORPO tuned with 2xA40 on RunPod for 1 epoch.
+
+ learning_rate=4e-6,
+ lr_scheduler_type="linear",
+ beta=0.1,
+ per_device_train_batch_size=4,
+ per_device_eval_batch_size=4,
+ gradient_accumulation_steps=8,
+ optim="paged_adamw_8bit",
+ num_train_epochs=1,
+
+ Dataset was prepared using Mistral-Small Instruct format.
+
+ Fine-tune Llama 3 with ORPO
+
+ ---
  ## Use with llama.cpp
  Install llama.cpp through brew (works on Mac and Linux)

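
Note on the added "Method" lines: the hyperparameters are listed as bare keyword arguments, so the batch size per optimizer step is not stated directly. The sketch below collects the values from the README into a plain dictionary and works out that number. It assumes both A40 GPUs ran data-parallel, which the README does not say. The names `orpo_hparams`, `NUM_GPUS`, and `effective_batch_size` are hypothetical, introduced only for this illustration. The keywords resemble arguments to a trainer configuration such as TRL's `ORPOConfig`, but the README does not name the library, so that is an assumption too.

```python
# Hyperparameters copied verbatim from the added "Method" lines of the README.
orpo_hparams = {
    "learning_rate": 4e-6,
    "lr_scheduler_type": "linear",
    "beta": 0.1,
    "per_device_train_batch_size": 4,
    "per_device_eval_batch_size": 4,
    "gradient_accumulation_steps": 8,
    "optim": "paged_adamw_8bit",
    "num_train_epochs": 1,
}

# Assumption (not stated in the README): both A40 GPUs ran data-parallel.
NUM_GPUS = 2


def effective_batch_size(hparams, num_devices):
    """Training examples that contribute to one optimizer step."""
    return (
        hparams["per_device_train_batch_size"]
        * hparams["gradient_accumulation_steps"]
        * num_devices
    )


print(effective_batch_size(orpo_hparams, NUM_GPUS))  # 4 * 8 * 2 = 64
```

If the run used a single process instead, the same function with `num_devices=1` gives 32.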
 
  or
  ```
  ./llama-server --hf-repo Triangle104/Mistral-Small-Drummer-22B-Q8_0-GGUF --hf-file mistral-small-drummer-22b-q8_0.gguf -c 2048
+ ```
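
Note on the `llama-server` line in the last hunk: for scripted use, the same invocation can be assembled as an argument list rather than a hand-typed string. This is a minimal sketch using only the flags that appear in the README (`--hf-repo`, `--hf-file`, `-c`). The helper name `llama_server_command` and its parameter names are hypothetical, and the sketch only builds the command without running it.

```python
import shlex


def llama_server_command(hf_repo, hf_file, ctx_size=2048, binary="./llama-server"):
    """Build the argv list for the llama-server call shown in the README."""
    return [binary, "--hf-repo", hf_repo, "--hf-file", hf_file, "-c", str(ctx_size)]


argv = llama_server_command(
    "Triangle104/Mistral-Small-Drummer-22B-Q8_0-GGUF",
    "mistral-small-drummer-22b-q8_0.gguf",
)

# Print the shell-quoted form so it can be compared with the README line.
print(shlex.join(argv))
```

With llama.cpp installed, the list can be handed to `subprocess.run(argv, check=True)` to start the server. Passing a list avoids shell quoting problems if a repository or file name ever contains unusual characters.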