Update README.md
README.md
CHANGED
@@ -101,7 +101,23 @@ I use [mergekit](https://github.com/cg123/mergekit) for all the manipulation tol
 
 ## Some scoring I done myself
 
-
+
+![image/png](https://cdn-uploads.huggingface.co/production/uploads/63ab1241ad514ca8d1430003/5aDYq-V0XWUsqbLH2ehPr.png)
+
+hf-causal-experimental (pretrained=/content/drive/MyDrive/Mistral-11B-OmniMix-bf16), limit: None, provide_description: False, num_fewshot: 0, batch_size: 4
+| Task |Version| Metric |Value | |Stderr|
+|-------------|------:|--------|-----:|---|-----:|
+|arc_challenge| 0|acc |0.5580|± |0.0145|
+| | |acc_norm|0.5819|± |0.0144|
+|arc_easy | 0|acc |0.8300|± |0.0077|
+| | |acc_norm|0.8211|± |0.0079|
+|hellaswag | 0|acc |0.6372|± |0.0048|
+| | |acc_norm|0.8209|± |0.0038|
+|piqa | 0|acc |0.8145|± |0.0091|
+| | |acc_norm|0.8286|± |0.0088|
+|truthfulqa_mc| 1|mc1 |0.3978|± |0.0171|
+| | |mc2 |0.5680|± |0.0155|
+|winogrande | 0|acc |0.7427|± |0.0123|
 
 ## Others
 
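The Stderr column added in this change can be sanity-checked against the accuracy values. The sketch below is not part of the commit. It assumes the reported standard error follows the usual binomial form `sqrt(acc * (1 - acc) / n)`, which is an assumption about how the harness computed it, and inverts that formula to estimate how many examples each task was scored on. The names `ACC_ROWS`, `implied_sample_size`, and `binomial_stderr` are hypothetical helpers introduced only for illustration. Only the binary `acc` rows are used, since `mc2` is not a simple hit-or-miss accuracy.

```python
import math

# (task, accuracy, reported stderr), copied from the "acc" rows of the table.
ACC_ROWS = [
    ("arc_challenge", 0.5580, 0.0145),
    ("arc_easy", 0.8300, 0.0077),
    ("hellaswag", 0.6372, 0.0048),
    ("piqa", 0.8145, 0.0091),
    ("winogrande", 0.7427, 0.0123),
]


def implied_sample_size(acc: float, stderr: float) -> float:
    """Invert stderr = sqrt(acc * (1 - acc) / n) for n (binomial assumption)."""
    return acc * (1.0 - acc) / stderr ** 2


def binomial_stderr(acc: float, n: float) -> float:
    """Standard error of an accuracy estimated from n independent examples."""
    return math.sqrt(acc * (1.0 - acc) / n)


if __name__ == "__main__":
    for task, acc, se in ACC_ROWS:
        n = implied_sample_size(acc, se)
        print(f"{task:14s} acc={acc:.4f} stderr={se:.4f} implied n ~ {n:.0f}")
```

Because the table rounds both columns to four decimals, the implied sizes are only approximate. They are useful mainly as a check that a result was computed on the full task (`limit: None`) rather than on a small subset.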