Tasks |
Version |
Filter |
n-shot |
Metric |
|
Value |
|
Stderr |
arc_challenge |
1 |
none |
0 |
acc |
↑ |
0.2176 |
± |
0.0121 |
|
|
none |
0 |
acc_norm |
↑ |
0.2628 |
± |
0.0129 |
arc_easy |
1 |
none |
0 |
acc |
↑ |
0.2584 |
± |
0.0090 |
|
|
none |
0 |
acc_norm |
↑ |
0.2567 |
± |
0.0090 |
boolq |
2 |
none |
0 |
acc |
↑ |
0.4171 |
± |
0.0086 |
hellaswag |
1 |
none |
0 |
acc |
↑ |
0.2565 |
± |
0.0044 |
|
|
none |
0 |
acc_norm |
↑ |
0.2639 |
± |
0.0044 |
openbookqa |
1 |
none |
0 |
acc |
↑ |
0.1620 |
± |
0.0165 |
|
|
none |
0 |
acc_norm |
↑ |
0.2800 |
± |
0.0201 |
piqa |
1 |
none |
0 |
acc |
↑ |
0.5419 |
± |
0.0116 |
|
|
none |
0 |
acc_norm |
↑ |
0.5234 |
± |
0.0117 |
winogrande |
1 |
none |
0 |
acc |
↑ |
0.5272 |
± |
0.0140 |
Unable to determine this model's library. Check the
docs
.