BramVanroy leaderboard-pr-bot commited on
Commit
d5c9a22
1 Parent(s): b7b4479

Adding Evaluation Results (#1)

Browse files

- Adding Evaluation Results (c7e8e6f8d36725e4baee9067ef1d0281d2513159)


Co-authored-by: Open LLM Leaderboard PR Bot <leaderboard-pr-bot@users.noreply.huggingface.co>

Files changed (1) hide show
  1. README.md +19 -6
README.md CHANGED
@@ -1,22 +1,22 @@
1
  ---
 
 
2
  license: mit
3
- base_model: BramVanroy/fietje-2
4
  tags:
5
  - trl
6
  - fietje
7
  - alignment-handbook
8
  - sft
 
9
  datasets:
10
  - BramVanroy/ultrachat_200k_dutch
11
  - BramVanroy/no_robots_dutch
12
  - BramVanroy/belebele_dutch
 
 
13
  model-index:
14
  - name: fietje-2-instruct
15
  results: []
16
- pipeline_tag: text-generation
17
- inference: false
18
- language:
19
- - nl
20
  ---
21
 
22
  <p align="center" style="margin:0;padding:0">
@@ -93,4 +93,17 @@ The following hyperparameters were used during training:
93
  - Transformers 4.39.1
94
  - Pytorch 2.1.2+cu121
95
  - Datasets 2.18.0
96
- - Tokenizers 0.15.2
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
  ---
2
+ language:
3
+ - nl
4
  license: mit
 
5
  tags:
6
  - trl
7
  - fietje
8
  - alignment-handbook
9
  - sft
10
+ base_model: BramVanroy/fietje-2
11
  datasets:
12
  - BramVanroy/ultrachat_200k_dutch
13
  - BramVanroy/no_robots_dutch
14
  - BramVanroy/belebele_dutch
15
+ pipeline_tag: text-generation
16
+ inference: false
17
  model-index:
18
  - name: fietje-2-instruct
19
  results: []
 
 
 
 
20
  ---
21
 
22
  <p align="center" style="margin:0;padding:0">
 
93
  - Transformers 4.39.1
94
  - Pytorch 2.1.2+cu121
95
  - Datasets 2.18.0
96
+ - Tokenizers 0.15.2
97
+ # [Open LLM Leaderboard Evaluation Results](https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard)
98
+ Detailed results can be found [here](https://huggingface.co/datasets/open-llm-leaderboard/details_BramVanroy__fietje-2-instruct)
99
+
100
+ | Metric |Value|
101
+ |-------------------|----:|
102
+ |Avg. |10.20|
103
+ |IFEval (0-Shot) |27.90|
104
+ |BBH (3-Shot) |17.57|
105
+ |MATH Lvl 5 (4-Shot)| 0.53|
106
+ |GPQA (0-shot) | 0.00|
107
+ |MuSR (0-shot) | 2.91|
108
+ |MMLU-PRO (5-shot) |12.26|
109
+