Update utils.py
utils.py
CHANGED
@@ -29,10 +29,12 @@ COLUMN_NAMES = MODEL_INFO
 
 LEADERBOARD_INTRODUCTION = """# MMLU-Pro Leaderboard
 
-MMLU-Pro
+Welcome to the MMLU-Pro leaderboard, showcasing the performance of various advanced language models on the MMLU-Pro dataset. The MMLU-Pro dataset is an enhanced version of the original MMLU, specifically engineered to offer a more rigorous and realistic evaluation environment..
 
-
-
+The MMLU-Pro dataset consists of approximately 12,000 intricate questions that challenge the comprehension and reasoning abilities of LLMs. Below you can find the accuracies of different models tested on this dataset.
+
+For detailed information about the dataset, visit our page on Hugging Face: MMLU-Pro at Hugging Face. If you are interested in replicating these results or wish to evaluate your models using our dataset, access our evaluation scripts available on GitHub: TIGER-AI-Lab/MMLU-Pro.
+"""
 
 TABLE_INTRODUCTION = """
 """