ubowang commited on
Commit
11c0693
1 Parent(s): bb25558

Update utils.py

Browse files
Files changed (1) hide show
  1. utils.py +5 -3
utils.py CHANGED
@@ -29,10 +29,12 @@ COLUMN_NAMES = MODEL_INFO
29
 
30
  LEADERBOARD_INTRODUCTION = """# MMLU-Pro Leaderboard
31
 
32
- MMLU-Pro dataset, a more robust and challenging massive multi-task understanding dataset tailored to more rigorously benchmark large language models' capabilities. This dataset contains 12K complex questions across various disciplines. The following are the accuracies of various models evaluated on MMLU-Pro.
33
 
34
- Our dataset is available at [https://huggingface.co/datasets/TIGER-Lab/MMLU-Pro](https://huggingface.co/datasets/TIGER-Lab/MMLU-Pro). If you want to reproduce our results or evaluate your own models on MMLU-Pro, please check out our evaluation scripts at [https://github.com/TIGER-AI-Lab/MMLU-Pro](https://github.com/TIGER-AI-Lab/MMLU-Pro).
35
- """
 
 
36
 
37
  TABLE_INTRODUCTION = """
38
  """
 
29
 
30
  LEADERBOARD_INTRODUCTION = """# MMLU-Pro Leaderboard
31
 
32
+ Welcome to the MMLU-Pro leaderboard, showcasing the performance of various advanced language models on the MMLU-Pro dataset. The MMLU-Pro dataset is an enhanced version of the original MMLU, specifically engineered to offer a more rigorous and realistic evaluation environment..
33
 
34
+ The MMLU-Pro dataset consists of approximately 12,000 intricate questions that challenge the comprehension and reasoning abilities of LLMs. Below you can find the accuracies of different models tested on this dataset.
35
+
36
+ For detailed information about the dataset, visit our page on Hugging Face: MMLU-Pro at Hugging Face. If you are interested in replicating these results or wish to evaluate your models using our dataset, access our evaluation scripts available on GitHub: TIGER-AI-Lab/MMLU-Pro.
37
+ """
38
 
39
  TABLE_INTRODUCTION = """
40
  """