Spaces:

TIGER-Lab
/

MMLU-Pro

Running on CPU Upgrade

ubowang commited on May 20

Commit

11c0693

•

1 Parent(s): bb25558

Update utils.py

Files changed (1) hide show

utils.py CHANGED Viewed

@@ -29,10 +29,12 @@ COLUMN_NAMES = MODEL_INFO
 LEADERBOARD_INTRODUCTION = """# MMLU-Pro Leaderboard
-MMLU-Pro dataset, a more robust and challenging massive multi-task understanding dataset tailored to more rigorously benchmark large language models' capabilities. This dataset contains 12K complex questions across various disciplines. The following are the accuracies of various models evaluated on MMLU-Pro.
-Our dataset is available at [https://huggingface.co/datasets/TIGER-Lab/MMLU-Pro](https://huggingface.co/datasets/TIGER-Lab/MMLU-Pro). If you want to reproduce our results or evaluate your own models on MMLU-Pro, please check out our evaluation scripts at [https://github.com/TIGER-AI-Lab/MMLU-Pro](https://github.com/TIGER-AI-Lab/MMLU-Pro).
-    """
 TABLE_INTRODUCTION = """
     """

 LEADERBOARD_INTRODUCTION = """# MMLU-Pro Leaderboard
+Welcome to the MMLU-Pro leaderboard, showcasing the performance of various advanced language models on the MMLU-Pro dataset. The MMLU-Pro dataset is an enhanced version of the original MMLU, specifically engineered to offer a more rigorous and realistic evaluation environment..
+The MMLU-Pro dataset consists of approximately 12,000 intricate questions that challenge the comprehension and reasoning abilities of LLMs. Below you can find the accuracies of different models tested on this dataset.
+For detailed information about the dataset, visit our page on Hugging Face: MMLU-Pro at Hugging Face. If you are interested in replicating these results or wish to evaluate your models using our dataset, access our evaluation scripts available on GitHub: TIGER-AI-Lab/MMLU-Pro.
+"""
 TABLE_INTRODUCTION = """
     """