KlaudiaTH
Fixed README
aa451eb
|
raw
history blame
No virus
676 Bytes
---
title: Leaderboard
emoji: πŸ‘
colorFrom: blue
colorTo: blue
sdk: gradio
sdk_version: 4.19.2
app_file: app.py
pinned: false
license: unknown
---
This is the OpenGPT-X mutlilingual leaderboard source code repository.
The leaderboard aims to provied an overview of LLM performance over various languages.
The basic task set consists of MMLU, ARC, HellaSwag, GSM8k, TruthfulQA and belebele.
To make the results comparable to the Open LLM leaderboard (https://huggingface.co/open-llm-leaderboard) we selected the former five tasks based on our internal machine translations of the English base tasks, in addition to the high-quality multilingual benchmark belebele by Meta.