This model is instruction-finetuned on the Open-Platypus dataset: https://huggingface.co/datasets/garage-bAInd/Open-Platypus
Open LLM Leaderboard Evaluation Results
Detailed results can be found here
| Metric | Value |
|---|---|
| Avg. | 53.64 |
| ARC (25-shot) | 62.37 |
| HellaSwag (10-shot) | 85.08 |
| MMLU (5-shot) | 63.79 |
| TruthfulQA (0-shot) | 47.33 |
| Winogrande (5-shot) | 77.66 |
| GSM8K (5-shot) | 17.29 |
| DROP (3-shot) | 21.93 |
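
The "Avg." row appears to be the unweighted arithmetic mean of the seven benchmark scores listed above. The short sketch below checks that arithmetic; the `scores` dictionary and variable names are illustrative only and are not part of any leaderboard tooling.

```python
# Benchmark scores copied from the table above (names are illustrative).
scores = {
    "ARC (25-shot)": 62.37,
    "HellaSwag (10-shot)": 85.08,
    "MMLU (5-shot)": 63.79,
    "TruthfulQA (0-shot)": 47.33,
    "Winogrande (5-shot)": 77.66,
    "GSM8K (5-shot)": 17.29,
    "DROP (3-shot)": 21.93,
}

# Assumption: the reported average is a plain mean with equal weights.
average = sum(scores.values()) / len(scores)
print(f"Avg. = {average:.2f}")  # prints "Avg. = 53.64"
```

The result matches the reported 53.64, which is consistent with an equal-weight mean across all seven tasks, including the low GSM8K and DROP scores that pull the average down.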
Support My Work
Building LLMs takes time and resources; if you find my work interesting, your support would be epic!