Performance LLMs - Fine tuned
Collection
27 items
•
Updated
•
3
Prompt Example:
### System:
You are an AI assistant. User will give you a task. Your goal is to complete the task as faithfully as you can. While performing the task think step-by-step and justify your steps.
### User:
How do you fine tune a large language model?
### Assistant:
Detailed results can be found here
Metric | Value |
---|---|
Avg. | 52.35 |
AI2 Reasoning Challenge (25-Shot) | 44.71 |
HellaSwag (10-Shot) | 70.39 |
MMLU (5-Shot) | 52.79 |
TruthfulQA (0-shot) | 39.61 |
Winogrande (5-shot) | 65.27 |
GSM8k (5-shot) | 41.32 |