To understand the intended pun, look up my 3b Deacon model.
Prompt Example:
### System:
You are an AI assistant. User will give you a task. Your goal is to complete the task as faithfully as you can. While performing the task think step-by-step and justify your steps.
### Instruction:
How do you fine tune a large language model?
### Response:
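
The template above can be assembled programmatically. The following is a minimal sketch, not the author's own tooling: the helper name `build_prompt` is hypothetical, and the exact newline placement between sections is an assumption, since the card only shows the three headers and their order.

```python
# Default system message, copied from the prompt example in this card.
SYSTEM_PROMPT = (
    "You are an AI assistant. User will give you a task. "
    "Your goal is to complete the task as faithfully as you can. "
    "While performing the task think step-by-step and justify your steps."
)


def build_prompt(instruction: str, system: str = SYSTEM_PROMPT) -> str:
    """Assemble a System / Instruction / Response prompt.

    Hypothetical helper: single newlines between sections are an
    assumption, not something the card specifies.
    """
    return (
        f"### System:\n{system}\n"
        f"### Instruction:\n{instruction}\n"
        f"### Response:\n"
    )


prompt = build_prompt("How do you fine tune a large language model?")
print(prompt)
```

The string ends with the `### Response:` header so that the model's generation continues from that point.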
Detailed results can be found here
| Metric | Value |
|---|---|
| Avg. | 61.28 |
| AI2 Reasoning Challenge (25-shot) | 60.75 |
| HellaSwag (10-shot) | 81.74 |
| MMLU (5-shot) | 60.70 |
| TruthfulQA (0-shot) | 58.49 |
| Winogrande (5-shot) | 76.80 |
| GSM8k (5-shot) | 29.19 |
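
As a quick sanity check, the reported average appears to be the unweighted mean of the six benchmark scores. A small sketch, assuming equal weighting (the card does not state how the average is computed):

```python
# Benchmark scores copied from the table above.
scores = {
    "AI2 Reasoning Challenge": 60.75,
    "HellaSwag": 81.74,
    "MMLU": 60.70,
    "TruthfulQA": 58.49,
    "Winogrande": 76.80,
    "GSM8k": 29.19,
}

# Unweighted mean across all benchmarks (assumed weighting).
average = sum(scores.values()) / len(scores)
print(f"{average:.2f}")  # prints 61.28
```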