Edit model card

mistral-nemo-gutenberg-12B-v4

TheDrummer/Rocinante-12B-v1 finetuned on jondurbin/gutenberg-dpo-v0.1.

Method

Finetuned using an A100 on Google Colab for 3 epochs.

Fine-tune Llama 3 with ORPO

Open LLM Leaderboard Evaluation Results

Detailed results can be found here

Metric Value
Avg. 19.56
IFEval (0-Shot) 23.79
BBH (3-Shot) 31.97
MATH Lvl 5 (4-Shot) 10.95
GPQA (0-shot) 8.84
MuSR (0-shot) 13.20
MMLU-PRO (5-shot) 28.62
Downloads last month
229
Safetensors
Model size
12.2B params
Tensor type
BF16
·
Inference Examples
Inference API (serverless) is not available, repository is disabled.

Model tree for nbeerbower/mistral-nemo-gutenberg-12B-v4

Finetuned
this model
Merges
10 models
Quantizations
6 models

Dataset used to train nbeerbower/mistral-nemo-gutenberg-12B-v4

Collection including nbeerbower/mistral-nemo-gutenberg-12B-v4

Evaluation results