Minitron 8B Derivative
Collection
Derived from the Nemo minitron 8B prune.
•
7 items
•
Updated
•
1
This is a merge of pre-trained language models created using mergekit.
This model was merged using the SLERP merge method.
The following models were included in the merge:
The following YAML configuration was used to produce this model:
models:
- model: jeiku/fatgirlmagicv2
- model: jeiku/magicfatgirlv2
merge_method: slerp
base_model: jeiku/fatgirlmagicv2
parameters:
t:
- value: 0.5
dtype: bfloat16