Llama-3.2-1B
Collection
8 items
•
Updated
•
1
This model is a fine-tuned version of unsloth/meta-llama-3.1-8b-instruct-bnb-4bit on the None dataset.
This model is finetuned on all successful episodes of the top 3 models from benchmark versoions 0.9 and 1.0. Approximatedly 920 episodes are in the dataset.
The dataset id is D20002
The following hyperparameters were used during training: