Edit model card

MarcoroCapy-7B

This model is a DPO fine tune of mlabonne/Marcoro14-7B-slerp on argilla/distilabel-capybara-dpo-7k-binarized

Process

Realigned the chat template to ChatML
Completed 1 Epoch
5e-5 learning rate
Training time was about 4.5 hours on 1 H100
Cost was ~$20

GGUF

TODO

Evaluations

TODO

Downloads last month: 14

Safetensors

Model size

7.24B params

Tensor type

FP16

Inference Examples

Text Generation

This model does not have enough activity to be deployed to Inference API (serverless) yet. Increase its social visibility and check back later, or deploy to Inference Endpoints (dedicated) instead.

Model tree for macadeliccc/MarcoroCapy-7B

Quantizations

2 models