Edit model card

Uploaded model

  • Developed by: rdli
  • License: apache-2.0
  • Finetuned from model : unsloth/Phi-3-mini-4k-instruct-bnb-4bit

This mistral model was trained 2x faster with Unsloth and Huggingface's TRL library.

Downloads last month
2
Safetensors
Model size
2.07B params
Tensor type
F32
FP16
U8
Inference Examples
Inference API (serverless) is not available, repository is disabled.

Model tree for rdli/rdl-k8s-v3-4bit_incremental_dpo

Quantized
this model