PEFT
Safetensors
Transformers
English
text-generation-inference
unsloth
mistral
trl
Not-For-All-Audiences
Edit model card

LoRA trained in 4-bit with 8k context using alpindale/Mistral-7B-v0.2-hf as the base model for 1 epoch.

Dataset used is a modified version of KaraKaraWitch/PIPPA-ShareGPT-formatted.

Prompt format: ChatML

Uploaded model

  • Developed by: mpasila
  • License: apache-2.0
  • Finetuned from model : unsloth/mistral-7b-v0.2-bnb-4bit

This mistral model was trained 2x faster with Unsloth and Huggingface's TRL library.

Downloads last month
1
Inference API
Unable to determine this model’s pipeline type. Check the docs .

Model tree for mpasila/PIPPA-Named-LoRA-7B

Adapter
(3)
this model

Datasets used to train mpasila/PIPPA-Named-LoRA-7B