vihangd's picture
Update README.md
34b3b15 verified
|
raw
history blame
267 Bytes
metadata
license: apache-2.0

DopeyTinyLlama-1.1B-v1

An experimental DPO finetune of SmarTinyLlama with Alpaca-QLoRA

Datasets

Trained on bagel style DPO datasets

Prompt Template

Uses chatml style prompt template