vihangd's picture
Update README.md
00a0990 verified
|
raw
history blame
187 Bytes
---
license: apache-2.0
---
An experimental DPO finetune of SmartTinyLlama with Alpaca-QLoRA
Datasets
Trained on bagel style dpo datasets
Prompt Template
Uses chatml style prompt template