vihangd
/

DopeyTinyLlama-1.1B-v1

DopeyTinyLlama-1.1B-v1 / README.md

Update README.md

34b3b15 verified 10 months ago

267 Bytes

metadata

license: apache-2.0

DopeyTinyLlama-1.1B-v1

An experimental DPO finetune of SmarTinyLlama with Alpaca-QLoRA

Trained on bagel style DPO datasets

Uses chatml style prompt template