Safetensors
llama
alignment-handbook
trl
dpo
Generated from Trainer
File size: 135 Bytes
c6d2c53
 
 
1
2
3
4
version https://git-lfs.github.com/spec/v1
oid sha256:87ab884cc89bff8b081a11cf3755e76a12e14ffaee2d8c2801894af639219afc
size 3852615520