Safetensors
llama
alignment-handbook
trl
dpo
Generated from Trainer
File size: 135 Bytes
a3f00a6
 
 
1
2
3
4
version https://git-lfs.github.com/spec/v1
oid sha256:1da39fc84cf4f39dc1971d7ef00f4cf4d33f8ed28ea8c6b4add0e3615fa72db5
size 4980945440