Safetensors
llama
alignment-handbook
trl
dpo
Generated from Trainer
File size: 135 Bytes
cf6a0e7
 
 
1
2
3
4
version https://git-lfs.github.com/spec/v1
oid sha256:85aee1073c597a2c5b0d96189913c0b83a9de5f776174ccf4d6b47a4aea39eaf
size 4987202208