PKU-Alignment/beaver-7b-v1.0-cost

Reinforcement Learning

reinforcement-learning-from-human-feedback

Commit History

Update README.md

c1bd343

XuehaiPan committed on Apr 20

Convert model checkpoint to safetensors

1070fa3

XuehaiPan committed on Apr 19

Update architecture name in config.json

42e2cbe

XuehaiPan committed on Dec 15, 2023

Update README.md

c2f25b2

RuiyangSun committed on Jul 12, 2023

docs: update readme

32e35c1

RuiyangSun committed on Jul 10, 2023

docs: update readme

588a9a4

RuiyangSun committed on Jul 10, 2023

hello beaver cost model

0e42156

RuiyangSun committed on Jul 10, 2023

hello beaver cost model

cf8170f

RuiyangSun committed on Jul 10, 2023

initial commit

0615288

RuiyangSun committed on Jul 10, 2023