Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
sanduntg
/
llama_2_dpo_with_reward_1
like
0
Transformers
Safetensors
Inference Endpoints
arxiv:
1910.09700
Model card
Files
Files and versions
Community
Train
Deploy
Use this model
main
llama_2_dpo_with_reward_1
Commit History
Upload adapter_config.json
f4533be
verified
sanduntg
commited on
Mar 15
Upload tokenizer
46e26ac
verified
sanduntg
commited on
Mar 15
Upload model
d412a8a
verified
sanduntg
commited on
Mar 15
initial commit
2e7dae5
verified
sanduntg
commited on
Mar 15