Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
Lichang-Chen
/
ODIN-ppo-L230-best
like
0
Text Generation
Transformers
PyTorch
English
llama
ODIN
RLHF
PPO
text-generation-inference
Inference Endpoints
arxiv:
2402.07319
License:
mit
Model card
Files
Files and versions
Community
Train
Deploy
Use this model
main
ODIN-ppo-L230-best
Commit History
Update README.md
38eac53
verified
Lichang-Chen
commited on
Feb 14
Upload folder using huggingface_hub
52c1c2f
verified
Lichang-Chen
commited on
Feb 14
Create README.md
f6f67dd
verified
Lichang-Chen
commited on
Feb 14
initial commit
64fe8ba
verified
Lichang-Chen
commited on
Feb 14