minnmamin/reward_modeling_anthropic_hh
Text Classification
Transformers
Safetensors
gpt2
trl
reward-trainer
Generated from Trainer
text-generation-inference
Inference Endpoints
License: mit
main
Commit History
ccde6bb (verified): End of training, minnmamin committed on Aug 21
e3d4558 (verified): End of training, minnmamin committed on Aug 21
5e927a6 (verified): initial commit, minnmamin committed on Aug 21