Ray2333/GRM-llama3.2-3B-sftreg
Text Classification
Safetensors
hendrydong/preference_700K
llama
custom_code
arxiv: 2406.10216
License: mit
main
GRM-llama3.2-3B-sftreg/model-00002-of-00002.safetensors
Commit History
Upload LlamaForCausalLM
705f9b7 (verified)
Ray2333 committed 18 days ago