Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
Ray2333
/
GRM-llama3.2-3B-sftreg
like
1
Text Classification
Safetensors
hendrydong/preference_700K
llama
custom_code
arxiv:
2406.10216
License:
mit
Model card
Files
Files and versions
Community
Train
main
GRM-llama3.2-3B-sftreg
1 contributor
History:
6 commits
Ray2333
Update README.md
d55f23c
verified
16 days ago
.gitattributes
Safe
1.52 kB
initial commit
17 days ago
README.md
Safe
6.57 kB
Update README.md
16 days ago
config.json
Safe
1.2 kB
Update config.json
17 days ago
generation_config.json
Safe
184 Bytes
Upload LlamaForCausalLM
17 days ago
model-00001-of-00002.safetensors
Safe
4.97 GB
LFS
Upload LlamaForCausalLM
17 days ago
model-00002-of-00002.safetensors
Safe
1.47 GB
LFS
Upload LlamaForCausalLM
17 days ago
model.py
Safe
8.28 kB
Upload model.py
16 days ago
model.safetensors.index.json
Safe
21.2 kB
Upload LlamaForCausalLM
17 days ago
special_tokens_map.json
Safe
434 Bytes
Upload tokenizer
17 days ago
tokenizer.json
Safe
9.09 MB
Upload tokenizer
17 days ago
tokenizer_config.json
Safe
54.7 kB
Upload tokenizer
17 days ago