Haleshot/Mathmate-7B-DELLA-ORPO
Safetensors
argilla/distilabel-math-preference-dpo
llama
finetuned
orpo
math
preference-learning
License: apache-2.0
main
Mathmate-7B-DELLA-ORPO
Commit History
Update README.md
81533a4
verified
Haleshot
committed on
Sep 29
Update README.md
80cab8f
verified
Haleshot
committed on
Sep 23
Create README.md
90127cd
verified
Haleshot
committed on
Sep 23
Upload folder using huggingface_hub
f20490d
verified
Haleshot
committed on
Sep 8
initial commit
78d1acf
verified
Haleshot
committed on
Sep 8