RLHF-And-Friends / Llama-3.2-3B-Instruct-DPO-Math
Tags: Text Generation, Transformers, Safetensors, GGUF, English, llama, text-generation-inference, unsloth, conversational, Inference Endpoints
License: apache-2.0
Commit History (branch: main)
eafdb58 | (Trained with Unsloth) | arqa39 committed 21 days ago | verified
f4c7faf | (Trained with Unsloth) | arqa39 committed 21 days ago | verified
68b1682 | (Trained with Unsloth) | arqa39 committed 22 days ago | verified
81a2727 | (Trained with Unsloth) | arqa39 committed 22 days ago | verified
19dc234 | (Trained with Unsloth) | arqa39 committed 22 days ago | verified
afd99d3 | Upload README.md with huggingface_hub | arqa39 committed 22 days ago | verified
6143d92 | initial commit | arqa39 committed 22 days ago | verified