Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
robinsmits
/
Qwen1.5-7B-Dutch-Chat-Dpo
like
0
Text Generation
PEFT
TensorBoard
Safetensors
BramVanroy/ultra_feedback_dutch_cleaned
Dutch
trl
dpo
conversational
Generated from Trainer
qwen2
arxiv:
2309.16609
License:
cc-by-nc-4.0
Model card
Files
Files and versions
Metrics
Training metrics
Community
1
Use this model
c25c91a
Qwen1.5-7B-Dutch-Chat-Dpo
/
README.md
Commit History
Adding Evaluation Results
c25c91a
verified
robinsmits
commited on
Mar 31
Update README.md
cca8d18
verified
robinsmits
commited on
Mar 30
Update README.md
e18c885
verified
robinsmits
commited on
Mar 30
Update README.md
fbc2ad9
verified
robinsmits
commited on
Mar 30
Update README.md
1c1ad28
verified
robinsmits
commited on
Mar 29
Update README.md
014a7af
verified
robinsmits
commited on
Mar 29
Update README.md
8acbaa9
verified
robinsmits
commited on
Mar 29
Update README.md
938e9b1
verified
robinsmits
commited on
Mar 29
End of training
b7e48de
verified
robinsmits
commited on
Mar 29
Upload tokenizer
e21c9b4
verified
robinsmits
commited on
Mar 29