argilla/ultrafeedback-multi-binarized-preferences-cleaned Viewer • Updated Dec 11, 2023 • 158k • 193 • 6
NickyNicky/neovalle_H4rmony_dpo_translated_English_to_Spanish Viewer • Updated May 17 • 2.02k • 43 • 4
argilla/ultrafeedback-multi-binarized-quality-preferences-cleaned Viewer • Updated Dec 11, 2023 • 155k • 43 • 4
Mitsuki-Sakamoto/hh-rlhf-reward-model-deberta-v3-large-v2-helpful-2-original_mix_50_random_seed_2 Viewer • Updated Jun 8 • 46.2k • 45 • 1
vwxyzjn/summarize_from_feedback_oai_preprocessing_1706381144 Viewer • Updated Jan 27 • 179k • 188 • 2
insub/imdb_prefix20_forDPO_gpt2-large-imdb-FT_siebert_sentiment-roberta-large-english Viewer • Updated Oct 22, 2023 • 50k • 59 • 2