Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
heegyu
's Collections
Korean Reward Modeling
Korean Pretraining Dataset
RLHF papers
Reward Modeling Datasets
Pre-training Dataset
Vision LM
Image Generation
Domain Specific (Math, Code, etc)
Machine Translation
Safety LM
Text2SQL
Korean Reward Modeling
updated
May 29
Korean Datasets, Reward Models for RLHF
Upvote
3
heegyu/KoSafeGuard-8b-0503
Text Generation
•
Updated
16 days ago
•
107
•
5
heegyu/ko-reward-model-helpful-1.3b-v0.2
Text Classification
•
Updated
Jan 10
•
15
heegyu/ko-reward-model-safety-1.3b-v0.2
Text Classification
•
Updated
Jan 13
•
15
•
5
heegyu/ko-reward-model-helpful-roberta-large-v0.1
Text Classification
•
Updated
Dec 31, 2023
•
8
•
1
heegyu/ko-reward-model-safety-roberta-large-v0.1
Text Classification
•
Updated
Dec 31, 2023
•
5
heegyu/ko-reward-model-1.3b-v0.1
Text Classification
•
Updated
Dec 7, 2023
•
7
•
1
heegyu/ko-reward-model-1.3b-v0
Text Classification
•
Updated
Dec 1, 2023
•
39
•
1
heegyu/ko-ultrafeedback-binarized-1.3b
Text Classification
•
Updated
Nov 27, 2023
•
5
•
2
maywell/ko_Ultrafeedback_binarized
Viewer
•
Updated
Nov 9, 2023
•
62k
•
56
•
28
maywell/ko_hh-rlhf-20k_filtered
Viewer
•
Updated
Nov 4, 2023
•
19.4k
•
41
•
4
heegyu/hh-rlhf-ko
Viewer
•
Updated
Dec 24, 2023
•
169k
•
85
•
3
heegyu/PKU-SafeRLHF-ko
Viewer
•
Updated
Dec 31, 2023
•
320k
•
48
•
4
heegyu/webgpt_comparisons_ko
Viewer
•
Updated
Dec 5, 2023
•
19.6k
•
17
•
2
SJ-Donald/orca-dpo-pairs-ko
Viewer
•
Updated
Jan 24
•
36k
•
81
•
7
Upvote
3
Share collection
View history
Collection guide
Browse collections