AIF Datasets (with distilabel)
Small to medium-sized datasets that are synthetically generated, labelled with AI Feedback (AIF), or both.
Note: Contains the first 1K rows of `nvidia/HelpSteer`, with the original annotations replaced by GPT-4 generated annotations for the categories correctness, coherence, complexity, verbosity, and helpfulness.
alvarobartt/HelpSteer-AIF-raw
Note: Contains the first 1K rows of `nvidia/HelpSteer`, with the original annotations replaced by GPT-4 generated annotations for the categories correctness, coherence, complexity, verbosity, and helpfulness. Unlike the dataset above, it also contains the raw responses and some additional columns.
alvarobartt/replacing-judges-with-juries-distilabel
Note: Dataset generated for the post at https://huggingface.co/blog/alvarobartt/replacing-judges-with-juries-distilabel