OpenRLHF

community

https://github.com/OpenRLHF

models 10

OpenRLHF/Llama-3-8b-rm-mixture

Updated about 11 hours ago • 6.88k

OpenRLHF/Llama-2-7b-rm-anthropic_hh-lmsys-oasst-webgpt

Updated about 11 hours ago • 4 • 1

OpenRLHF/Llama-3-8b-rm-700k

Updated about 11 hours ago • 622

OpenRLHF/Mistral-7b-PRM-Math-Shepherd

Updated Oct 30 • 219 • 1

OpenRLHF/Llama-3-8b-iter-dpo-179k

Text Generation • Updated Jul 28 • 13

OpenRLHF/Llama-3-8b-rlhf-100k

Text Generation • Updated Jun 24 • 336 • 2

OpenRLHF/Llama-3-8b-sft-mixture

Text Generation • Updated Jun 14 • 24.9k

OpenRLHF/Llama-2-7b-sft-model-ocra-500k

Text Generation • Updated Jun 9 • 541

OpenRLHF/Llama-2-13b-rm-anthropic_hh-lmsys-oasst-webgpt

Updated Jan 24 • 4

OpenRLHF/Llama-2-13b-sft-model-ocra-500k

Text Generation • Updated Jan 5 • 64 • 1

datasets 3

OpenRLHF/preference_700K

Viewer • Updated Jul 13 • 700k • 64

OpenRLHF/prompt-collection-v0.1

Viewer • Updated Jun 14 • 179k • 1.71k • 2

OpenRLHF/preference_dataset_mixture2_and_safe_pku

Viewer • Updated Jun 14 • 555k • 517 • 2