Amartya77/RLHF_PPOppo_model

Reinforcement Learning

text2text-generation

Inference Endpoints



Downloads last month: 3

Safetensors

Model size: 582M params

Tensor type: F32
