What is the range of the output score from the model?

#12
by nshen7 - opened

Is it limited to [0, 1]?

RLHFlow org

The training labels are constrained to [0, 1], but output.score is not guaranteed to fall in [0, 1], since we do not apply a sigmoid to it.
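If a bounded value is needed downstream, the raw score can be post-processed on the caller's side. The sketch below is only an illustration under assumptions: the `raw_scores` values are made up (not real model output), and the helpers `sigmoid` and `clip01` are hypothetical names, not part of the model's code. Neither transform is something the authors recommend in this thread.

```python
import math


def sigmoid(x: float) -> float:
    """Numerically stable logistic function; maps any real number into (0, 1)."""
    if x >= 0:
        return 1.0 / (1.0 + math.exp(-x))
    z = math.exp(x)
    return z / (1.0 + z)


def clip01(x: float) -> float:
    """Clamp a value to the closed interval [0, 1]."""
    return min(1.0, max(0.0, x))


# Hypothetical raw scores, chosen only to show values outside [0, 1].
raw_scores = [-0.3, 0.0, 0.42, 1.7]

squashed = [sigmoid(s) for s in raw_scores]  # strictly inside (0, 1), order preserved
clipped = [clip01(s) for s in raw_scores]    # in-range values are left unchanged

print(squashed)
print(clipped)
```

Both options keep the ranking of responses intact where they do not saturate, but they differ in scale: the sigmoid remaps every value (a raw 0.5 becomes roughly 0.62), so its output is no longer comparable to the [0, 1] training labels, while clipping leaves in-range scores as they are and collapses everything outside the interval to 0 or 1.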

Haoxiang-Wang changed discussion status to closed

Thanks for the prompt reply! Just out of curiosity, do you have a rough idea of what the output score distribution looks like?
