Difference in code example between here and GitHub

#1
by fabriceyhc - opened

Which code example is recommended?

The code in your GitHub repo looks very different. The example here uses a "sentiment-analysis" pipeline, which is unexpected and unintuitive.

NCSOFT org
•
edited Aug 13

For the reward model, the code here (on Hugging Face) is recommended, since the GitHub repo currently supports only the generative model.
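To illustrate why a "sentiment-analysis" pipeline can serve a reward model, here is a minimal sketch, not the official example. In `transformers`, "sentiment-analysis" is an alias for the "text-classification" task, so the pipeline simply wraps a sequence-classification head and returns a score. The sketch assumes the reward model exposes such a head; `MODEL_ID`, `build_reward_pipeline`, `extract_reward`, and `score_text` are hypothetical names, and any model-specific input formatting (such as a chat template) is not shown.

```python
from typing import Any, Callable

# Hypothetical placeholder: substitute the reward model's actual repo id.
MODEL_ID = "your-org/your-reward-model"


def build_reward_pipeline(model_id: str = MODEL_ID) -> Callable[..., Any]:
    """Create the classification pipeline (requires `transformers` installed)."""
    # Imported lazily so the helpers below work without transformers.
    from transformers import pipeline

    # "sentiment-analysis" is an alias for the "text-classification" task.
    return pipeline("sentiment-analysis", model=model_id)


def extract_reward(pipe_output: Any) -> float:
    """Pull the scalar score out of a text-classification pipeline result.

    Depending on the arguments, the result may be a dict, a list of dicts,
    or a nested list, so unwrap lists until a dict is reached.
    """
    item = pipe_output
    while isinstance(item, list):
        item = item[0]
    return float(item["score"])


def score_text(pipe: Callable[..., Any], text: str) -> float:
    """Score one input, asking for the raw logit rather than a probability."""
    # function_to_apply="none" skips the sigmoid/softmax post-processing.
    return extract_reward(pipe(text, function_to_apply="none"))
```

With `transformers` installed and a real model id, `score_text(build_reward_pipeline(), "text to score")` would return a single float. Whether the raw logit or a sigmoid-transformed value is the intended reward depends on the model, so check the model card.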
