THUDM
/

LongReward-llama3.1-8b-DPO

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

LongReward-llama3.1-8b-DPO / model-00003-of-00005.safetensors

Commit History

Upload folder using huggingface_hub

a74f280
verified

davidlvxin commited on Oct 22