mrzjy commited on
Commit
bd72bd4
Β·
verified Β·
1 Parent(s): 3a80cd9

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +1 -1
README.md CHANGED
@@ -85,7 +85,7 @@ This approach ensures a balanced distribution of positive and negative labels.
85
  We trained 2 models on the above dataset:
86
 
87
  - NovelWriting-Outline-Qwen2.5-7B-Instruct: The SFT LLM, trained by [Llama-Factory](https://github.com/hiyouga/LLaMA-Factory).
88
- - [NovelWriting-Outline-PRM-Qwen2.5-0.5B-Reward](https://huggingface.co/mrzjy/NovelWriting-Outline-PRM-Qwen2.5-0.5B-Reward): The PRM for outline generation task, trained by using TRL library [Doc](https://huggingface.co/docs/trl/prm_trainer).
89
 
90
  ## 4. Performance Evaluation
91
 
 
85
  We trained 2 models on the above dataset:
86
 
87
  - NovelWriting-Outline-Qwen2.5-7B-Instruct: The SFT LLM, trained by [Llama-Factory](https://github.com/hiyouga/LLaMA-Factory).
88
+ - [NovelWriting-Outline-PRM-Qwen2.5-0.5B-Reward](https://huggingface.co/mrzjy/NovelWriting-Outline-PRM-Qwen2.5-0.5B-Reward): The PRM for outline generation task, trained by using TRL library ([Refer to Doc](https://huggingface.co/docs/trl/prm_trainer)).
89
 
90
  ## 4. Performance Evaluation
91