Update README.md
Browse files
README.md
CHANGED
@@ -85,7 +85,7 @@ This approach ensures a balanced distribution of positive and negative labels.
|
|
85 |
We trained 2 models on the above dataset:
|
86 |
|
87 |
- NovelWriting-Outline-Qwen2.5-7B-Instruct: The SFT LLM, trained by [Llama-Factory](https://github.com/hiyouga/LLaMA-Factory).
|
88 |
-
- [NovelWriting-Outline-PRM-Qwen2.5-0.5B-Reward](https://huggingface.co/mrzjy/NovelWriting-Outline-PRM-Qwen2.5-0.5B-Reward): The PRM for outline generation task, trained by using TRL library [Doc](https://huggingface.co/docs/trl/prm_trainer).
|
89 |
|
90 |
## 4. Performance Evaluation
|
91 |
|
|
|
85 |
We trained 2 models on the above dataset:
|
86 |
|
87 |
- NovelWriting-Outline-Qwen2.5-7B-Instruct: The SFT LLM, trained by [Llama-Factory](https://github.com/hiyouga/LLaMA-Factory).
|
88 |
+
- [NovelWriting-Outline-PRM-Qwen2.5-0.5B-Reward](https://huggingface.co/mrzjy/NovelWriting-Outline-PRM-Qwen2.5-0.5B-Reward): The PRM for outline generation task, trained by using TRL library ([Refer to Doc](https://huggingface.co/docs/trl/prm_trainer)).
|
89 |
|
90 |
## 4. Performance Evaluation
|
91 |
|