mrzjy
/

NovelWriting-Outline-PRM-Qwen2.5-0.5B-Reward

Token Classification

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

mrzjy commited on 3 days ago

Commit

129fd3f

·

verified ·

1 Parent(s): e20f44f

Update README.md

Files changed (1) hide show

README.md +25 -1

README.md CHANGED Viewed

@@ -15,4 +15,28 @@ tags:
 # PRM for Simplistic Novel Outline Generation
-This is a small project driven by personal interest, focused on developing a Process-Level Reward Model (PRM) for a specific task: generating outlines for novels. The aim is to explore how PRMs can provide quality signals for the process of structured outline creation.

 # PRM for Simplistic Novel Outline Generation
+This is a small project driven by personal interest, focused on developing a Process-Level Reward Model (PRM) for a specific task: generating outlines for novels.
+The aim is to explore how PRMs can provide quality signals for the process of structured outline creation.
+## Task Definition
+In practice, creating a novel outline typically involves a far more complex reflective process.
+However, for the purposes of this experiment, the task is simplified as follows:
+- Given a `story idea` and `character designs`, generate `outlines` for the first `n` chapters (`n` can range from 1 to 10, as for the construction of the training data).
+Below is a system prompt template used for training data construction:
+- English
+```
+```
+- Chinese
+```
+```