Update README.md
Browse files
README.md
CHANGED
@@ -15,4 +15,28 @@ tags:
|
|
15 |
|
16 |
# PRM for Simplistic Novel Outline Generation
|
17 |
|
18 |
-
This is a small project driven by personal interest, focused on developing a Process-Level Reward Model (PRM) for a specific task: generating outlines for novels.
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
15 |
|
16 |
# PRM for Simplistic Novel Outline Generation
|
17 |
|
18 |
+
This is a small project driven by personal interest, focused on developing a Process-Level Reward Model (PRM) for a specific task: generating outlines for novels.
|
19 |
+
|
20 |
+
The aim is to explore how PRMs can provide quality signals for the process of structured outline creation.
|
21 |
+
|
22 |
+
## Task Definition
|
23 |
+
|
24 |
+
In practice, creating a novel outline typically involves a far more complex reflective process.
|
25 |
+
|
26 |
+
However, for the purposes of this experiment, the task is simplified as follows:
|
27 |
+
|
28 |
+
- Given a `story idea` and `character designs`, generate `outlines` for the first `n` chapters (`n` can range from 1 to 10, as for the construction of the training data).
|
29 |
+
|
30 |
+
Below is a system prompt template used for training data construction:
|
31 |
+
|
32 |
+
- English
|
33 |
+
|
34 |
+
```
|
35 |
+
|
36 |
+
```
|
37 |
+
|
38 |
+
- Chinese
|
39 |
+
|
40 |
+
```
|
41 |
+
|
42 |
+
```
|