---
license: apache-2.0
language:
- zh
- en
base_model:
- Qwen/Qwen2.5-0.5B-Instruct
pipeline_tag: token-classification
library_name: transformers
tags:
- novel-writing
- PRM
- outline
---

# PRM for Simplistic Novel Outline Generation

This is a small project driven by personal interest, focused on developing a Process-Level Reward Model (PRM) for a specific task: generating outlines for novels. The aim is to explore how PRMs can provide quality signals for the process of structured outline creation.

## Task Definition

In practice, creating a novel outline typically involves a far more complex reflective process. However, for the purposes of this experiment, the task is simplified as follows:

- Given a `story idea` and `character designs`, generate `outlines` for the first `n` chapters (in the training data, `n` ranges from 1 to 10).

Below is a system prompt template used for training data construction:

- English
```
```
- Chinese
```
```
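
The simplified task above can be pictured as a small input structure. The following is only a minimal sketch, not the project's actual data pipeline: the names `OutlineTask` and `render_user_message`, and the layout of the rendered message, are hypothetical and chosen for illustration. The only detail taken from the task definition is that `n` lies between 1 and 10.

```python
from dataclasses import dataclass

# Range of `n` stated in the task definition (training data uses 1 to 10).
MIN_CHAPTERS = 1
MAX_CHAPTERS = 10


@dataclass(frozen=True)
class OutlineTask:
    """Hypothetical container for one outline-generation request."""

    story_idea: str
    character_designs: str
    n_chapters: int

    def __post_init__(self) -> None:
        # Reject chapter counts outside the range used for training data.
        if not MIN_CHAPTERS <= self.n_chapters <= MAX_CHAPTERS:
            raise ValueError(
                f"n_chapters must be in [{MIN_CHAPTERS}, {MAX_CHAPTERS}], "
                f"got {self.n_chapters}"
            )


def render_user_message(task: OutlineTask) -> str:
    """Render the task as plain text (illustrative layout, not the real template)."""
    return (
        f"Story idea:\n{task.story_idea}\n\n"
        f"Character designs:\n{task.character_designs}\n\n"
        f"Write outlines for the first {task.n_chapters} chapter(s)."
    )


if __name__ == "__main__":
    example = OutlineTask(
        story_idea="A lighthouse keeper finds a map in a bottle.",
        character_designs="Mara: stubborn keeper. Ilo: curious apprentice.",
        n_chapters=3,
    )
    print(render_user_message(example))
```

Validating `n` at construction time keeps out-of-range requests from reaching the generator, which matters here because the training data only covers 1 to 10 chapters.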