burtenshaw
first refactored commit
9cc6120
|
raw
history blame
758 Bytes
## Dataset Schema for Project
### KTO Dataset Structure
The dataset should be organized into two splits: `train` and `test`.
Each split contains the following features:
| **Feature** | **Type** | **Description** |
|---------------|-----------|--------------------------------------------------------------------------------------|
| `prompt` | `string` | The input text for the model. This should be a natural language query or input. |
| `completion` | `string` | The output text generated by the model in response to the `prompt`. |
| `label` | `bool` | A binary value (`True` or `False`) indicating whether the `completion` is desirable. |