Add ChatML prompt format details
README.md
CHANGED
@@ -30,6 +30,30 @@ And then a DPO finetune using:
 - [Intel/orca_dpo_pairs](https://huggingface.co/datasets/Intel/orca_dpo_pairs)
 - [argilla/ultrafeedback-binarized-preferences-cleaned](https://huggingface.co/datasets/argilla/ultrafeedback-binarized-preferences-cleaned)

+# Prompt Format
+
+Phi-2 Orange uses ChatML as the prompt format, with or without the system instruction.
+
+To prompt with a system instruction (use whatever system prompt you like):
+
+```
+<|im_start|>system
+You are a helpful assistant for Python which outputs in Markdown format.<|im_end|>
+<|im_start|>user
+Write a function to calculate the Fibonacci sequence<|im_end|>
+<|im_start|>assistant
+
+```
+
+You can also omit the system prompt if you wish:
+
+```
+<|im_start|>user
+Why is the sky blue?<|im_end|>
+<|im_start|>assistant
+
+```
+
 # Evaluations
 Evaluations done using mlabonne's useful [Colab notebook llm-autoeval](https://github.com/mlabonne/llm-autoeval).
 Also check out the alternative leaderboard at [Yet_Another_LLM_Leaderboard](https://huggingface.co/spaces/mlabonne/Yet_Another_LLM_Leaderboard)
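
The ChatML layout added under "Prompt Format" in this change can also be assembled programmatically. The following is a minimal sketch, not code from the model repository: the helper name `build_chatml_prompt` and its parameters are hypothetical, and it only reproduces the token layout shown in the README (each turn wrapped in `<|im_start|>role` and `<|im_end|>`, ending with an open assistant turn).

```python
from typing import Optional


def build_chatml_prompt(user_message: str, system_prompt: Optional[str] = None) -> str:
    """Assemble a single-turn ChatML prompt (hypothetical helper, for illustration)."""
    parts = []
    # The system turn is optional, matching the two variants in the README.
    if system_prompt is not None:
        parts.append(f"<|im_start|>system\n{system_prompt}<|im_end|>\n")
    parts.append(f"<|im_start|>user\n{user_message}<|im_end|>\n")
    # Leave the assistant turn open so the model generates the reply.
    parts.append("<|im_start|>assistant\n")
    return "".join(parts)


if __name__ == "__main__":
    print(build_chatml_prompt(
        "Write a function to calculate the Fibonacci sequence",
        system_prompt="You are a helpful assistant for Python which outputs in Markdown format.",
    ))
    print(build_chatml_prompt("Why is the sky blue?"))
```

Keeping the system prompt as an optional argument means one function covers both documented variants. How the resulting string is tokenized and passed to the model depends on the inference library in use and is outside this sketch.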