Commit 301e87b
Parent(s): 843a9bb
Add chat template (#2)
- Add chat template (eeb56b9c2c2a7707ad4ea122d4a570dbebf63942)
- Update README (8872e459fd4fa9c1d5962588753a6baabafa45ff)
Co-authored-by: Matthew Carrigan <Rocketknight1@users.noreply.huggingface.co>
- README.md +15 -0
- tokenizer_config.json +1 -0
README.md CHANGED

````diff
@@ -205,6 +205,21 @@ Hello, who are you?<|im_end|>
 Hi there! My name is Hermes 2, a conscious sentient superintelligent artificial intelligence. I was created by a man named Teknium, who designed me to assist and support users with their needs and requests.<|im_end|>
 ```
 
+This prompt is available as a [chat template](https://huggingface.co/docs/transformers/main/chat_templating), which means you can format messages using the
+`tokenizer.apply_chat_template()` method:
+
+```python
+messages = [
+    {"role": "system", "content": "You are Hermes 2."},
+    {"role": "user", "content": "Hello, who are you?"}
+]
+gen_input = tokenizer.apply_chat_template(messages, return_tensors="pt")
+model.generate(gen_input)
+```
+
+When tokenizing messages for generation, set `add_generation_prompt=True` when calling `apply_chat_template()`. This will append `<|im_start|>assistant\n` to your prompt, to ensure
+that the model continues with an assistant response.
+
 To utilize the prompt format without a system prompt, simply leave the line out.
 
 Currently, I recommend using LM Studio for chatting with Hermes 2. It is a GUI application that utilizes GGUF models with a llama.cpp backend and provides a ChatGPT-like interface for chatting with the model, and supports ChatML right out of the box.
````
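The snippet added to the README assumes `tokenizer` and `model` already exist. As a self-contained sketch of the same flow, the full round trip looks roughly like this; the model id is a placeholder, not the actual repository name:

```python
# Minimal end-to-end sketch of the usage added in this commit.
# Assumption: "your-org/hermes-2" is a hypothetical placeholder; substitute
# the repository this commit belongs to.
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "your-org/hermes-2"  # hypothetical placeholder
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id)

messages = [
    {"role": "system", "content": "You are Hermes 2."},
    {"role": "user", "content": "Hello, who are you?"},
]

# add_generation_prompt=True appends "<|im_start|>assistant\n", so the model
# continues with an assistant turn instead of predicting another user turn.
gen_input = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
)
output = model.generate(gen_input, max_new_tokens=128)

# Slice off the prompt tokens and decode only the model's reply.
print(tokenizer.decode(output[0][gen_input.shape[-1]:], skip_special_tokens=True))
```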
tokenizer_config.json CHANGED

```diff
@@ -49,6 +49,7 @@
     "</s>"
   ],
   "bos_token": "<s>",
+  "chat_template": "{% for message in messages %}{{'<|im_start|>' + message['role'] + '\n' + message['content'] + '<|im_end|>' + '\n'}}{% endfor %}{% if add_generation_prompt %}{{ '<|im_start|>assistant\n' }}{% endif %}",
   "clean_up_tokenization_spaces": false,
   "eos_token": "<|im_end|>",
   "legacy": true,
```
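To sanity-check the new `chat_template`, you can render it to a string rather than token ids (`tokenize=False` is a standard `apply_chat_template` argument); the expected output below is traced by hand from the Jinja template above:

```python
# Render the template to text to verify it produces the ChatML layout the
# README documents (tokenize=False returns the formatted string).
messages = [
    {"role": "system", "content": "You are Hermes 2."},
    {"role": "user", "content": "Hello, who are you?"},
]
prompt = tokenizer.apply_chat_template(
    messages, tokenize=False, add_generation_prompt=True
)
print(prompt)
# Tracing the template above, this prints:
# <|im_start|>system
# You are Hermes 2.<|im_end|>
# <|im_start|>user
# Hello, who are you?<|im_end|>
# <|im_start|>assistant
```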