YC-Chen committed on
Commit 6a281a4
1 Parent(s): 21032c4

Update README.md

Files changed (1)
  1. README.md +16 -2
README.md CHANGED
@@ -185,9 +185,9 @@ model = AutoModelForCausalLM.from_pretrained(
 )
 ```
 
-The structure of the query template follows that of Mistral-7B-Instruct, as shown below.
+The structure of the query is
 ```txt
-<s> SYS_PROMPT [INST] QUERY1 [/INST] RESPONSE1 [INST] QUERY2 [/INST]
+<s>SYS_PROMPT [INST] QUERY1 [/INST] RESPONSE1 [INST] QUERY2 [/INST]
 ```
 where `SYS_PROMPT`, `QUERY1`, `RESPONSE1`, and `QUERY2` can be provided by the user.
 
@@ -196,6 +196,20 @@ The suggested default `SYS_PROMPT` is
 You are a helpful AI assistant built by MediaTek Research. The user you are helping speaks Traditional Chinese and comes from Taiwan.
 ```
 
+We also integrate `chat_template` into [tokenizer_config.json](tokenizer_config.json), so you can `apply_chat_template` to get the prompt.
+
+```python
+>>> from transformers import AutoTokenizer
+>>> tokenizer = AutoTokenizer.from_pretrained("MediaTek-Research/Breeze-7B-Instruct-v0.1")
+>>> chat = [
+... {"role": "user", "content": "Hello, how are you?"},
+... {"role": "assistant", "content": "I'm doing great. How can I help you today?"},
+... {"role": "user", "content": "I'd like to show off how chat templating works!"},
+... ]
+>>> tokenizer.apply_chat_template(chat, tokenize=False)
+
+```
+
 ## Citation
 
 ```
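
As a sanity check of the template shown in the diff, here is a minimal sketch that assembles the prompt string by hand. The `build_prompt` helper and the example turns are illustrative only, not part of the repository; the string it produces should match what `apply_chat_template(chat, tokenize=False)` returns, up to the exact whitespace emitted by the bundled template.

```python
# Minimal sketch: build a Breeze-7B-Instruct prompt by hand, following
#   <s>SYS_PROMPT [INST] QUERY1 [/INST] RESPONSE1 [INST] QUERY2 [/INST]
# `build_prompt` is a hypothetical helper, not part of the repository.

SYS_PROMPT = (
    "You are a helpful AI assistant built by MediaTek Research. "
    "The user you are helping speaks Traditional Chinese and comes from Taiwan."
)

def build_prompt(system, turns):
    """`turns` is a list of (query, response) pairs; response is None for the final query."""
    prompt = f"<s>{system} "
    for query, response in turns:
        prompt += f"[INST] {query} [/INST]"
        if response is not None:
            prompt += f" {response} "
    return prompt

print(build_prompt(SYS_PROMPT, [
    ("Hello, how are you?", "I'm doing great. How can I help you today?"),
    ("I'd like to show off how chat templating works!", None),
]))
```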
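
A follow-on sketch for feeding the templated chat straight into generation: it assumes the `tokenizer` and `model` objects constructed earlier in the README, and uses only standard `transformers` calls (`apply_chat_template` with `tokenize=True`, then `generate`).

```python
# Follow-on sketch (assumes `tokenizer` and `model` from the README above).
chat = [{"role": "user", "content": "Hello, how are you?"}]

# tokenize=True returns token IDs; return_tensors="pt" wraps them in a tensor.
input_ids = tokenizer.apply_chat_template(chat, tokenize=True, return_tensors="pt")
outputs = model.generate(input_ids, max_new_tokens=128)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```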