dranger003 committed on
Commit 40e3c2b · verified · 1 Parent(s): 2ce05e8

Update README.md

Files changed (1)
  1. README.md +1 -1
README.md CHANGED
@@ -7,7 +7,7 @@ GGUF quants for https://huggingface.co/HuggingFaceH4/zephyr-7b-gemma-v0.1
 
  > Zephyr is a series of language models that are trained to act as helpful assistants. Zephyr 7B Gemma is the third model in the series, and is a fine-tuned version of [google/gemma-7b](https://huggingface.co/google/gemma-7b) that was trained on on a mix of publicly available, synthetic datasets using Direct Preference Optimization (DPO). You can reproduce the training of this model via the recipe provided in the [Alignment Handbook](https://github.com/huggingface/alignment-handbook).
 
- There are few things to consider when using this model:
+ There are few things to consider when using this model with llama.cpp:
  * Special tokens `<|im_start|>` and `<|im_end|>` are not properly mapped as overrides of `<start_of_turn>` and `<end_of_turn>` (issue in the GGUF)
  * Repeat penalty must `1.0` (i.e. disabled) just like with the base model
  * The model was not trained with the system instructions (i.e. don't add the `system` instructions part of the chatml template)
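
The notes in the changed README can be sketched as a small prompt builder. This is only an illustration under stated assumptions: `build_prompt` and `SAMPLING` are hypothetical names, not part of this repository or of llama.cpp, and the exact whitespace of the chatml layout (role, newline, content, end token, newline) is assumed rather than taken from the README. The sketch refuses `system` turns and keeps the repeat penalty at `1.0`, as the notes advise.

```python
# Hypothetical helper, not part of this repository or of llama.cpp.
# The chatml layout below (role, newline, content, end token, newline)
# is an assumption about the template, not something the README states.
IM_START = "<|im_start|>"
IM_END = "<|im_end|>"

# Per the README notes: repeat penalty stays at 1.0 (i.e. disabled).
# The key name is illustrative; map it to whatever your runner expects.
SAMPLING = {"repeat_penalty": 1.0}


def build_prompt(turns, add_generation_prompt=True):
    """Build a chatml-style prompt from (role, content) pairs.

    Only "user" and "assistant" roles are accepted, because the README
    says the model was not trained with system instructions.
    """
    parts = []
    for role, content in turns:
        if role not in ("user", "assistant"):
            raise ValueError(f"unsupported role: {role!r}")
        parts.append(f"{IM_START}{role}\n{content}{IM_END}\n")
    if add_generation_prompt:
        # Leave the assistant turn open so the model writes the reply.
        parts.append(f"{IM_START}assistant\n")
    return "".join(parts)


if __name__ == "__main__":
    print(build_prompt([("user", "Write a haiku about the wind.")]))
```

Because the README reports that the special tokens are not properly mapped in the GGUF, check how your runner tokenizes these strings before relying on a prompt built this way.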