Update README.md
These new methods were released to llama.cpp on 26th April. You will need to pull …
Don't expect any third-party UIs/tools to support them yet.

## How to run in `llama.cpp`
I use the following command line; adjust for your tastes and needs:

```
./main -t 18 -m WizardLM-7B.GGML.q4_2.bin --color -c 2048 --temp 0.7 --repeat_penalty 1.1 -n -1 -p "Below is an instruction that describes a task. Write a response that appropriately completes the request.
### Instruction:
Write a story about llamas
### Response:"
```
Change `-t 18` to the number of physical CPU cores you have. For example, if your system has 8 cores/16 threads, use `-t 8`.
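If you're unsure how many physical cores your machine has (as opposed to hyperthreads), one common way to check on Linux is with `lscpu`; this is a general sketch, not something from the original README:

```shell
# Count unique (core, socket) pairs = physical cores (Linux, util-linux lscpu)
lscpu -p=Core,Socket | grep -v '^#' | sort -u | wc -l
```

On an 8-core/16-thread system this prints 8, so you would pass `-t 8`.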
If you want to have a chat-style conversation, replace the `-p <PROMPT>` argument with `-i -ins`.
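Putting that together, an interactive invocation might look like this (same flags as the command above, with the prompt argument swapped for interactive instruct mode):

```
./main -t 18 -m WizardLM-7B.GGML.q4_2.bin --color -c 2048 --temp 0.7 --repeat_penalty 1.1 -n -1 -i -ins
```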
## How to run in `text-generation-webui`
Put the desired .bin file in a model directory with `ggml` (case sensitive) in its name.
Further instructions here: [text-generation-webui/docs/llama.cpp-models.md](https://github.com/oobabooga/text-generation-webui/blob/main/docs/llama.cpp-models.md).
# Original model info

Overview of Evol-Instruct