Commit ddf839b (1 parent: 3e231a4), committed by anon8231489123
Create README.md

README.md ADDED
@@ -0,0 +1,13 @@
+** Converted model for GPTQ from https://huggingface.co/lmsys/vicuna-13b-delta-v0 **
+
+GPTQ conversion command (on CUDA branch):
+CUDA_VISIBLE_DEVICES=0 python llama.py ../lmsys/vicuna-13b-v0 c4 --wbits 4 --true-sequential --groupsize 128 --save vicuna-13b-4bit-128g.pt
+
+Added 1 token to the tokenizer model:
+python llama-tools/add_tokens.py lmsys/vicuna-13b-v0/tokenizer.model /content/tokenizer.model llama-tools/test_list.txt
+
+Use of Oobabooga with these tags:
+--wbits 4
+--groupsize 128
+
+Enjoy
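
The two flags listed at the end of the README are meant to be passed to the Oobabooga text-generation-webui launcher. A minimal sketch of such a launch line follows. The `server.py` entry point, the `--model` flag, and the `vicuna-13b-4bit-128g` directory name are assumptions that the commit does not state, so adjust them to the local install. The snippet only assembles and prints the command; it does not start the web UI.

```shell
#!/bin/sh
# Hypothetical model directory name (assumption): point this at the folder
# that holds the converted 4-bit weights and the tokenizer.
MODEL_DIR="vicuna-13b-4bit-128g"

# --wbits 4 and --groupsize 128 come from the README; they mirror the
# settings used in the GPTQ conversion command.
# server.py and --model are assumed launcher details, not from the commit.
LAUNCH_CMD="python server.py --model ${MODEL_DIR} --wbits 4 --groupsize 128"

# Print the command for inspection instead of executing it.
echo "${LAUNCH_CMD}"
```

Once the printed line looks right for the local setup, it can be run from the web UI's checkout directory.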