How do I convert v0 to v1 for the new llama.cpp?
#1 by jimaldon - opened
The current v0 is incompatible with llama.cpp.
Oops I completely forgot about this one. I'll do it later today.
You'll need the Hugging Face-converted PyTorch files, then merge them into a single file; there should be a script for hf to pth. Then convert the pth file into a GGML f32 file (option 0), and quantize it to q4_1 (option 3).
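A rough sketch of those last two steps, assuming the conversion script and quantize tool shipped with llama.cpp at the time; the model directory `models/7B/` and exact script names are placeholders and may differ in your checkout:

```shell
# Convert the merged pth checkpoint to a GGML f32 file (ftype 0).
python3 convert-pth-to-ggml.py models/7B/ 0

# Quantize the f32 model to q4_1 (quantization type 3).
./quantize models/7B/ggml-model-f32.bin models/7B/ggml-model-q4_1.bin 3
```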
Any updates on the q4_1 model?