Running this model with llama.cpp gives gibberish output
#12 opened by xiaojinchuan
I converted this model to GGML and quantized it to 4-bit using https://github.com/ggerganov/llama.cpp/blob/master/convert.py, then ran the quantized model with llama-cpp-python, but the output is gibberish, as shown below.
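For reference, a minimal sketch of how a quantized model is typically loaded and run with llama-cpp-python; the model path and prompt here are placeholders, not the exact values from the original run:

```python
# Minimal llama-cpp-python repro sketch. The model path and prompt
# are placeholders, not the exact values used in the original report.
from llama_cpp import Llama

# Load the 4-bit quantized GGML model produced by the conversion step.
llm = Llama(model_path="./models/ggml-model-q4_0.bin")

# Generate a short completion; gibberish on a simple prompt like this
# usually points at a conversion/quantization problem rather than sampling.
output = llm("Q: What is the capital of France? A:", max_tokens=32)
print(output["choices"][0]["text"])
```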