How to get a GGML model running using FastChat on an M1 Mac?
#7 · opened by kkostecky
Hi there, can someone give me directions on how to get a GGML model like this running using FastChat on an M1 Mac? I have the regular Vicuna 7B and 13B models running, but these are not PyTorch files. Thanks!
FastChat doesn't support GGML as far as I know. You're gonna have to use either oobabooga or llama.cpp.
Also, since you're on an M1, make sure to get the q4_2 models. They're great on Apple Silicon.
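If you'd rather drive it from a script than the CLI, the llama-cpp-python bindings can load GGML files directly. Here's a minimal sketch, assuming you've installed the package with `pip install llama-cpp-python`; the model path and the prompt format are placeholders you'd swap for your own:

```python
# Minimal sketch using the llama-cpp-python bindings.
# The model path is a placeholder; point it at your own GGML file.
from llama_cpp import Llama

llm = Llama(model_path="./models/ggml-vicuna-13b-q4_2.bin")

# Run one completion. Vicuna-style models were tuned on the
# "### Human: / ### Assistant:" format, so we stop before the next turn.
output = llm(
    "### Human: What is the capital of France?\n### Assistant:",
    max_tokens=64,
    stop=["### Human:"],
)
print(output["choices"][0]["text"])
```

This is just one way to do it; running the llama.cpp `main` binary directly works the same way under the hood.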
Thanks. Yeah, I already have it running in llama.cpp with the q4_2 model.
OK, fair enough regarding FastChat. Thank you!
This might be of interest
https://github.com/oobabooga/text-generation-webui/wiki/llama.cpp-models