Is it using ggml to compute?

#30
by CHNtentes - opened

Or just dequant from gguf then use transformers

We're just using it as a storage format so we dequant on the fly and use the code in ComfyUI (which is why it's the reference format not the diffusers one). Using the ggml.dll kernes would be nice though.

Sign up or log in to comment