qwenv2-7b-inst-imatrix-gguf / qwen7bv2inst_iq4xs_embedding4xs_output8bit.gguf

Commit History

best speed/perplexity for mobile devices with int8 acceleration
9869461
verified

nisten committed on