nisten
/

qwenv2-7b-inst-imatrix-gguf

Inference Endpoints

Model card Files Files and versions Community

qwenv2-7b-inst-imatrix-gguf / qwen7bv2inst_iq4xs_embedding4xs_output8bit.gguf

Commit History

best speed/perplexity for mobile devices with int8 acceleration

9869461
verified

nisten commited on Jun 16