GGUF Mistral-Nemo-2407-Instruct OQ8_0.EF32 IQuants
Collection
Custom GGUF quants of Mistral-Nemo-2407-Instruct, where the output tensors are quantized to Q8_0 while the embeddings are kept at F32.
1 item