Edit model card

ExLlamaV2 quantizations of: ArliAI - Llama-3.1-8B-ArliAI-RPMax-v1.3

Quantizations (6hb)
8.0bpw
7.5bpw
7.0bpw
6.5bpw
6.0bpw
5.5bpw
5.0bpw
4.5bpw
4.0bpw
3.5bpw
3.0bpw
2.5bpw
2.0bpw

If you need a specific model quantization or a particular bits per weight, please let me know. I’m happy to help quantize lesser known models.

This is my first model quantization! If you have any suggestions for improvements or feedback, feel free to reach out. Your input is greatly appreciated and helps me make quantizations better for everyone.

Special thanks to turboderp for developing the tools that made these quantizations possible. Your contributions are greatly appreciated!

Downloads last month

-

Downloads are not tracked for this model. How to track
Inference Examples
Unable to determine this model's library. Check the docs .

Model tree for TheMelonGod/Llama-3.1-8B-ArliAI-RPMax-v1.3-exl2

Quantized
(11)
this model