
ExLlamaV2 6.0 bpw (bits per weight) quant of xxx777xxxASD/PrimaSumika-10.7B-128k. Fits in 12GB of VRAM at 42k context with a 4-bit cache.
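A rough back-of-the-envelope check of the 12GB figure, as a sketch only. It assumes the nominal 10.7B parameter count from the model name, treats every weight as stored at exactly 6.0 bits, and reads "12GB" as 12 GiB; real EXL2 files may differ slightly, and the cache size is not computed here because it depends on architecture details not stated in this card. The helper name `quant_weight_bytes` is made up for illustration.

```python
def quant_weight_bytes(n_params: float, bits_per_weight: float) -> float:
    """Approximate size of the quantized weights in bytes (hypothetical helper)."""
    return n_params * bits_per_weight / 8


# Assumptions: nominal parameter count from the model name, uniform 6.0 bpw.
N_PARAMS = 10.7e9
BPW = 6.0
VRAM_BUDGET_GIB = 12.0  # assumes "12GB" means 12 GiB

weights_bytes = quant_weight_bytes(N_PARAMS, BPW)
weights_gib = weights_bytes / 2**30

# Whatever is left has to hold the 4-bit cache, activations, and overhead.
headroom_gib = VRAM_BUDGET_GIB - weights_gib

print(f"weights:  {weights_gib:.2f} GiB")
print(f"headroom: {headroom_gib:.2f} GiB for cache and overhead")
```

Under these assumptions the weights take roughly 7.5 GiB, which leaves about 4.5 GiB for the 4-bit cache at 42k context plus runtime overhead.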

