a few interesting models
would you consider making 2 bit and 1.5 bit quants of:
https://huggingface.co/deepnight-research/Saily_220B
https://huggingface.co/quantumaikr/falcon-180B-WizardLM_Orca
I'm doing Saily 100B now, but that's pretty much the last one I'm gonna do.
@dranger003
, please make the GGUF version of this model. It seems so promising. https://huggingface.co/Wtzwho/Prometh-222B
As it's a 222B model, please share the IQ1_S GGUF of it too.
An importance-matrix (imatrix) quantization of Prometh-222B would be fantastic. As a noob (this would be my first imatrix quantization), I gave it a shot on a RunPod pod, but I kept running into the error "ggml_new_object: not enough space in the context's memory pool", even though my pod had 400GB of VRAM and 880GB of RAM. Maybe @dranger003 can pull it off.
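For anyone else trying this, the usual llama.cpp imatrix workflow looks roughly like this. This is a sketch, not the exact commands used here: the file names are placeholders, and depending on the llama.cpp build the binaries may be named `imatrix`/`quantize` or `llama-imatrix`/`llama-quantize`:

```shell
# Sketch of the llama.cpp importance-matrix workflow. Assumes llama.cpp is
# already built; GGUF and calibration file names below are placeholders.

# 1. Compute the importance matrix from an F16 GGUF and a calibration text.
#    -ngl offloads layers to GPU to speed things up, but activation stats
#    are still accumulated host-side, so lots of system RAM is needed too.
./imatrix -m prometh-222b-f16.gguf -f calibration.txt -o imatrix.dat -ngl 40

# 2. Quantize with the importance matrix, e.g. down to IQ1_S.
./quantize --imatrix imatrix.dat prometh-222b-f16.gguf prometh-222b-iq1_s.gguf IQ1_S
```

The "not enough space in the context's memory pool" error tends to come from the tooling's context allocation rather than from total VRAM/RAM, so throwing more hardware at it doesn't necessarily help.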
A 222B model, really? Sounds like I need to get a new mortgage... let me see what I can do, no promises.