Magic in here
#12
opened by BlueNipples
Some occasional coherency issues, but there's sparkle within, signs of improved intelligence.
Wonder if you could do something similar with Solar 11B, Mistral 7B, or Nemo, given that I think these punch a little above Llama-3's weight.
Maybe the Qwen2.5-14B?
That would be fantastic. Qwen2.5 14B is great for 24GB cards since it can run at Q8 with a large context length. Maybe you could distill Qwen2.5 72B into Qwen2.5 14B?