Need GGUF files for new model versions (revisions)

#25

by mohan007 - opened Jun 29

Discussion

mohan007

Jun 29

Need GGUF files for the new model version (revision = 2024-05-20).

vikhyatk

Owner Jun 29

The new revision is currently not compatible with llama.cpp due to an update to the projection architecture. Working on it.

mohan007

Jun 30

edited Jun 30

Hi @vikhyatk, in that case, what is the best way to speed up inference for moondream2? For reference, I'm currently using Hugging Face with batch inference on an NVIDIA 4090 GPU.
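
For context, the batching setup described above can be sketched roughly as follows. This is a minimal illustration, not the actual code in use: `chunked` and `run_batched` are hypothetical helper names, and `answer_batch` is a placeholder for whatever batched call the model exposes (it is assumed to take a list of images and a list of prompts and return one answer per pair).

```python
from typing import Callable, Iterator, List, Sequence, TypeVar

T = TypeVar("T")


def chunked(items: Sequence[T], batch_size: int) -> Iterator[Sequence[T]]:
    """Yield consecutive slices of `items`, each at most `batch_size` long."""
    if batch_size < 1:
        raise ValueError("batch_size must be at least 1")
    for start in range(0, len(items), batch_size):
        yield items[start:start + batch_size]


def run_batched(
    images: Sequence[object],
    prompts: Sequence[str],
    answer_batch: Callable[[Sequence[object], Sequence[str]], List[str]],
    batch_size: int = 8,
) -> List[str]:
    """Run `answer_batch` over image/prompt pairs in fixed-size batches.

    `answer_batch` is a hypothetical stand-in for the model's batched
    inference call; it must return one answer per input pair, in order.
    """
    if len(images) != len(prompts):
        raise ValueError("images and prompts must have the same length")
    answers: List[str] = []
    for image_chunk, prompt_chunk in zip(
        chunked(images, batch_size), chunked(prompts, batch_size)
    ):
        answers.extend(answer_batch(image_chunk, prompt_chunk))
    return answers
```

The batch size is the main knob here: larger batches generally make better use of the GPU until memory runs out, so the value of 8 is only a placeholder default.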
