Multi-image

by pbarker - opened Sep 26

Discussion

pbarker

Sep 26

Can this support multi-image?

chrisc36

Ai2 org Sep 26

•

edited Sep 26

The model was not trained on any multi-image data, and the preprocessor in this codebase does not currently support interleaved image/text messages.

The model's design does, in principle, allow it to handle multiple images as input by concatenating them into a very long input sequence, so it is still possible to try multi-image input (although it would require tweaking the preprocessor). However we have not experimented with this ourselves.

Florianeuler

Sep 26

Would be nice to have such a feature (especially for a multimodal RAG scenario...)

chrisc36 changed discussion status to closed Oct 4

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment