Request for support on faster inference engine
#10 · opened by solankibhargav
This model is really great and I would love to use it. However, it's really slow with Transformers inference. Could you please add support or a guide for running it with lmdeploy/vllm/sglang/mistral.rs (or any other faster inference engine)?
Found the answer: https://internvl.readthedocs.io/en/latest/internvl2.0/deployment.html#. Nvm.
Yes, you can use lmdeploy now. We also plan to support vllm, but due to a shortage of personnel, it might not be available in the near future.
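For reference, here is a minimal sketch of what lmdeploy inference looks like with the pipeline API described in the linked deployment guide. The model ID and image URL below are placeholders, so adjust them to the checkpoint you are actually serving.

```python
# Minimal sketch of running InternVL2 through lmdeploy's pipeline API,
# following the linked deployment guide. Model ID and image URL are
# placeholders; replace them with the checkpoint and input you use.
from lmdeploy import pipeline, TurbomindEngineConfig
from lmdeploy.vl import load_image

model = 'OpenGVLab/InternVL2-8B'  # placeholder model ID

# Build the pipeline on the TurboMind backend with a larger context window.
pipe = pipeline(model, backend_config=TurbomindEngineConfig(session_len=8192))

# Run a single image + text query.
image = load_image('https://example.com/sample.jpg')  # placeholder image URL
response = pipe(('describe this image', image))
print(response.text)
```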
czczup changed discussion status to closed