What's the difference between llava-hf and lmms-lab? Why are the LLaVA-NeXT-Video checkpoints different?
As the title says, why is LLaVA-NeXT-Video-7B under llava-hf pretrained from vicuna-7b-v1.5, while the same-size model under lmms-lab is released with Qwen? Why release different models?
Hope to get your response soon! Thank you!
Best wishes
These are the previous video models; one of them is https://huggingface.co/lmms-lab/LLaVA-NeXT-Video-7B/blob/main/config.json. They were based on Vicuna. The new video models have not yet been converted to HF format.
The general aim of our org is to convert LLaVA models into an HF-compatible format so that they can be loaded directly with `from_pretrained`
and support various generation techniques. I can look into the new model series and convert them soon, thanks!
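For reference, here's a minimal sketch of what loading a converted checkpoint looks like once it's in HF format. The repo id below is an assumption for illustration; substitute whichever converted llava-hf checkpoint you actually use.

```python
def load_llava_next_video(repo_id: str = "llava-hf/LLaVA-NeXT-Video-7B-hf"):
    """Load a converted LLaVA-NeXT-Video checkpoint from the Hub.

    NOTE: the default repo id is an assumption for illustration;
    point it at the converted checkpoint you want to use.
    """
    # Imported lazily so the function can be defined/tested without
    # transformers installed or any weights downloaded.
    from transformers import (
        LlavaNextVideoForConditionalGeneration,
        LlavaNextVideoProcessor,
    )

    processor = LlavaNextVideoProcessor.from_pretrained(repo_id)
    model = LlavaNextVideoForConditionalGeneration.from_pretrained(repo_id)
    return processor, model
```

Once loaded this way, the model plugs into the standard `generate` API, which is what enables the "various generation techniques" mentioned above (beam search, sampling, etc.).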
Thank you for your reply!
@RaushanTurganbay Hi, any plans to support the most up-to-date Llava-Video?
@liyucheng yes, I will add those when I have bandwidth, hopefully some time this month