Edit model card

Model Card for llava-polyglot-ko-1.3b-hf

Model Description

llava-polyglot-ko-1.3b-hf is a model based on polyglot-ko-13b. We use llava for the vision question answering. You can see ‘demo.py’ and ‘llava_gpt_neox.py’. Currently, the model has been trained on small vision question answer dataset (approx, 10k) with 1.3b (small) model.

TODO

  • Multi-turn chat based on the image
  • Larger LLM
  • More pretraining on for the vision-text adapter

References

Downloads last month
19
Inference Examples
This model does not have enough activity to be deployed to Inference API (serverless) yet. Increase its social visibility and check back later, or deploy to Inference Endpoints (dedicated) instead.

Space using LearnItAnyway/llava-polyglot-ko-1.3b-hf 1