Model Card for llava-polyglot-ko-1.3b-hf
Model Description
llava-polyglot-ko-1.3b-hf
is a model based on polyglot-ko-13b.
We use llava for the vision question answering.
You can see ‘demo.py’ and ‘llava_gpt_neox.py’.
Currently, the model has been trained on small vision question answer dataset (approx, 10k) with 1.3b (small) model.
TODO
- Multi-turn chat based on the image
- Larger LLM
- More pretraining on for the vision-text adapter
References
- Downloads last month
- 19
This model does not have enough activity to be deployed to Inference API (serverless) yet. Increase its social
visibility and check back later, or deploy to Inference Endpoints (dedicated)
instead.