Image-to-Text
Transformers
Safetensors
Japanese
llava-jp
text-generation
vision
image-captioning
VQA
Inference Endpoints
File size: 135 Bytes
ad0c144
 
 
version https://git-lfs.github.com/spec/v1
oid sha256:4f9d7f34ad88e7b37d1dd493817decda61720bfadb07d8ed48772649879fb576
size 4997886200