Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
microsoft
/
Phi-3.5-vision-instruct
like
553
Follow
Microsoft
4,766
Image-Text-to-Text
Transformers
Safetensors
multilingual
phi3_v
text-generation
nlp
code
vision
conversational
custom_code
arxiv:
2404.14219
License:
mit
Model card
Files
Files and versions
Community
32
Train
Deploy
Use this model
Adding audio support
#17
by
Barshan
- opened
Aug 30
Discussion
Barshan
Aug 30
will phi 3.5 support audio for video pationing and video reasoning?
See translation
Edit
Preview
Upload images, audio, and videos by dragging in the text input, pasting, or
clicking here
.
Tap or paste here to upload images
Comment
·
Sign up
or
log in
to comment