|
--- |
|
pipeline_tag: image-to-text |
|
inference: true |
|
license: apache-2.0 |
|
language: |
|
- en |
|
base_model: black-forest-labs/FLUX.1-dev |
|
library_name: transformers |
|
tags: |
|
- image-to-text |
|
- image-captioning |
|
- vison |
|
--- |
|
|
|
# MaxCushion Image-to-Text Model |
|
|
|
This model generates textual descriptions (captions) for input images. |
|
|
|
## Model Details |
|
|
|
- Model Name: colt12/maxcushion |
|
- Task: Image-to-Text |
|
- Type: Vision Encoder-Decoder |
|
|
|
## Usage |
|
|
|
This model can be used with the Hugging Face Inference API. Here's an example of how to use it: |