Create a processor for multimodal tasks.
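
The instruction does not say which modalities, library, or output format are intended, so the following is only a minimal sketch under stated assumptions: text and images are the two modalities, a "processor" is a single object that routes each input to a modality-specific handler and merges the results into one dictionary, and every name here (`MultimodalProcessor`, `whitespace_tokenize`, `normalize_image`, the `tokens` and `pixel_values` keys) is hypothetical and not taken from any particular library.

```python
from typing import Callable, Dict, List, Optional

# Assumption: an "image" is a 2D grid of pixel intensities in 0..255.
Image = List[List[int]]


def whitespace_tokenize(text: str) -> List[str]:
    """Toy text handler (hypothetical): lowercase and split on whitespace."""
    return text.lower().split()


def normalize_image(image: Image) -> List[List[float]]:
    """Toy image handler (hypothetical): scale 0..255 pixels into 0.0..1.0."""
    return [[pixel / 255.0 for pixel in row] for row in image]


class MultimodalProcessor:
    """Routes each modality to its own handler and merges the outputs."""

    def __init__(
        self,
        text_handler: Callable[[str], List[str]] = whitespace_tokenize,
        image_handler: Callable[[Image], List[List[float]]] = normalize_image,
    ) -> None:
        self.text_handler = text_handler
        self.image_handler = image_handler

    def __call__(
        self,
        text: Optional[str] = None,
        image: Optional[Image] = None,
    ) -> Dict[str, object]:
        # At least one modality must be supplied.
        if text is None and image is None:
            raise ValueError("provide at least one of text or image")

        outputs: Dict[str, object] = {}
        if text is not None:
            outputs["tokens"] = self.text_handler(text)
        if image is not None:
            outputs["pixel_values"] = self.image_handler(image)
        return outputs


if __name__ == "__main__":
    processor = MultimodalProcessor()
    batch = processor(text="A cat on a mat", image=[[0, 255], [255, 0]])
    print(batch)
```

Passing the handlers in through the constructor keeps the routing logic separate from the modality-specific work, so a real tokenizer or image pipeline could replace the toy handlers without changing the call site.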