Multimodal Learning

Multimodal learning is the analysis of data that spans more than one modality, such as videos or paired images and text. The main goal is to build computational models that jointly extract and interpret information from all available modalities.
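As a rough illustration of what "jointly" can mean in practice, the sketch below combines features from two modalities into a single vector by normalizing each one and concatenating them, a simple strategy often called early or feature-level fusion. This is only one of several possible approaches and is not prescribed by the definition above. The function names and the feature values are hypothetical, standing in for the outputs of real image and text encoders.

```python
def l2_normalize(vec):
    """Scale a vector to unit length so no modality dominates by magnitude."""
    norm = sum(x * x for x in vec) ** 0.5
    return [x / norm for x in vec] if norm > 0 else list(vec)


def fuse_features(image_features, text_features):
    """Concatenate normalized per-modality features into one joint vector."""
    return l2_normalize(image_features) + l2_normalize(text_features)


# Hypothetical encoder outputs; real systems would compute these from data.
image_features = [3.0, 4.0]
text_features = [0.0, 2.0, 0.0]

joint = fuse_features(image_features, text_features)
print(len(joint))  # the joint vector has one entry per input feature
```

A downstream model would then be trained on the joint vector, so that it can draw on evidence from both modalities at once rather than treating each in isolation.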