Symbolic image: AIT is Austria's largest non-university research institution

Multi-Modal Artificial Intelligence

Intelligence has many definitions, and so does artificial intelligence. Yet most mainstream Artificial Intelligence (AI) solutions reduce a task to a Machine Learning problem over a single sensory modality, such as images or video. As a result, relevant information such as the audio track of a video or the text in a document image is neglected. Here at AIT, we focus on the complex task of harnessing and combining information from multiple modalities to develop higher-level cognitive AI systems. These systems learn from correlations between, for example, audio and visual events, and yield advanced models, for instance for security-related applications. By leveraging cross-modal correlations as a training signal, multimodal AI also reduces the need for manual annotation. We use multimodal AI solutions in many of our applications, such as predictive maintenance and public safety.