Intelligence has many definitions, and so does artificial intelligence. Yet most mainstream Artificial Intelligence (AI) solutions reduce a task to a Machine Learning problem over a single sensory modality, such as images or videos. This neglects relevant information, such as the audio track of a video or the text in a document image. Here at AIT, we focus on the complex task of harnessing and combining information from multiple modalities to develop higher-level cognitive AI systems. These systems learn from correlations between, for example, audio and visual events, and provide advanced models for security-related and other applications. By leveraging cross-modal correlations, multimodal AI also reduces the need for manual annotations. We use multimodal AI solutions in many of our applications, such as predictive maintenance and public safety.
Contact form
Dr. Alexander Schindler
Thematic Coordinator / Multimodal Analytics
- +43 50550 2902
- +43 50550 4150
- alexander.schindler(at)ait.ac.at