This project has developed novel methods of analysing and describing video content based on a combination of computer vision techniques, human input and machine learning approaches. These descriptions will allow Creative Industries as well as people using their services to access, use and find audiovisual information in novel ways with better metadata. They will be able to locate particular segments in video rapidly and accurately on the basis of searching and browsing in text corpora which have been compiled from audiovisual data aligned with verbal descriptions. Moreover, the intermodal translation from images and sounds into words will attract new users, such as the deaf, hard-of-hearing, blind, and partially-sighted audiences who would else be excluded from the visual or auditory content.
The MeMAD consortium has focused especially in TV broadcasting and in on-demand media services. Four main project objectives were:
Objective O1: Develop novel methods and tools for digital storytelling
Objective O2: Deliver methods and tools to expand the size of media audiences
Objective O3: Develop an improved scientific understanding of multimodal and multilingual media content analysis, linking and consumption
Objective O4: Deliver object models and formal languages, distribution protocols and display tools for enriched audiovisual data
The results of MeMAD were well aligned to the action ICT-20-2017, developing tools for smart digital content for creative industries in the European broadcasting domain. The research results were world class as proved by our excellent success, first in various scientific benchmarking challenges, and then by the interesting results in novel real-world tasks. In addition to publishing scientific articles, sharing the software and the results in public, we moved the research field forward. By disseminating the results directly to various European broadcasters and their service suppliers, we worked to maximize our impact in the domain of production and distribution of audiovisual content.