Objetivo In recent years, there has been a marked increase in communication technologies and computer interfaces that operate within the audio-visual speech domain, (e.g. video-telephony, synthesised avatars, etc). Faithful synchrony between the visual and acoustic speech elements of such technologies is of great importance in ensuring that they are perceived by end-users as operating at high and optimal quality levels. The effect of intermodal asynchrony on user-perceived quality is typically assessed using subjective evaluation techniques. A system for automatically assessing asynchrony levels, and predicting quality degradation on that basis, would therefore be both desirable and useful, and will have direct application to techniques for automatic synchrony adjustment.The proposed project will examine audio-visual speech as both spoken naturally by humans and as artificially synthesised by machines, and will employ subjective assessment techniques and machine learning in a combined iterative semi-automatic strategy for producing a Quality Prediction Model. Different levels of intermodal asynchrony will first be assessed by human subjects, who will be required to score the effect of the asynchrony levels on perceived speech quality using standardisedtechniques that will be modified for use with multimodal speech. Asynchrony patterns and their corresponding subjective assessment scores will be automatically learned by machines, resulting in an initial Quality Prediction Model. The initial model will be tested using data that will be simultaneously assessed by humans, using the subjective assessment techniques, above. Theoutput from the prediction model will be directly compared with the subjective scores, providing an initial evaluation of the model's performance. The model will be adjusted on this basis, and re-trained using new data. The process of re-train, re-test, re-score, will be repeated iteratively, leading to a more robust quality prediction model. Ámbito científico engineering and technologyelectrical engineering, electronic engineering, information engineeringinformation engineeringtelecommunicationstelecommunications networkshumanitieslanguages and literaturelinguisticsphoneticsnatural sciencescomputer and information sciencesartificial intelligencemachine learning Programa(s) FP7-PEOPLE - Specific programme "People" implementing the Seventh Framework Programme of the European Community for research, technological development and demonstration activities (2007 to 2013) Tema(s) FP7-PEOPLE-2010-IEF - Marie-Curie Action: "Intra-European fellowships for career development" Convocatoria de propuestas FP7-PEOPLE-2010-IEF Consulte otros proyectos de esta convocatoria Régimen de financiación MC-IEF - Intra-European Fellowships (IEF) Coordinador TECHNISCHE UNIVERSITAT BERLIN Aportación de la UE € 155 542,40 Dirección STRASSE DES 17 JUNI 135 10623 Berlin Alemania Ver en el mapa Región Berlin Berlin Berlin Tipo de actividad Higher or Secondary Education Establishments Contacto administrativo Simone Ludwig (Ms.) Enlaces Contactar con la organización Opens in new window Sitio web Opens in new window Coste total Sin datos