Skip to main content
Go to the home page of the European Commission (opens in new window)
English English
CORDIS - EU research results
CORDIS
Content archived on 2024-06-16

Audiovisual to Articulatory Speech Inversion

Project description


FET - Open

ASPI is concerned with the recovery of vocal tract shape dynamics from an acoustical speech signal supplemented by image analysis of a speaker’s face.  It is (i) developing inversion methods with emphasis to audiovisual to articulatory inversion methods and the investigation of additional constraints and optimization methods to deduce the under-determined nature of inversion, and (ii) constructing a multimodal articulatory database based on ultrasound, MRI, and facial motion capture.

ASPI may lead to a much needed breakthrough in our understanding of speech and our approach to speech research, given the focus on multimodal data collection, the activities related to publicizing data collection protocols and technical specifications of data collection equipment as well the activities planned to exploit the data.

Audiovisual-to-articulatory inversion consists in recovering the vocal tract shape (from vocal folds to lips) dynamics from the acoustical speech signal, supplemented by image analysis of speaker's face. Being able to recover this information automatically would be a major break-through in speech research and technology, as a vocal tract representation of a speech signal would be both beneficial from a theoretical point of view and practically useful in many speech processing applications (language learnin, automatic speech processing, speech coding, speech therapy, film industry...). The design of audiovisual-to-articulatory inversion involves two kinds of interdependent task. The first is the development of inversion methods that successfully answer the main acknowledged difficulties (non-unicity of inverse solution, lack of phonetic relevancy of inverse solutions, impossibility of using standard spectral data), and the second is the construction of an articulatory database that comprises dynamic images of the vocal tract together with the speech signal uttered, and that for several male and female speakers. For the inversion itself the main objectives are: 1.Development of inversion methods, 2.Investigation of additional constraints to reduce the under-determination of the inversion, 3.Evaluation of the inversion methods on articulatory data. For the construction of the articulatory database: 4.Design and acquisition of articulatory data that enables both the development of articulatory models and the assessment of inversion methods, 5.Design of a low cost acquisition technology based on ultrasound and facial motion capture, 6.Exploitation of existing databases (mainly X-ray images previously acquired). The consortium provides an outstanding blend of competences, mixing groups with theoretical background in speech production, acoustic-to-articulatory inversion, computer vision and medical imaging.

Fields of science (EuroSciVoc)

CORDIS classifies projects with EuroSciVoc, a multilingual taxonomy of fields of science, through a semi-automatic process based on NLP techniques. See: https://op.europa.eu/en/web/eu-vocabularies/euroscivoc.

You need to log in or register to use this function

Topic(s)

Calls for proposals are divided into topics. A topic defines a specific subject or area for which applicants can submit proposals. The description of a topic comprises its specific scope and the expected impact of the funded project.

Call for proposal

Procedure for inviting applicants to submit project proposals, with the aim of receiving EU funding.

FP6-2002-IST-C
See other projects for this call

Funding Scheme

Funding scheme (or “Type of Action”) inside a programme with common features. It specifies: the scope of what is funded; the reimbursement rate; specific evaluation criteria to qualify for funding; and the use of simplified forms of costs like lump sums.

STREP - Specific Targeted Research Project

Coordinator

CENTRE NATIONAL DE LA RECHERCHE SCIENTIFIQUE
EU contribution
€ 612 112,40
Address
615 rue du jardin botanique
54600 Villers-l�Nancy
France

See on map

Total cost

The total costs incurred by this organisation to participate in the project, including direct and indirect costs. This amount is a subset of the overall project budget.

No data

Participants (4)

My booklet 0 0