Skip to main content
Go to the home page of the European Commission (opens in new window)
English en
CORDIS - EU research results
CORDIS
Content archived on 2024-04-30

Voice variability in speaker verification

Objective

The main aim of VeriVox is to improve the reliability of automatic speaker verification (ASV), by developing novel, phonetically-informed methods for coping with the variation in a speaker's voice. Widespread deployment of ASV is hindered by unacceptable performance in real applications - in particular by 'false rejection' rates for genuine claims which are too high to be tolerated by customer-oriented end-users such as banks. Variation in the way a person speaks contributes significantly to this problem. Advances in signal processing and statistical methods are likely to produce gradual improvements, but VeriVox will exploit the phonetically-structured nature of much within-speaker variation. Sub-aims include the development of methods for eliciting such variation in a controlled way, and the analysis of its acoustic consequences in order to provide a better understanding of the structure of the variation.

The main approach used will be "Structured Training". This will be a procedure for obtaining training data from each new speaker in a way structured to elicit different manners of speaking, so that the system becomes familiar with the variation in that person's voice likely to be encountered. Instead of merely repeating the 'passwords' (which may be short phrases), the speaker will be asked, and where necessary induced, to vary in the way he or she speaks during training. Variation will include loudness and rate, low level psychological stress, and (de-)nasalisation. Acoustic analysis will determine the success of elicitation of rate and loudness variation. As well as this 'Structured Training', training data will also be collected in the usual way to act as a control. The ASV system will learn two alternative models for each speaker, one from the structured training data and one from the training data. Speakers will make identity claims under simulated real-life conditions which will induce variation, and the performance of the two models will be compared. A population of 50 speakers of Swedish will be used, and these will make both 'genuine' and 'imposter' identity claims.

The "Structured Training" strategy is predicted to bring a 25% reduction in false rejection errors without an increase in the false acceptance rate. That is, if we suppose a 'baseline' performance of the system, using the normally trained model, of 5% false rejections and 5% false acceptances, then a reduction in false rejections to 3.75% (or lower) will be achieved without false acceptances rising from 5%. Other deliverables will be a database of utterances produced with known types of speaking variation, and acoustic data on those variations.

The work described here for the first phase of VeriVox will form the basis for a thorough development of strategies for reducing the problem of speaker variation in the second (main) phase, in which English, German, and French will be added, 'structured training' further refined, and an additional strategy of 'guided elicitation' introduced, as described in the project proposal.

Fields of science (EuroSciVoc)

CORDIS classifies projects with EuroSciVoc, a multilingual taxonomy of fields of science, through a semi-automatic process based on NLP techniques. See: The European Science Vocabulary.

You need to log in or register to use this function

Programme(s)

Multi-annual funding programmes that define the EU’s priorities for research and innovation.

Topic(s)

Calls for proposals are divided into topics. A topic defines a specific subject or area for which applicants can submit proposals. The description of a topic comprises its specific scope and the expected impact of the funded project.

Call for proposal

Procedure for inviting applicants to submit project proposals, with the aim of receiving EU funding.

Data not available

Funding Scheme

Funding scheme (or “Type of Action”) inside a programme with common features. It specifies: the scope of what is funded; the reimbursement rate; specific evaluation criteria to qualify for funding; and the use of simplified forms of costs like lump sums.

ACM - Preparatory, accompanying and support measures

Coordinator

ADAPTIVE MANUFACTURING SYSTEMS
EU contribution
No data
Total cost

The total costs incurred by this organisation to participate in the project, including direct and indirect costs. This amount is a subset of the overall project budget.

No data
My booklet 0 0