CORDIS - EU research results
Content archived on 2024-04-30

Voice variability in speaker verification

Objective

The main aim of VeriVox is to improve the reliability of automatic speaker verification (ASV), by developing novel, phonetically-informed methods for coping with the variation in a speaker's voice. Widespread deployment of ASV is hindered by unacceptable performance in real applications - in particular by 'false rejection' rates for genuine claims which are too high to be tolerated by customer-oriented end-users such as banks. Variation in the way a person speaks contributes significantly to this problem. Advances in signal processing and statistical methods are likely to produce gradual improvements, but VeriVox will exploit the phonetically-structured nature of much within-speaker variation. Sub-aims include the development of methods for eliciting such variation in a controlled way, and the analysis of its acoustic consequences in order to provide a better understanding of the structure of the variation.

The main approach used will be "Structured Training". This will be a procedure for obtaining training data from each new speaker in a way structured to elicit different manners of speaking, so that the system becomes familiar with the variation in that person's voice likely to be encountered. Instead of merely repeating the 'passwords' (which may be short phrases), the speaker will be asked, and where necessary induced, to vary the way he or she speaks during training. Variation will include loudness and rate, low-level psychological stress, and (de-)nasalisation. Acoustic analysis will determine the success of elicitation of rate and loudness variation. As well as this 'Structured Training', training data will also be collected in the usual way to act as a control. The ASV system will learn two alternative models for each speaker, one from the structured training data and one from the conventionally collected (control) training data. Speakers will make identity claims under simulated real-life conditions which will induce variation, and the performance of the two models will be compared. A population of 50 speakers of Swedish will be used, and these will make both 'genuine' and 'impostor' identity claims.
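As a rough illustration of how the two per-speaker models could be compared, the Python sketch below computes false rejection and false acceptance rates from a list of identity-claim outcomes. The `Trial` record and the `error_rates` function are hypothetical names introduced here for illustration; they are not taken from the VeriVox project, and the sketch says nothing about how the ASV system itself reaches its accept/reject decisions.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Trial:
    """One identity claim and the system's decision (hypothetical record)."""
    genuine: bool   # True if the claimant really is the claimed speaker
    accepted: bool  # True if the system accepted the claim


def error_rates(trials):
    """Return (false_rejection_rate, false_acceptance_rate) as fractions.

    False rejection rate: share of genuine claims that were rejected.
    False acceptance rate: share of impostor claims that were accepted.
    """
    genuine = [t for t in trials if t.genuine]
    impostor = [t for t in trials if not t.genuine]
    if not genuine or not impostor:
        raise ValueError("need at least one genuine and one impostor trial")
    frr = sum(not t.accepted for t in genuine) / len(genuine)
    far = sum(t.accepted for t in impostor) / len(impostor)
    return frr, far
```

Under these assumptions, the same set of claims would be scored once with the model trained on structured data and once with the control model, and `error_rates` would be called on each resulting list of trials to compare the two.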

The "Structured Training" strategy is predicted to bring a 25% reduction in false rejection errors without an increase in the false acceptance rate. That is, if we suppose a 'baseline' performance of the system, using the normally trained model, of 5% false rejections and 5% false acceptances, then a reduction in false rejections to 3.75% (or lower) will be achieved without false acceptances rising from 5%. Other deliverables will be a database of utterances produced with known types of speaking variation, and acoustic data on those variations.
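The arithmetic behind this prediction can be checked with a short Python sketch. The function names are hypothetical, and rates are expressed in percentage points (so 5.0 means 5%); the only figures used are the illustrative baseline and the 25% relative reduction stated above.

```python
def target_false_rejection(baseline_frr_pct, relative_reduction=0.25):
    """False rejection rate (in %) after applying a relative reduction."""
    return baseline_frr_pct * (1.0 - relative_reduction)


def meets_prediction(baseline_frr_pct, baseline_far_pct,
                     new_frr_pct, new_far_pct, relative_reduction=0.25):
    """True if false rejections fell by at least the stated relative amount
    and false acceptances did not rise above the baseline."""
    target = target_false_rejection(baseline_frr_pct, relative_reduction)
    return new_frr_pct <= target and new_far_pct <= baseline_far_pct


# Illustrative baseline from the text: 5% false rejections, 5% false acceptances.
target = target_false_rejection(5.0)  # 5.0 * 0.75 = 3.75
```

Note that the 25% is a relative reduction: the false rejection rate falls by a quarter of its baseline value (1.25 percentage points in the example), not by 25 percentage points.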

The work described here for the first phase of VeriVox will form the basis for a thorough development of strategies for reducing the problem of speaker variation in the second (main) phase, in which English, German, and French will be added, 'structured training' further refined, and an additional strategy of 'guided elicitation' introduced, as described in the project proposal.

Fields of science (EuroSciVoc)

CORDIS classifies projects with EuroSciVoc, a multilingual taxonomy of fields of science, through a semi-automatic process based on natural language processing techniques. See: The European Science Vocabulary.


Programme(s)

Multi-annual funding programmes that define the EU's priorities for research and innovation.

Topic(s)

Calls for proposals are divided into topics. A topic defines a specific subject or area for which applicants can submit proposals. The description of a topic comprises its specific scope and the expected impact of the funded project.

Call for proposals

Procedure for inviting applicants to submit project proposals, with the aim of receiving EU funding.

Data not available

Funding scheme

Funding scheme (or "Type of action") inside a programme with common features. It specifies: the scope of what is funded; the reimbursement rate; the specific evaluation criteria to qualify for funding; and the use of simplified forms of costs such as lump sums.

ACM - Preparatory, accompanying and support measures

Coordinator

ADAPTIVE MANUFACTURING SYSTEMS
EU contribution
No data
Total cost

The total costs incurred by this organisation to participate in the project, including direct and indirect costs. This amount is a subset of the overall project budget.

No data