CORDIS
EU research results

CORDIS

English EN

Diagnostic and intrinsic variabilities in natural speech

Project information

Grant agreement ID: 002034

  • Start date

    1 February 2004

  • End date

    31 January 2007

Funded under:

FP6-IST

  • Overall budget:

    € 3 632 544

  • EU contribution

    € 2 223 834

Coordinated by:

MULTITEL

Belgium

Objective

When one considers human performance as our target, universal automatic recognition of speech is far from a solved problem. This seems to be related by a large amount to feature extraction, modelling and adaptability weaknesses, as discussed in recognized publications. Strikingly, these weaknesses remain fully in the case of clean speech in known conditions, emphasizing deficiencies in dealing with intrinsic speech variabilities and extracting information form the signal itself. This has however been partly hidden by the more pressing problem of making state-of-the-art systems usable in real noisy situations, under constrained tasks, with the implicit target of reaching "clean speech" performance, with deserved success.

The goal of DIVINES is to develop some new knowledge towards renewed feature extraction and modelling techniques that would have better capacities, particularly in handling speech intrinsic variabilities. First, human and machine performance and the effect of intrinsic variabilities will be compared based on a diagnostic procedure. The outcomes of this analysis will then be exploited to target feature extraction, acoustic and lexical modelling. Compatibility with techniques dealing with noise and integration within current systems are also part of the objectives.

The project is relevant to the "multimodal interfaces" objective as it concerns more accurate and adaptable recognition of spoken language. This is central to the concept of multimodal man-machine interaction where the speech understanding service is likely to remain an independent component in a modular design. Advances in this field could be decisive in realizing the vision of natural interactivity.

Leaflet | Map data © OpenStreetMap contributors, Credit: EC-GISCO, © EuroGeographics for the administrative boundaries

Coordinator

MULTITEL

Address

Parc Initialis, Copernic Avenue 1
7000 Mons

Belgium

Participants (8)

BABEL TECHNOLOGIES S.A.

Belgium

CARL VON OSSIETZKY UNIVERSITAET OLDENBURG

Germany

FRANCE TELECOM SA

France

INSTITUT EURECOM

France

LOQUENDO SPA

Italy

POLITECNICO DI TORINO

Italy

THE ROYAL INSTITUTION FOR THE ADVANCEMENT OF LEARNING (MCGILL UNIVERSITY)

Canada

UNIVERSITE D'AVIGNON ET DU PAYS-VAUCLUSE

France

Project information

Grant agreement ID: 002034

  • Start date

    1 February 2004

  • End date

    31 January 2007

Funded under:

FP6-IST

  • Overall budget:

    € 3 632 544

  • EU contribution

    € 2 223 834

Coordinated by:

MULTITEL

Belgium