Adverse-Environment Recognition of Speech

Objective

The objective of ARS was to develop improved algorithms for medium-size vocabulary speaker-dependent speech recognition in the presence of noise, and to build a real-time demonstrator. The demonstrator was to incorporate an isolated word noise-robust recogniser, verify algorithm performance, and address the problem of speech-based person-machine dialogue as a system interface in practical applications. The application environment chosen was the car.
The aim of the project is to extend the state of the art in speech recognition and to place this innovative technology in adverse environments such as car and factory floor. Starting from an established base of expertise, this project involves theoretical work on algorithms and the development of hardware prototypes. To get the best recognition performance, algorithms covering the different aspects of signal processing were considered. The activities were subdivided into 6 work packages concerning respectively system definition and standards, transducers and noise reduction, feature extraction, pattern processing, human factors and user interface, system prototyping and evaluation. After a brief presentation of the general structure of the project (objectives, organisation, participation, resources), this paper presents the state of the work after two years.

The objective of adverse environment recognition of speech (ARS) project was to develop improved algorithms for speech recognition in the presence of noise and to build a real time demonstrator. The demonstrator was to incorporate an isolated word noise robust recognizer, verify algorithm performance, and address the problem of speech based person machine dialogue as a system interface in practical applications.

The application environment chosen was the car. The system has a 100 word vocabulary, chosen by each national group of partners and tailored to the specific application environment. Advances were made in:
reduction, by signal preprocessing, of the effects of noise on speech signals;
feature extraction, to improve noise robustness;
study and refinement of algorithms for speech pattern matching in noisy environments;
speaker adaptation;
dynamic system adjustment to user feedback and the development of error correction strategies in the human interface;
development of system prototypes (hardware and firmware) for real time speech recognition.

The real time demonstrator was based on a general purpose digital signal processing (DSP) chip attached to a personal computer or a stand alone system. A multilingual database collected in noisy environments was made available and used for the evaluation of baseline systems. These were realized according to a common standard suitable for exchanging the software modules of the algorithms. Various algorithms were developed and evaluated and a set of algorithms for the final prototype were selected. A human machine interface concept was defined and the porting of the various models to the target system hardware was initiated.
The complete chain of processing has been initiated on a real time hardware; 2 demonstrators have been installed inside cars for assessment of their performance in real operating conditions.
The requirements included a 100-word vocabulary, chosen for each language group of partners and tailored to the specific application environment. Advances were needed in terms of:

- reduction, by signal preprocessing, of the effects of noise on speech signals
- feature extraction, to improve noise robustness
- study and refinement of algorithms for speech pattern matching in noisy environments
- speaker adaptation
- dynamic system adjustment to user feedback and the development of error correction strategies in the human interface
- development of system prototypes (hardware and firmware) for real-time speech recognition.

The system would be integrated in a real-time demonstrator based on a general-purpose DSP chip attached to a personal computer on a stand-alone system. Performance evaluations were first scheduled in the laboratory, using databases collected in noisy environments, to evaluate the resulting rate of correct recognition. Performance under field conditions were then to be assessed from a prototype fitted in a car and a laboratory system installed in a factory.

Fields of science (EuroSciVoc)

CORDIS classifies projects with EuroSciVoc, a multilingual taxonomy of fields of science, through a semi-automatic process based on NLP techniques. See: The European Science Vocabulary.

Programme(s)

Multi-annual funding programmes that define the EU’s priorities for research and innovation.

FP2-ESPRIT 2 - European strategic programme (EEC) for research and development in information technologies (ESPRIT), 1987-1992

Topic(s)

Calls for proposals are divided into topics. A topic defines a specific subject or area for which applicants can submit proposals. The description of a topic comprises its specific scope and the expected impact of the funded project.

Data not available

Call for proposal

Procedure for inviting applicants to submit project proposals, with the aim of receiving EU funding.

Data not available

Funding Scheme

Funding scheme (or “Type of Action”) inside a programme with common features. It specifies: the scope of what is funded; the reimbursement rate; specific evaluation criteria to qualify for funding; and the use of simplified forms of costs like lump sums.

Data not available

Coordinator

Centro Studi e Laboratori Telecomunicazioni SpA

EU contribution

No data

Address

Via G. Reiss Romoli, 274
10148 TORINO
Italy

Links

Contact the organisation Website

HORIZON collaboration network

Total cost

No data

Participants (9)

Logica Ltd

United Kingdom

EU contribution

No data

Address

64-68 Newman Street
W1A 4SE London

Total cost

No data

Logica Ltd

United Kingdom

EU contribution

No data

Address

Cobham Park Downside Road
KT11 3LX Cobham

Total cost

No data

Matra Communication

France

EU contribution

No data

Address

Rue Jean-Pierre Timbaud
78392 Bois d'Arcy

Total cost

No data

PAGE IBERICA

Spain

EU contribution

No data

Address

LAGASCA 88
28001 MADRID

Total cost

No data

POLITECNICO DI TORINO

Italy

EU contribution

No data

Address

CORSO DUCA DEGLI ABRUZZI 24
10129 TORINO

Total cost

No data

TELECOM PARIS

France

EU contribution

No data

Address

46 RUE BARRAULT
75634 PARIS

Total cost

No data

UNIVERSITAT POLITECNICA DE MADRID

Spain

EU contribution

No data

Address

CAMPUS DE MONTEGANCEDO
28660 MADRID

Total cost

No data

University of Cambridge

United Kingdom

EU contribution

No data

Address

Trumpington Street
CB2 1PZ Cambridge

Total cost

No data

University of Keele

United Kingdom

EU contribution

No data

Address

ST5 5BG Keele

Total cost

No data

Objective

Fields of science (EuroSciVoc)

CORDIS classifies projects with EuroSciVoc, a multilingual taxonomy of fields of science, through a semi-automatic process based on NLP techniques. See: The European Science Vocabulary.

Programme(s)

Multi-annual funding programmes that define the EU’s priorities for research and innovation.

Topic(s)

Calls for proposals are divided into topics. A topic defines a specific subject or area for which applicants can submit proposals. The description of a topic comprises its specific scope and the expected impact of the funded project.

Call for proposal

Procedure for inviting applicants to submit project proposals, with the aim of receiving EU funding.

Funding Scheme

Funding scheme (or “Type of Action”) inside a programme with common features. It specifies: the scope of what is funded; the reimbursement rate; specific evaluation criteria to qualify for funding; and the use of simplified forms of costs like lump sums.

Coordinator

Participants (9)

Share this page Share this page on social networks

Download PDF Download the content of the page

Adverse-Environment Recognition of Speech

Objective

Fields of science (EuroSciVoc) CORDIS classifies projects with EuroSciVoc, a multilingual taxonomy of fields of science, through a semi-automatic process based on NLP techniques. See: The European Science Vocabulary.

Programme(s) Multi-annual funding programmes that define the EU’s priorities for research and innovation.

Topic(s) Calls for proposals are divided into topics. A topic defines a specific subject or area for which applicants can submit proposals. The description of a topic comprises its specific scope and the expected impact of the funded project.

Call for proposal Procedure for inviting applicants to submit project proposals, with the aim of receiving EU funding.

Funding Scheme Funding scheme (or “Type of Action”) inside a programme with common features. It specifies: the scope of what is funded; the reimbursement rate; specific evaluation criteria to qualify for funding; and the use of simplified forms of costs like lump sums.

Coordinator

Participants (9)

Share this page Share this page on social networks

Download PDF Download the content of the page

Fields of science (EuroSciVoc)

CORDIS classifies projects with EuroSciVoc, a multilingual taxonomy of fields of science, through a semi-automatic process based on NLP techniques. See: The European Science Vocabulary.

Programme(s)

Multi-annual funding programmes that define the EU’s priorities for research and innovation.

Topic(s)

Calls for proposals are divided into topics. A topic defines a specific subject or area for which applicants can submit proposals. The description of a topic comprises its specific scope and the expected impact of the funded project.

Call for proposal

Procedure for inviting applicants to submit project proposals, with the aim of receiving EU funding.

Funding Scheme

Funding scheme (or “Type of Action”) inside a programme with common features. It specifies: the scope of what is funded; the reimbursement rate; specific evaluation criteria to qualify for funding; and the use of simplified forms of costs like lump sums.