SPEECH DATABASES FOR CREATION OF VOICE DRIVEN TELESERVICES

Project Information

SPEECHDAT

Grant agreement ID: LE24001

Project closed

Start date 1 March 1996

End date 28 February 1998

Funded under

Specific programme of research and technological development and demonstration in the area of telematic applications of common interest, 1994-1998

Total cost

€ 3 299 000,00

EU contribution

€ 1 960 000,00

Coordinated by

Siemens AG
Germany

Objective

SPEECHDAT aims to provide basic speech resources to spur research and technological development in automated services, including mobile and fixed telephone communication. Such resources are especially needed to develop, train and test robust speech recognisers and speaker verification systems. The SPEECHDAT databases will cover all 11 official EU languages plus Norwegian, Slovenian and Welsh, addressing numerous application parameters, speaking styles and environmental influences. They will be geared to recognition tasks over fixed and mobile telephone networks, as well as to the training and testing of telephone-based verification systems.
Progress
During the first year of the project, all necessary specifications for speech data collection over fixed and mobile telephone networks, including speaker verification, were completed. Most of the hardware platforms (the telephone servers) were installed, and some were successfully validated using a collection from 10 sample speakers.
The project is now well prepared to start the actual collection of speech data. The databases will cover all eleven official languages of the European Union as well as Norwegian, Slovenian, Welsh, and specific variants of Dutch, French, German and Swedish. The databases will cover a wide range of applications (application-oriented words, phonetically rich sentences, spontaneous utterances), speaking styles (commands, carefully pronounced and spontaneous speech) and environmental influences (mobile and fixed telephone networks).
Three major types of database are to be created. For each official EU language, a 5,000-speaker database will be recorded over the fixed telephone network for training and testing speech recognisers. For selected languages, a 1,000-speaker database will be recorded over mobile telephone networks. For some languages, a speaker verification database will be built from multiple calls by a small number of speakers, for training and testing verification systems over the telephone network.
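The three database types can be summarised as a small data structure. The following is only an illustrative sketch: the class name `DatabaseType`, its field names and the helper `describe` are assumptions made for this example and do not come from the SpeechDat specifications. The speaker count for the verification database is left unset because the text gives no figure.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class DatabaseType:
    """Hypothetical record for one planned database type."""
    name: str                # illustrative label, not an official name
    network: str             # network the calls are recorded over
    speakers: Optional[int]  # None where the text states no number
    coverage: str            # which languages the type is planned for


# Values taken from the paragraph above; labels are illustrative.
DATABASE_TYPES = [
    DatabaseType("fixed", "fixed telephone network", 5000,
                 "each official EU language"),
    DatabaseType("mobile", "mobile telephone networks", 1000,
                 "selected languages"),
    DatabaseType("verification", "telephone network", None,
                 "some languages"),
]


def describe(db: DatabaseType) -> str:
    """Return a one-line summary of a database type."""
    size = f"{db.speakers} speakers" if db.speakers is not None else "small number of speakers"
    return f"{db.name}: {size}, {db.network}, {db.coverage}"


for db in DATABASE_TYPES:
    print(describe(db))
```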
These resources will be suitable for developing and training robust speech recognisers and for developing and testing robust speaker verification systems.
Establishing uniform, high-quality, tested platforms at each site represented a major effort. The detailed specification of the speech databases has been finished and agreed by all members of the consortium. This will ensure that the speech databases are useful for a wide range of mono- and multilingual applications and for other potential end-users. The consortium has established close co-operation with ELRA concerning the future promotion and distribution of the SpeechDat databases.
The Way Ahead
The consortium represents many major European players in the development of voice-driven teleservices. Its members will use the resulting databases to build their own teleservices.
The major steps in the next year concerning the creation of the speech databases will be:
- installation and validation of the remaining hardware platforms
- recording and annotation of the speech databases
- validation of the speech databases.
The SpeechDat database products, and demonstrators of teleservices based on SpeechDat, will be reported and exhibited at international conferences and trade shows such as EuroSpeech, ICASSP, Voice and CeBIT.

Fields of science (EuroSciVoc)

CORDIS classifies projects with EuroSciVoc, a multilingual taxonomy of fields of science, through a semi-automatic process based on NLP techniques. See: The European Science Vocabulary.

natural sciences > computer and information sciences > databases

Programme(s)

Multi-annual funding programmes that define the EU’s priorities for research and innovation.

FP4-TELEMATICS 2C - Specific programme of research and technological development and demonstration in the area of telematic applications of common interest, 1994-1998

Topic(s)

Calls for proposals are divided into topics. A topic defines a specific subject or area for which applicants can submit proposals. The description of a topic comprises its specific scope and the expected impact of the funded project.

D.12 - Language Engineering

Call for proposal

Procedure for inviting applicants to submit project proposals, with the aim of receiving EU funding.

Data not available

Funding Scheme

Funding scheme (or “Type of Action”) inside a programme with common features. It specifies: the scope of what is funded; the reimbursement rate; specific evaluation criteria to qualify for funding; and the use of simplified forms of costs like lump sums.

CSC - Cost-sharing contracts

Coordinator

Siemens AG

EU contribution

No data

Address

Otto-Hahn-Ring 6
81739 München
Germany

Total cost

No data

Participants (11)

British Telecom plc (BT)

United Kingdom

EU contribution

No data

Centro Studi e Laboratori Telecomunicazioni SpA (CSELT)

Italy

EU contribution

No data

Address

Via Guglielmo Reiss Romoli 274
10148 Torino

Total cost

No data

GPT Ltd

United Kingdom

EU contribution

No data

Address

New Century Park
CV3 1HJ Coventry

Total cost

No data

INSTITUT DALLE MOLLE D'INTELLIGENCE ARTIFICIELLE PERCEPTIVE

Switzerland

EU contribution

No data

Address

4,AVENUE DE SIMPLON
1920 MARTIGNY

Total cost

No data

KNOWLEDGE S.A.

Greece

EU contribution

No data

Address

37,N.E.O. ATHINON PATRON
264-41 PATRAS

Total cost

No data

LERNOUT & HAUSPIE SPEECH PRODUCTS NV

Belgium

EU contribution

No data

Address

7,SINT-KRIPIJNSTRAAT
8900 IEPER

Total cost

No data

MATRA COMMUNICATION SA

France

EU contribution

No data

Address

RUE JEAN-PIERRE TIMBAUD - MS 38
78392 BOIS D'ARCY

Total cost

No data

PHILIPS DIKTIERSYSTEME, ZWEIGNIEDERLASSUNG DER PHILIPS GMBH

Germany

EU contribution

No data

Address

205,MEIENDORFER STRASSE
22145 HAMBURG

Total cost

No data

PORTUGAL TELECOM S.A.

Portugal

EU contribution

No data

Address

40,AV. FONTES PEREIRA DE MELO
1089 LISBON

Total cost

No data

SPEECH PROCESSING EXPERTISE CENTRE

Netherlands

EU contribution

No data

Address

4,SINT PAULUSSTRAAT
2264 XZ LEIDSCHENDAM

Total cost

No data

Vocalis Ltd

United Kingdom

EU contribution

No data

Address

Chaston House Mill Court Great Shelford
CB2 5LD Cambridge

Total cost

No data
