Objective SPEECHDAT aims at providing basic speech resources to spur research and technological development in automated services, including mobile and fixed telephone communication. Such resources are especially required to develop, train and test robust speech recognisers and verification. The SPEECHDAT databases will cover all 11 EU official languages plus Norwegian, Slovenian, and Welsh, addressing numerous application parameters, speaking styles and environmental influences. They will be geared to recognition tasks over fixed and mobile telephone networks, as well as training and testing of telephone-based verification systems.Progress During the first year of the project all necessary specifications for the speech data collection over fixed and mobile telephone networks, including speaker verification, have been finished. Most of the hardware platforms, i.e. telephone servers, were installed, and some of them were successfully validated based on a collection from 10 sample speakers. The project is now well prepared to start the actual collection of speech data. The databases will cover all eleven official languages of the European Union as well as Norwegian, Slovenian, Welsh, and specific variants of Dutch, French, German and Swedish. The databases will cover a wide range of applications (application-oriented words, phonetically rich sentences, spontaneous utterances), speaking styles (commands, carefully pronounced and spontaneous speech) and environmental influences (mobile and fixed telephone networks). Three major types of database are to be created. For each official EU language a 5,000 speaker database, recorded over the fixed telephone network, for training and testing speech recognisers. For selected languages a 1,000 speaker database, recorded over mobile telephone networks will be built. A speaker verification database will be built for some languages, with multiple calls by a small number of speakers for training and testing of verification systems over the telephone network. These resources will be suitable for developing and training robust speech recognisers and developing and testing robust speech verification. Establishing uniform and high quality tested platforms at each site represented a major effort. The detailed specification of the speech databases has been finished and agreed by all members of the consortium. This will ensure the usefulness of the speech databases for a wide range mono- and multlingual applications as well as by other potential end-users. The consortium has established close co-operation with ELRA concerning the future promotion and distribution of the SpeechDat databases. The Way Ahead The consortium represents many major European players in the field of developing voice driven teleservices. They will actually use the created databases for building their own teleservices. The major steps in the next year concerning the creation of the speech databases will be: - installing and validation of the remaining hardware platforms - recording and annotation of the speech databases - validation of the speech databases. The SpeechDat database products and demonstrators of teleservices based on SpeechDat will be reported and exhibited at international conferences and trade shows, such as EuroSpeech, ICASSP, Voice and Cebit. Fields of science natural sciencescomputer and information sciencesdatabases Programme(s) FP4-TELEMATICS 2C - Specific programme of research and technological development and demonstration in the area of telematic applications of common interest, 1994-1998 Topic(s) D.12 - Language Engineering Call for proposal Data not available Funding Scheme CSC - Cost-sharing contracts Coordinator Siemens AG EU contribution No data Address Otto-Hahn-Ring 6 81739 München Germany See on map Total cost No data Participants (11) Sort alphabetically Sort by EU Contribution Expand all Collapse all British Telecom plc (BT) United Kingdom EU contribution No data Address British Telecom Centre 81 Newgate Street EC1A 7AJ London See on map Links Website Opens in new window Total cost No data Centro Studi e Laboratori Telecomunicazioni SpA (CSELT) Italy EU contribution No data Address Via Guglielmo Reiss Romoli 274 10148 Torino See on map Total cost No data GPT Ltd United Kingdom EU contribution No data Address New Century Park CV3 1HJ Coventry See on map Total cost No data INSTITUT DALLE MOLLE D'INTELLIGENCE ARTIFICIELLE PERCEPTIVE Switzerland EU contribution No data Address 4,AVENUE DE SIMPLON 1920 MARTIGNY See on map Total cost No data KNOWLEDGE S.A. Greece EU contribution No data Address 37,N.E.O. ATHINON PATRON 264-41 PATRAS See on map Total cost No data LERNOUT & HAUSPIE SPEECH PRODUCTS NV Belgium EU contribution No data Address 7,SINT-KRIPIJNSTRAAT 8900 IEPER See on map Total cost No data MATRA COMMUNICATION SA France EU contribution No data Address RUE JEAN-PIERRE TIMBAUD - MS 38 78392 BOIS D'ARCY See on map Total cost No data PHILIPS DIKTIERSYSTEME, ZWEIGNIEDERLASSUNG DER PHILIPS GMBH Germany EU contribution No data Address 205,MEIENDORFER STRASSE 22145 HAMBURG See on map Total cost No data PORTUGAL TELECOM S.A. Portugal EU contribution No data Address 40,AV. FONTES PEREIRA DE MELO 1089 LISBON See on map Total cost No data SPEECH PROCESSING EXPERTISE CENTRE Netherlands EU contribution No data Address 4,SINT PAULUSSTRAAT 2264 XZ LEIDSCHENDAM See on map Total cost No data Vocalis Ltd United Kingdom EU contribution No data Address Chaston House Mill Court Great Shelford CB2 5LD Cambridge See on map Total cost No data