Community Research and Development Information Service - CORDIS


MULTISENSE Report Summary

Project ID: IST-2001-34121
Funded under: FP5-IST
Country: Sweden

Improved speech technology

A framework for the rapid design of speech interfaces for controlling visualization software has been developed. The framework is compatible with the low-level speech technology components that constitute pre-existing know-how of KTH, but it does not explicitly depend on them.

In particular, any speech recogniser that supports context-free grammars and any speech synthesiser that supports SAPI can be connected with some effort. The speech interface is developed as a module compatible with MAF, so exploitation will take place within the framework of MAF exploitation.
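As an illustration of what a context-free command grammar for a visualization tool might look like, the following is a minimal sketch only. The grammar, its vocabulary, and the helper names `expand` and `accepts` are hypothetical and are not taken from the MULTISENSE framework; a real recogniser would load a grammar in its own format rather than enumerate sentences.

```python
from itertools import product

# Hypothetical command grammar. Keys are nonterminals; anything that is
# not a key is treated as a terminal word.
GRAMMAR = {
    "COMMAND": [["ACTION", "OBJECT"], ["ACTION", "OBJECT", "DIRECTION"]],
    "ACTION": [["rotate"], ["zoom"], ["show"]],
    "OBJECT": [["model"], ["slice"]],
    "DIRECTION": [["left"], ["right"]],
}


def expand(symbol):
    """Return every word sequence derivable from symbol.

    Assumes the grammar is finite and non-recursive, as above.
    """
    if symbol not in GRAMMAR:
        return [[symbol]]
    sentences = []
    for rule in GRAMMAR[symbol]:
        parts = [expand(s) for s in rule]
        for combo in product(*parts):
            sentences.append([word for part in combo for word in part])
    return sentences


def accepts(utterance):
    """Check whether a recognised word string is a valid command."""
    return utterance.lower().split() in expand("COMMAND")
```

Under these assumptions, `accepts("rotate model left")` holds while `accepts("left rotate model")` does not, which is the kind of constraint a grammar-based recogniser uses to restrict its search.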

An additional result is a speech utterance detector capable of determining the start and end points in time of a spoken utterance. Such a detector is often used as a pre-processor to a speech recogniser, whose task is to determine the textual content of the utterance. The result is a new algorithm for speech utterance detection.
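The summary does not describe the new algorithm itself. To show what an utterance detector does in general, here is a minimal sketch of a classic energy-threshold endpoint detector, which is a swapped-in textbook technique and not the project's method. The function names, frame length, threshold, and minimum run length are all assumptions chosen for illustration.

```python
import math


def frame_energies(samples, frame_len):
    """Mean squared amplitude of each non-overlapping frame."""
    return [
        sum(x * x for x in samples[i:i + frame_len]) / frame_len
        for i in range(0, len(samples) - frame_len + 1, frame_len)
    ]


def detect_utterance(samples, sample_rate, frame_ms=10, threshold=0.01,
                     min_frames=3):
    """Return (start_s, end_s) of the detected utterance, or None.

    A frame is 'active' when its energy exceeds the threshold. The start is
    the first run of min_frames active frames and the end is the last such
    run, so isolated loud frames are ignored. All defaults are assumptions.
    """
    frame_len = int(sample_rate * frame_ms / 1000)
    active = [e > threshold for e in frame_energies(samples, frame_len)]
    runs = [
        i for i in range(len(active) - min_frames + 1)
        if all(active[i:i + min_frames])
    ]
    if not runs:
        return None
    start_frame = runs[0]
    end_frame = runs[-1] + min_frames
    return (start_frame * frame_len / sample_rate,
            end_frame * frame_len / sample_rate)
```

A detector of this kind would pass only the samples between the returned start and end times on to the recogniser. A fixed energy threshold is fragile in noisy conditions, which is one reason more robust detection algorithms are of interest.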

Related information


Daniel NEIBERG, (Research Engineer)
Tel.: +46-8-7907567
Fax: +46-8-7907854