The NESPOLE! Interlingua representation is a language-independent symbolic meaning representation framework, used as the centre-point of our speech translation approach. Translation in NESPOLE! is performed by analysing a source language sentence into the interlingua representation, and then generating a target language sentence from the interlingua.
The interlingua representation developed in the project is suited for task-oriented language in limited but large domains. This interlingua, called Interchange Format (IF), consists of four representational components: a speaker tag, a speech act, a list of concepts, and a list of argument-value pairs. The main principle underlying the design of the IF is that a task-oriented domain can be described by a reasonable number of domain actions. Domain actions are tasks such as giving information about the availability of a room, thanking, greeting, etc. In the IF, the domain action is represented by the combination of speech acts and concepts. We currently have a travel domain database of around 8000 IF-tagged sentences in German, Italian, English, Korean, and Japanese.
Around 800 different domain actions are used, with the 50 most common domain actions covering about 65% of the sentences. The NESPOLE! IF is currently in use in several other speech-translation projects, including C-STAR consortium partner groups in Japan, China and Korea. Extensions of the IF will be used in future speech translation projects. Of particular value is the IF database of sentences tagged with IF representations.