Community Research and Development Information Service - CORDIS

LS-GRAM: large-scale grammars for European languages

The project addresses the need for high-quality, large-scale grammatical resources as a basis for natural language (NL) technology. Commercial resources so far often lack properties such as stability, extensibility, modularity, and quality. Unification-based grammars could fill this gap, but so far they have been limited to academia and not of much practical relevance, as they are too inefficient to be practically applicable. LS-GRAM narrows down the gap between the scientific concept of unification grammar and real industrial applications. To overcome the inefficiency of unification grammars the project uses a 'lean' formalism included in the ALEP system. The basic idea is that the grammar developer sacrifices some formal constructs and in order to gain the efficiency he needs for developing realistic grammars. Essentially, the formalism provides constructs which are directly compilable into Prolog, making use of efficient term unification. Results achieved so far consist of very carefully documented grammars for 3 languages (Spanish, English and German), grammar design for 6 languages (Greek, Danish, Dutch, French, Italian and Portuguese), 9 corpus investigations, specifications for semantic output structures, and an ALEP rule coding manual. Also nearly a dozen publications have resulted from the project.

Contact

Paul SCHMIDT, (Senior Researcher)
Tel.: +49-681-39313
Fax: +49-681-397482
E-mail
Follow us on: RSS Facebook Twitter YouTube Managed by the EU Publications Office Top