SHALLOW PARSING AND KNOWLEDGE EXTRACTION FOR LANGUAGE ENGINEERING

Projektinformationen

SPARKLE

ID Finanzhilfevereinbarung: LE12111

Projekt abgeschlossen

Startdatum 1 Dezember 1995

Enddatum 30 November 1997

Finanziert unter

Specific programme of research and technological development and demonstration in the area of telematic applications of common interest, 1994-1998

Gesamtkosten

€ 854 500,00

EU-Beitrag

€ 650 000,00

650 000,00

204 500,00

Koordiniert durch

Università degli Studi di Pisa
Italy

CORDIS bietet Links zu öffentlichen Ergebnissen und Veröffentlichungen von HORIZONT-Projekten.

Links zu Ergebnissen und Veröffentlichungen von RP7-Projekten sowie Links zu einigen Typen spezifischer Ergebnisse wie Datensätzen und Software werden dynamisch von OpenAIRE abgerufen.

Verwertbare Ergebnisse

SPARKLE provides advanced methods and tools for powerful, flexible and automatic acquisition oflexical information from text corpora. The tools fall into two categories: - robust, shallow parsers of unrestricted text, and- lexical acquisition systems, capable of learning (from pre-parsed texts) aspects of word knowledge needed for language Engineering applications. The tools are based on up-to-date, finite-state technology and were originally developed for statistical and inferential routines for efficiently resolving data deficiencies. The methods are applicable to any type of text and were tested with remarkable results in English, French, German and Italian. SPARKLE is able to acquire lexical information for verbs - probably the most elusive and challenging category for lexical analysis - as well as being the most important for Language Engineering applications such as machine translation, information retrieval and speech recognition. The variety of syntactic patterns typical for a verb is detected efficiently, and then statistically validated and automatically typed with respect to semantic preferences. SPARKLE technology has been used for intelligent cross-lingual text editing and translation filtering within multilingual information retrieval systems (Xerox, Sharp), and speech recognition systems (Daimler-Benz), and has demonstrated a steady improvement in performance. Acquired information was also used for automatic word sense disambiguation. Work in SPARKLE actively contributes to efficient development in: automatic parsing of unrestricted text, computational lexical databases, speech dialogue systems, cross-lingual information retrieval, exchange and filtering. Project URL: http://www.ilc.pi.cnr.it/sparkle.html

Suche nach OpenAIRE-Daten ...

Verwertbare Ergebnisse

Herunterladen Den Inhalt der Seite herunterladen