Interactive corpus-based translation drafting tool

Informations projet

TRANSLEARN

N° de convention de subvention: LRE61016

Projet clôturé

Date de début 1 Janvier 1993

Date de fin 1 Juillet 1995

Financé au titre de

Specific programme of research and technological development (EEC) in the field of telematic systems in areas of general interest - Linguistic research and engineering -, 1990-1994

Coût total

Aucune donnée

Contribution de l’UE

Aucune donnée

Coordonné par

Institute for Language and Speech Processing (ILSP)
Greece

CORDIS fournit des liens vers les livrables publics et les publications des projets HORIZON.

Les liens vers les livrables et les publications des projets du 7e PC, ainsi que les liens vers certains types de résultats spécifiques tels que les jeux de données et les logiciels, sont récupérés dynamiquement sur OpenAIRE .

Résultats exploitables

Translation work is very frequently characterized by two parameters: repetition and high demand on quality. This is particularly true for translation of technical and administrative documentation. The project tackles this problem by providing a computational environment, in more practical terms a toolbox that will: rid translators of the repetitive part of their work by reusing existing human translations and learning from them; enhance quality and consistency of translation by being able to integrate ancillary translation tools. Parallel texts of about 5.5 Mwords, in English, French, Portuguese and Greek have been processed and a large portion has been normalized, lemmatized, tagged and aligned at sentence level. Experiments for alignment below the level of sentence have been made yielding promising results. Two matching algorithms requiring shallow linguistic processing, have been implemented in the system's text matching tool, computing perfect and fuzzy matches between compared sentences. Fuzzily matching translations are post-edited and stored for future use, enabling the system to learn new translations. TRANSLEARN succeeded in combining numerical/statistical and symbolic/knowledge-based approaches to natural language processing (NLP), which are often regarded as mutually incompatible. The prototype software package produced is a powerful tool for pattern-matching and other intelligent applications. TRANSLEARN is a stand-alone utility or an integral part of workbench of wider scope.

Recherche de données OpenAIRE...

Résultats exploitables

Partager cette page Partager cette page sur les réseaux sociaux

Télécharger Télécharger le contenu de la page