Project description

Language-based interaction

TTC aims at automatically generating bilingual terminologies from comparable corpora in seven languages: English, French, German, Spanish, Latvian, Chinese and Russian.

The TTC project (Terminology Extraction, Translation Tools and Comparable Corpora) aims to leverage machine translation tools (MT tools), computer-assisted translation tools (CAT tools) and multilingual content management tools by automatically generating bilingual terminologies from comparable corpora in several European languages (i.e. English, French, German and Latvian), as well as in Chinese and Russian.

Comparable corpora are sets of texts that belong to the same domain but are not necessarily translations of each other.

The main steps for automatically generating bilingual terminologies are the automatic extraction of monolingual terminologies and the bilingual alignment of the extracted terminologies. The terminologies will include single-word terms (SWT) and multi-word terms (MWT), as well as their variations.

The TTC project will develop generic methods and tools for the automatic extraction of terminologies, together with alignment algorithms that include adaptors to domains and languages, in order to break the lexical acquisition bottleneck in both statistical and rule-based machine translation. Alignment will be based on several strategies: lexical strategies (use of compositional methods and of an interlingua representation), contextual strategies (use of cognates, context vectors and labelled links) and corpus strategies (improvement of available corpora, for instance by topical web crawling). The methods developed will require as little prior linguistic knowledge as possible, so as to reduce the gaps in language coverage.

It will also develop or adapt tools for gathering and managing these comparable corpora and for managing terminologies.
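As an illustration of the contextual alignment strategy named above, the following is a minimal sketch of context-vector alignment as it is commonly described in the literature, not the TTC project's own implementation. It assumes a small seed bilingual dictionary and tokenised corpora; all function names, the window size and the toy data are hypothetical.

```python
from collections import Counter
from math import sqrt


def context_vector(tokens, term, window=2):
    """Count the words that co-occur with `term` within +/- `window` tokens."""
    vec = Counter()
    for i, tok in enumerate(tokens):
        if tok == term:
            lo, hi = max(0, i - window), min(len(tokens), i + window + 1)
            for j in range(lo, hi):
                if j != i:
                    vec[tokens[j]] += 1
    return vec


def translate_vector(vec, seed_dict):
    """Map source-language dimensions to the target language; drop unknown words."""
    out = Counter()
    for word, count in vec.items():
        if word in seed_dict:
            out[seed_dict[word]] += count
    return out


def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[k] * b[k] for k in a if k in b)
    na = sqrt(sum(v * v for v in a.values()))
    nb = sqrt(sum(v * v for v in b.values()))
    return dot / (na * nb) if na and nb else 0.0


def rank_candidates(src_tokens, src_term, tgt_tokens, tgt_terms, seed_dict, window=2):
    """Rank target terms by similarity of their contexts to the source term's context."""
    src_vec = translate_vector(context_vector(src_tokens, src_term, window), seed_dict)
    scored = [(cosine(src_vec, context_vector(tgt_tokens, t, window)), t) for t in tgt_terms]
    return sorted(scored, reverse=True)


# Toy comparable corpora (hypothetical data, not from the project).
en_tokens = "the turbine converts wind into energy".split()
fr_tokens = "la éolienne convertit vent en énergie et le mât supporte la nacelle".split()
seed_dict = {"converts": "convertit", "wind": "vent", "energy": "énergie"}

ranked = rank_candidates(en_tokens, "turbine", fr_tokens, ["éolienne", "mât"], seed_dict)
print(ranked[0][1])  # best-scoring French candidate for "turbine"
```

The sketch relies only on co-occurrence counts and a seed dictionary, which is in line with the stated goal of requiring little prior linguistic knowledge; a realistic system would also normalise the counts (for example with an association measure) and handle multi-word terms.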
In particular, a topical web crawler and an open terminology platform will be developed. This open terminology platform will support tasks such as terminology storage, search, editing and export.

The TTC project will integrate developed and existing tools in an online platform, which will be based on Web Services and will use reputable open solutions such as UIMA (Unstructured Information Management Architecture) and EuroTermBank. The existing tools to be integrated in the platform are already developed GPL term extraction tools, a framework for contextual analysis, and TreeTagger versions, tokenisers and POS taggers for several languages. The platform will allow users to:
- create thematic corpora from a few clues (such as terms or documents on a specific domain);
- extract monolingual terminology from such corpora;
- create a comparable corpus in a target language from a corpus in a source language;
- align bilingual terminologies;
- choose the tools to apply for terminology extraction;
- expand a given corpus;
- export monolingual or bilingual terminologies so that they can easily be used in automatic and semi-automatic translation tools.

Field of science: humanities > languages and literature > general language studies
Programme(s): FP7-ICT - Specific Programme "Cooperation": Information and communication technologies
Topic(s): ICT-2009.2.2 - Language-based interaction
Call for proposals: FP7-ICT-2009-4
Funding scheme: CP - Collaborative project (generic)

Coordinator

UNIVERSITE DE NANTES, France
Coordinator contact: Béatrice Daille (Mrs.)
EU contribution: € 460,966.00
Address: QUAI DE TOURVILLE 1, 44035 NANTES CEDEX 1, France
Region: Pays de la Loire > Pays de la Loire > Loire-Atlantique
Activity type: Higher or Secondary Education Establishments
Administrative contact: Pauline BOUDANT (Ms.)

Participants (6)

UNIVERSITY OF STUTTGART, Germany
EU contribution: € 372,430.00
Address: KEPLERSTRASSE 7, 70174 Stuttgart
Region: Baden-Württemberg > Stuttgart > Stuttgart, Stadtkreis
Activity type: Higher or Secondary Education Establishments
Administrative contact: Ulrich Heid (Dr.)

SYLLABS SARL, France
EU contribution: € 390,241.00
Address: RUE JEAN BAPTISTE BERLIER - PEPINIERE MASSENA, 75013 PARIS 13
Activity type: Private for-profit entities (excluding Higher or Secondary Education Establishments)
Administrative contact: Helena Blancafort (Ms.)

EURINNOV SARL, France
EU contribution: € 76,260.00
Address: Rue Jean Goujon, 75008 Paris
Activity type: Private for-profit entities (excluding Higher or Secondary Education Establishments)
Administrative contact: Matthieu Rolland (Mr.)

SOGITEC INDUSTRIES SA, France
EU contribution: € 100,320.00
Address: Rue Marcel Monge, 92158 Suresnes
Activity type: Private for-profit entities (excluding Higher or Secondary Education Establishments)
Administrative contact: Claude Méchoulam (Mr.)

TILDE SIA, Latvia
EU contribution: € 260,160.00
Address: VIENIBAS GATVE 75 A, LV-1004 Riga
Region: Latvija > Latvija > Rīga
Activity type: Private for-profit entities (excluding Higher or Secondary Education Establishments)
Administrative contact: Aivars Berzins (Mr.)

UNIVERSITY OF LEEDS, United Kingdom
EU contribution: € 364,623.00
Address: WOODHOUSE LANE, LS2 9JT Leeds
Region: Yorkshire and the Humber > West Yorkshire > Leeds
Activity type: Higher or Secondary Education Establishments
Administrative contact: Serge Sharoff (Dr.)