CORDIS - EU research results
Content archived on 2024-05-28

Pattern REcognition-based Statistically Enhanced MT

Project description


Language-based interaction

This proposal describes PRESEMT, a flexible and adaptable machine translation (MT) system based on a language-independent method whose principles ensure easy portability to new language pairs. The method attempts to overcome well-known problems of other MT approaches, e.g. the compilation of bilingual corpora or the creation of new rules for each language pair. PRESEMT will address the issue of effectively managing multilingual content and is expected to suggest a language-independent, machine-learning-based methodology. The key aspects of PRESEMT involve syntactic phrase-based modelling; pattern recognition approaches (such as extended clustering or neural networks) or game theory techniques for the development of a language-independent analysis; and evolutionary algorithms for system optimisation. The system is intended to be of a hybrid nature, combining linguistic processing with the positive aspects of corpus-driven approaches such as statistical MT (SMT) and example-based MT (EBMT).

In order for PRESEMT to be easily amenable to new language pairs, relatively inexpensive, readily available language resources as well as bilingual lexica will be used. The translation context will be modelled on phrases, as they have been shown to improve translation quality. Phrases will be produced via a semi-automatic and language-independent process of morphological and syntactic analysis, removing the need for compatible NLP tools per language pair. Parallelisation of the main translation processes will be investigated in order to achieve a fast, high-quality translation system. Furthermore, the optimisation and personalisation of the system parameters via automated processes (such as genetic algorithms (GAs) or swarm intelligence) will be studied. To allow for user adaptability, all the corpora used in PRESEMT will be retrieved from web-based sources via the system platform, while user feedback will be integrated through appropriate interactive interfaces. PRESEMT is expected to be easily customisable to both new language pairs and specific sublanguages.

Call for proposal

FP7-ICT-2009-4

Funding scheme

CP - Collaborative project (generic)

Coordinator

ATHINA-EREVNITIKO KENTRO KAINOTOMIAS STIS TECHNOLOGIES TIS PLIROFORIAS, TON EPIKOINONION KAI TIS GNOSIS
EU contribution
€ 738 530,00
Address
ARTEMIDOS 6 KAI EPIDAVROU
151 25 Maroussi
Greece


Region
Αττική > Αττική > Βόρειος Τομέας Αθηνών
Activity type
Research Organisations
Administrative contact
GEORGE TAMBOURATZIS (Dr.)
Total cost
No data

Participants (5)