Descripción del proyecto
Mejora de la traducción automática neuronal de lenguas con pocos recursos
En un mundo en el que la información precisa y oportuna se ha convertido en un imperativo, los periodistas necesitan contar permanentemente con las herramientas adecuadas para una traducción precisa y rápida de lenguas con muy pocos recursos. A pesar de que la tecnología de traducción automática neuronal avanza rápidamente, todavía no ha logrado ofrecer traducciones útiles de la mayoría de pares de lenguas del mundo debido a la falta de datos y corpus paralelos. El proyecto financiado con fondos europeos GoURMET tiene como objetivo mejorar la solidez y aplicabilidad de la traducción automática neuronal de pares de lenguas y ámbitos con pocos recursos. El proyecto se centrará en la creación de contenidos globales proporcionando traducciones automáticas para ser corregidas por humanos, así como en el seguimiento de medios informativos internacionales de pares de lenguas con pocos recursos.
Objetivo
Machine translation (MT) is an increasingly important technology for supporting communication in a globalised world. MT technology has gradually increased over the last ten years, but recent advances in neural machine translation (NMT), have resulted in significant interest in industry and have lead to very rapid adoption of the new paradigm (eg. Google, Facebook, UN, World International Patent Office). Although these models have shown significant advances in state-of-the-art performance they are data intensive and require parallel corpora of many millions of human translated sentences for training. Neural Machine translation is currently not able to deliver usable translations for the vast majority of language pairs in the world. This is especially problematic for our user partners, the BBC and DW who need access to fast and accurate translation for languages with very few resources.
The aim of GoURMET is to significantly improve the robustness and applicability of neural machine translation for low-resource language pairs and domains.
GoURMET has five objectives:
- Development of a high-quality machine translation for under-resourced language pairs and domains;
- Adaptable to new and emerging languages and domains;
- Development of tools for analysts and journalists;
- Sustainable, maintainable platform and services;
- Dissemination and communication of project results to stakeholders and user group.
The project will focus on two use cases:
- Global content creation - managing content creation in several languages efficiently by providing machine translations for correction by humans;
- Media monitoring for low resource language pairs - tools to address the challenge of international news monitoring problem.
The outputs of the project will be field-tested at partners BBC and DW, and the platform will be further validated through innovation intensives such as the BBC NewsHack.
Ámbito científico
Palabras clave
Programa(s)
Convocatoria de propuestas
Consulte otros proyectos de esta convocatoriaConvocatoria de subcontratación
H2020-ICT-2018-2
Régimen de financiación
RIA - Research and Innovation actionCoordinador
EH8 9YL Edinburgh
Reino Unido