Servicio de Información Comunitario sobre Investigación y Desarrollo - CORDIS

Cracking the Language Barrier: Coordination, Evaluation and Resources for European MT Research

Desde 2015-01-01 hasta 2017-12-31, proyecto en curso

Detalles del proyecto

Coste total:

EUR 999 995

Aportación de la UE:

EUR 999 995

Convocatoria de propuestas:

H2020-ICT-2014-1See other projects for this call

Régimen de financiación:

CSA - Coordination and support action

Objetivo

Deliverables

  • Report on IWSLT 2015

    Analysis of the 2015 shared task results, together with the training and test materials which will be uploaded to META-SHARE.

  • New version of META-SHARE software

    This deliverable will provide an extended and improved version of the META-SHARE platform with a streamlined and adapted data model as well as an adapted licensing toolkit.

  • Kick-off meeting of the ICT-17 group of funded projects

    Organisation of a kick-off meeting of the whole ICT-17 pillar of funded projects, i.e., the big research project to be funded under ICT-17a and the innovation pilots to be funded under ICT-17b.

  • Website for the QT initiative

    The web portal for the QT initiative (group of ICT17 projects and other projects) will be the public face of the QT initiative which CRACKER will help to coordinate and to build a community around.

  • Project infrastructure (final version)

    Completed version of project infrastructure (especially the website), which is specifically geared towards communication and dissemination purposes.

  • Project infrastructure (initial version)

    Initial version of project infrastructure for the coordination of the project, including email lists, project website, intranet, etc.

  • Report on QT Marathon 2015

    This report will provide data about the QT Marathon 2015 (number of participants, overview of talks etc.) and a summary of the research projects. In addition, final presentations of the project results will be uploaded to the web.

  • Report on META-FORUM 2015

    Organisation of the conference META-FORUM 2015 with the help of a subcontractor; currently foreseen as organiser and location is the LSP Tilde in Riga, Latvia.

  • Report on META-FORUM 2016

    Organisation of the conference META-FORUM 2016 with the help of a subcontractor; currently foreseen as organiser and location is the University of Lisbon in Lisbon, Portugal.

  • Survey on the state of HQMT in industry and LSPs

    This report summaries the results of the survey on the economic impact and uptake of recent EC-funded MT actions, especially with regard to industry and language service providers (LSPs). It is foreseen to subcontract TAUS and GALA.

  • Coordination with and support of MLi

    This report will provide a summary of the support of and collaboration with MLi on resource infrastructures in relation to their deployment for building and offering multilingual digital services.

  • Data Management Plan

    The CRACKER Data Management Plan will provide the data management policy of the project with regard to the produced data sets, containing, among others, information on standards and metadata used as well as on sharing, archiving and preservation.

  • Data Management Plan (Update)

    The CRACKER Data Management Plan will provide the data management policy of the project with regard to the produced data sets, containing, among others, information on standards and metadata used as well as on sharing, archiving and preservation.

  • Coordination with and support of LIDER

    This deliverable will report on the support and coordination activities with LIDER in rendering the META-SHARE data model in RDF following the recommendations of the W3C.

  • Strategic Research and Innovation Agenda for the LT/MT field

    A joint deliverable with LT_Observatory.

  • Position Paper and preliminary joint Strategic Research and Innovation Agenda for the LT/MT field

    Position Paper prepared jointly and endorsed by CRACKER and LT_Observatory underpinned by a preliminary version of a joint Strategic Research and Innovation Agenda for the LT/MT field (SRIA). The form and content of these documents will be agreed between the CRACKER and LT_Observatory projects.

Publications

  • One Ontology to Bind Them All: The META-SHARE OWL Ontology for the Interoperability of Linguistic Datasets on the Web
    Author(s): McCrae, John P., P. Labropoulou, J. Gracia, M. Villegas, V. Rodriguez-Doncel & P. Cimiano
    Published in: The Semantic Web: ESWC 2015 Satellite Events, 2015. 
  • Fostering the Next Generation of European Language Technology: Recent Developments ― Emerging Initiatives ― Challenges and Opportunities
    Author(s): Georg Rehm; Jan Hajic; Josef van Genabith; Andrejs Vasiljevs;
    Published in: Proceedings of the Tenth International Conference on Language Resources and Evaluation,, 2016. 
  • Cracking The Language Barrier For A Multilingual Europe
    Author(s): Georg Rehm
    Published in: Language Use In Public Administration. Contributions To The Annual Conference 2015 Of EFNIL In Helsinki, 2016. 
  • The Language Resource Life Cycle: Towards a Generic Model for Creating, Maintaining, Using and Distributing Language Resources
    Author(s): Georg Rehm
    Published in: Proceedings of International Conference on Language Resources and Evaluation (LREC 2016), 2015. 
  • Enhancing Cross-border EU E-commerce through Machine Translation: Needed Language Resources, Challenges and Opportunities
    Author(s): Fernández-Barrera, M., V. Popescu, A. Toral, F. Gaspari and K. Choukri (2016)
    Published in: Proceedings of the 10th Language Resources and Evaluation Conference (LREC), 2016. Page(s) 4550-4556. 
  • Investigating Continuous Space Language Models for Machine Translation Quality Estimation
    Author(s): Kashif Shah, Raymond W.M. Ng, Fethi Bougares, and Lucia Specia
    Published in: Proceedings of Conference on Empirical Methods in Natural Language Processing, 2015. 
  • SHEF-Multimodal: Grounding Machine Translation on Images
    Author(s): Kashif Shah, Josiah Wang, and Lucia Specia
    Published in: Proceedings of First Conference on Machine Translation (WMT16), Volume 2: Shared Task Papers, 2016. 
  • Neural versus Phrase-Based Machine Translation Quality: a Case Study
    Author(s): L. Bentivogli, A. Bisazza, M. Cettolo, M. Federico
    Published in: Proceedings of Conference on Empirical Methods in Natural Language Processing (EMNLP), 2016. 
  • Large-scale Multitask Learning for Machine Translation Quality Estimation
    Author(s): Kashif Shah and Lucia Specia
    Published in: Proceedings of Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2016. 
  • Technology Landscape for Quality Evaluation: Combining the Needs of Research and Industry
    Author(s): Kim Harris; Aljoscha Burchardt; Georg Rehm; Lucia Specia
    Published in: Proceedings of the LREC 2016 Workshop “Translation Evaluation: From Fragmented Tools and Data Sets to an Integrated Ecosystem”, 2016. Page(s) 50-54. 
  • Word embeddings and discourse information for Quality Estimation
    Author(s): Carolina Scarton, Daniel Beck, Kashif Shah, Karin Sim Smith, and Lucia Specia.
    Published in: Proceedings of First Conference on Machine Translation (WMT16), Volume 2: Shared Task Papers, 2016. 
  • Translation Evaluation: From Fragmented Tools and Data Sets to an Integrated Ecosystem
    Author(s): Georg Rehm; Aljoscha Burchardt; Ondrej Bojar; Christian Dugast; Marcello Federico; Josef van Genabith; Barry Haddow; Jan Hajic; Kim Harris; Philipp Koehn; Matteo Negri; Martin Popel; Lucia Specia; Marco Turchi; Hans Uszkoreit (eds.)
    Published in: Proceedings of the LREC 2016 Workshop “Translation Evaluation: From Fragmented Tools and Data Sets to an Integrated Ecosystem”, 2016. 
  • Findings of the 2016 Conference on Machine Translation
    Author(s): Bojar, Ondrej and Chatterjee, Rajen and Federmann, Christian and Graham, Yvette and Haddow, Barry and Huck, Matthias and Jimeno Yepes, Antonio and Koehn, Philipp and Logacheva, Varvara and Monz, Christof and Negri, Matteo and Neveol, Aurelie and Neves, Mariana and Popel, Martin and Post, Matt and Rubino, Raphael and Scarton, Carolina and Specia, Lucia and Turchi, Marco and Verspoor, Karin and Zampieri, Marcos
    Published in: Proceedings of the First Conference on Machine Translation, 2016. 
  • Pronoun-Focused MT and Cross-Lingual Pronoun Prediction: Findings of the 2015 DiscoMT Shared Task on Pronoun Translation
    Author(s): Christian Hardmeier, Preslav Nakov, Sara Stymne, Jörg Tiedemann, Yannick Versley, Mauro Cettolo
    Published in: In Proceedings of the EMNLP Second Workshop on Discourse in Machine Translation (DiscoMT), 2015. Page(s) pages 1–16. 
  • Digital Representation of Rights for Language Resources
    Author(s): Rodriguez-Doncel, V. and P. Labropoulou
    Published in: Proceedings of the 4th Workshop on Linked Data in Linguistics: Resources and Applications, ACL-IJCNLP, 2015. 
  • SHEF-LIUM-NN: Sentence level Quality Estimation with Neural Network Features
    Author(s): Kashif Shah, Fethi Bougares, Loic Barrault, and Lucia Specia.
    Published in: Proceedings of First Conference on Machine Translation (WMT16), Volume 2: Shared Task Papers, 2016. 
  • SHEF-NN: Translation Quality Estimation with Neural Networks.
    Author(s): Kashif Shah, Varvara Logacheva, Gustavo Paetzold, Frédéric Blain, Daniel Beck, Fethi Bougares, and Lucia Specia
    Published in: Proceedings of Tenth Workshop on Statistical Machine Translation (WMT15), 2015. 
  • The IWSLT 2015 Evaluation Campaign
    Author(s): M. Cettolo, J. Niehues, S. Stüker, L. Bentivogli, R. Cattoni, M. Federico
    Published in: Proceedings of the 12th Workshop on Spoken Language Translation, 2015. 
  • Findings of the 2015 Workshop on Statistical Machine Translation.
    Author(s): Ondrej Bojar, Rajen Chatterjee, Christian Federmann, Barry Haddow, Matthias Huck, Chris Hokamp, Philipp Koehn, Varvara Logacheva, Christof Monz, Matteo Negri, Matt Post, Carolina Scarton, Lucia Specia, and Marco Turchi.
    Published in: Proceedings of Tenth Workshop on Statistical Machine Translation (WMT15), 2015. Page(s) 1-46. 
  • Towards a Systematic and Human-Informed Paradigm for High-Quality Machine Translation
    Author(s): Aljoscha Burchardt; Kim Harris; Georg Rehm; Hans Uszkoreit
    Published in: Proceedings of the LREC 2016 Workshop “Translation Evaluation: From Fragmented Tools and Data Sets to an Integrated Ecosystem”, 2016. 
  • Ten Years of WMT Evaluation Campaigns: Lessons Learnt
    Author(s): Ondřej Bojar, Christian Federmann, Barry Haddow, Philipp Koehn, Matt Post and Lucia Specia
    Published in: Proceedings of the LREC 2016 Workshop Translation Evaluation: From Fragmented Tools and Data Sets to an Integrated Ecosystem, 2016. 
  • The IWSLT Evaluation Campaign: Challenges, Achievements, Future Directions
    Author(s): L. Bentivogli, M. Federico, S. Stüker, M. Cettolo, J. Niehues
    Published in: "Proceedings of the LREC 2016 Workshop ""Translation Evaluation - From Fragmented Tools and Data Sets to an Integrated Ecosystem""", 2016. 
  • Findings of the 2016 Conference on Machine Translation
    Author(s): Ondrej Bojar, Rajen Chatterjee, Christian Federmann, Yvette Graham, Barry Haddow, Matthias Huck, Antonio Jimeno Yepes, Philipp Koehn, Varvara Logacheva, Christof Monz, Matteo Negri, Aurelie Neveol, Mariana Neves, Martin Popel, Matt Post, Raphael Rubino, Carolina Scarton, Lucia Specia, Marco Turchi, Karin Verspoor, and Marcos Zampieri.
    Published in: Proceedings of First Conference on Machine Translation (WMT16), Volume 2: Shared Task Papers, 2016. 
  • CRACKER - Cracking the Language Barrier: Coordination, Evaluation and Resources for European MT Research
    Author(s): Georg Rehm
    Published in: In Proceedings of the 19th Annual Conference of the European Association for Machine Translation (EAMT 2016), 2015. 
  • Technologies for Overcoming Language Barriers towards a truly integrated European Online Market
    Author(s): Georg Rehm (ed.)
    Published in: Strategic Agenda for the Multilingual Digital Single Market, 2015. 
  • Der Mensch bleibt im Mittelpunkt. Smarte Technologien für alle Branchen
    Author(s): Georg Rehm
    Published in: Vitako Aktuell. Zeitschrift der Bundes-Arbeitsgemeinschaft der Kommunalen IT-Dienstleister e.V., 2016. 
  • Language as a Data Type and Key Challenge for Big Data. Enabling the Multilingual Digital Single Market through technologies for translating, analysing, processing and curating natural language content
    Author(s): Georg Rehm (ed.).
    Published in: Strategic Research and Innovation Agenda for the Multilingual Digital Single Market, 2016. 
  • Language technologies for a multilingual Europe
    Author(s): Georg Rehm, Felix Sasaki, Daniel Stein, Andreas Witt (Volume Editor)
    Published in: Series: Translation and Multilingual Natural Language Processing, 2016. 
Síganos en: RSS Facebook Twitter YouTube Gestionado por la Oficina de Publicaciones de la UE Arriba