Browser-based Multilingual Translation

Deliverables

Translation software with initial CPU optimizations

Training-time domain adaptation (report + code)

Faster software including non-autoregressive

Building on Mozilla cluster

Basic Firefox integration

First Dissemination Report

Weakly supervised Quality Estimation (report)

Collected webpage texts and metadata (report)

Ethics plan updated and reviewed

Lowering source sentence complexity (report)

Publications

Multi-Hypothesis Machine Translation Evaluation

Author(s): Marina Fomicheva, Lucia Specia, Francisco Guzmán
Published in: Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020, Page(s) 1218-1232
DOI: 10.18653/v1/2020.acl-main.113

Compressing Neural Machine Translation Models with 4-bit Precision

Author(s): Alham Fikri Aji, Kenneth Heafield
Published in: Proceedings of the Fourth Workshop on Neural Generation and Translation, 2020, Page(s) 35-42
DOI: 10.18653/v1/2020.ngt-1.4

Costra 1.1: An Inquiry into Geometric Properties of Sentence Spaces

Author(s): Ondřej Bojar, Petra Barančíková
Published in: Proceedings of the 23rd International Conference on Text, Speech and Dialogue - TSD 2020, 2020

Quality In, Quality Out: Learning from Actual Mistakes

Author(s): Frederic Blain, Nikolaos Aletras, Lucia Specia
Published in: Proceedings of the 22nd Annual Conference of the European Association for Machine Translation, 2020, Page(s) 145-153

Expand and Filter: CUNI and LMU Systems for the WNGT 2020 Duolingo Shared Task

Author(s): Jindřich Libovický, Zdeněk Kasner, Jindřich Helcl, Ondřej Dušek
Published in: Proceedings of the Fourth Workshop on Neural Generation and Translation, 2020, Page(s) 153-160
DOI: 10.18653/v1/2020.ngt-1.18

Efficiently Reusing Old Models Across Languages via Transfer Learning

Author(s): Tom Kocmi, Ondřej Bojar
Published in: Proceedings of the 22nd Annual Conference of the European Association for Machine Translation, 2020, Page(s) 19-28

Multimodal Quality Estimation for Machine Translation

Author(s): Shu Okabe, Frédéric Blain, Lucia Specia
Published in: Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020, Page(s) 1233-1240
DOI: 10.18653/v1/2020.acl-main.114

CUNI System for the WMT19 Robustness Task

Author(s): Jindřich Helcl, Jindřich Libovický, Martin Popel
Published in: Proceedings of the Fourth Conference on Machine Translation (Volume 2: Shared Task Papers, Day 1), 2019, Page(s) 539-543
DOI: 10.18653/v1/w19-5364

The University of Edinburgh’s Submissions to the WMT19 News Translation Task

Author(s): Rachel Bawden, Nikolay Bogoychev, Ulrich Germann, Roman Grundkiewicz, Faheem Kirefu, Antonio Valerio Miceli Barone, Alexandra Birch
Published in: Proceedings of the Fourth Conference on Machine Translation (Volume 2: Shared Task Papers, Day 1), 2019, Page(s) 103-115
DOI: 10.18653/v1/w19-5304

SAO WMT19 Test Suite: Machine Translation of Audit Reports

Author(s): Tereza Vojtěchová, Michal Novák, Miloš Klouček, Ondřej Bojar
Published in: Proceedings of the Fourth Conference on Machine Translation (Volume 2: Shared Task Papers, Day 1), 2019, Page(s) 481-493
DOI: 10.18653/v1/w19-5355

CUNI Submission for Low-Resource Languages in WMT News 2019

Author(s): Tom Kocmi, Ondřej Bojar
Published in: Proceedings of the Fourth Conference on Machine Translation (Volume 2: Shared Task Papers, Day 1), 2019, Page(s) 234-240
DOI: 10.18653/v1/w19-5322

Findings of the WMT 2019 Shared Tasks on Quality Estimation

Author(s): Erick Fonseca, Lisa Yankovskaya, André F. T. Martins, Mark Fishel, Christian Federmann
Published in: Proceedings of the Fourth Conference on Machine Translation (Volume 3: Shared Task Papers, Day 2), 2019, Page(s) 1-10
DOI: 10.18653/v1/w19-5401

Quality Estimation and Translation Metrics via Pre-trained Word and Sentence Embeddings

Author(s): Elizaveta Yankovskaya, Andre Tättar, Mark Fishel
Published in: Proceedings of the Fourth Conference on Machine Translation (Volume 3: Shared Task Papers, Day 2), 2019, Page(s) 101-105
DOI: 10.18653/v1/w19-5410

University of Tartu’s Multilingual Multi-domain WMT19 News Translation Shared Task Submission

Author(s): Andre Tättar, Elizaveta Korotkova, Mark Fishel
Published in: Proceedings of the Fourth Conference on Machine Translation (Volume 2: Shared Task Papers, Day 1), 2019, Page(s) 382-385
DOI: 10.18653/v1/w19-5342

COSTRA 1.0: A Dataset of Complex Sentence Transformations

Author(s): Ondřej Bojar, Petra Barančíková
Published in: Proceedings of the 12th International Conference on Language Resources and Evaluation (LREC 2020), Issue 6, 2020, Page(s) 3535-3541

From Research to Production and Back: Ludicrously Fast Neural Machine Translation

Author(s): Young Jin Kim, Marcin Junczys-Dowmunt, Hany Hassan, Alham Fikri Aji, Kenneth Heafield, Roman Grundkiewicz, Nikolay Bogoychev
Published in: Proceedings of the 3rd Workshop on Neural Generation and Translation, 2019, Page(s) 280-288
DOI: 10.18653/v1/d19-5632

Edinburgh’s Submissions to the 2020 Machine Translation Efficiency Task

Author(s): Nikolay Bogoychev, Roman Grundkiewicz, Alham Fikri Aji, Maximiliana Behnke, Kenneth Heafield, Sidharth Kashyap, Emmanouil-Ioannis Farsarakis, Mateusz Chudyk
Published in: Proceedings of the Fourth Workshop on Neural Generation and Translation, 2020, Page(s) 218-224
DOI: 10.18653/v1/2020.ngt-1.26

Character Mapping and Ad-hoc Adaptation: Edinburgh’s IWSLT 2020 Open Domain Translation System

Author(s): Pinzhen Chen, Nikolay Bogoychev, Ulrich Germann
Published in: Proceedings of the 17th International Conference on Spoken Language Translation, 2020, Page(s) 122-129
DOI: 10.18653/v1/2020.iwslt-1.14

Outbound Translation User Interface Ptakopet: A Pilot Study

Author(s): Vilém Zouhar, Ondřej Bojar
Published in: Proceedings of The 12th Language Resources and Evaluation Conference, 2020, Page(s) 6967-6975

Findings of the 2019 Conference on Machine Translation (WMT19)

Author(s): Loïc Barrault, Ondřej Bojar, Marta R. Costa-jussà, Christian Federmann, Mark Fishel, Yvette Graham, Barry Haddow, Matthias Huck, Philipp Koehn, Shervin Malmasi, Christof Monz, Mathias Müller, Santanu Pal, Matt Post, Marcos Zampieri
Published in: Proceedings of the Fourth Conference on Machine Translation (Volume 2: Shared Task Papers, Day 1), 2019, Page(s) 1-61
DOI: 10.18653/v1/w19-5301

Findings of the Fourth Workshop on Neural Generation and Translation

Author(s): Kenneth Heafield, Hiroaki Hayashi, Yusuke Oda, Ioannis Konstas, Andrew Finch, Graham Neubig, Xian Li, Alexandra Birch
Published in: Proceedings of the Fourth Workshop on Neural Generation and Translation, 2020, Page(s) 1-9
DOI: 10.18653/v1/2020.ngt-1.1

Exploring Benefits of Transfer Learning in Neural Machine Translation

Author(s): Tom Kocmi
Published in: 2019

Replacing Linguists with Dummies: A Serious Need for Trivial Baselines in Multi-Task Neural Machine Translation

Author(s): Daniel Kondratyuk, Ronald Cardenas, Ondřej Bojar
Published in: The Prague Bulletin of Mathematical Linguistics, Issue 113/1, 2019, Page(s) 31-40, ISSN 1804-0462
DOI: 10.2478/pralin-2019-0005

Unsupervised Quality Estimation for Neural Machine Translation

Author(s): Marina Fomicheva, Shuo Sun, Lisa Yankovskaya, Frédéric Blain, Francisco Guzmán, Mark Fishel, Nikolaos Aletras, Vishrav Chaudhary, Lucia Specia
Published in: Transactions of the Association for Computational Linguistics, 2020, ISSN 2307-387X