European Live Translator

Deliverables

Public Project Web Site and Updates

The first version of the public project web site will be available after the third month of the project. It will be continuously updated and checked, especially after months 12, 24, and 36.

Initial Training Data, Separating Confidential and Public Version

Year 1 Test Sets

Year 2 Test Sets

Initial Report on Summarization

Report on NLP Technologies Workshop at EUROSAI Congress

Report 1 on Spoken Language Translation

Report on Dissemination Activities: Intermediate, Final

The Intermediate Report on Dissemination Activities will be published by the 18th month of the project; the Final Report on Dissemination Activities will be prepared at the end of the project (36th month).

Initial Report on Multi-Lingual MT

Report 1 on Initial ASR Systems

Publications

ELITR: European Live Translator

Author(s): Haddow, Barry; Sagar, Sangeet; Simonini, Ivan; Franceschini, Dario; Stüker, Sebastian; Sennrich, Rico; Nguyen, Thai-Son; Macháček, Dominik; Canton, Chiara; Ansari, Ebrahim; Smrž, Otakar; Williams, Philip; Waibel, Alex; Kratochvíl, Jonáš; Schneider, Felix; Bojar, Ondřej
Published in: 1, 2020
Publisher: European Association for Machine Translation

Adaptive Feature Selection for End-to-End Speech Translation

Author(s): Biao Zhang; Ivan Titov; Barry Haddow; Rico Sennrich
Published in: Findings of the Association for Computational Linguistics: EMNLP 2020 (The 2020 Conference on Empirical Methods in Natural Language Processing, Virtual conference, 16/11/20), 11, 2020, Page(s) 2533-2544, https://doi.org/10.18653/v1/2020.findings-emnlp.230
Publisher: Association for Computational Linguistics
DOI: 10.5167/uzh-191666

CUNI Systems for the Unsupervised News Translation Task in WMT 2019

Author(s): Ivana Kvapilíková, Dominik Macháček, Ondřej Bojar
Published in: Proceedings of the Fourth Conference on Machine Translation (Volume 2: Shared Task Papers, Day 1), 2019, Page(s) 241-248
Publisher: Association for Computational Linguistics
DOI: 10.18653/v1/w19-5323

Towards Stream Translation: Adaptive Computation Time for Simultaneous Machine Translation

Author(s): Felix Schneider; Alex Waibel
Published in: Proceedings of the 17th International Conference on Spoken Language Translation, 2020
Publisher: ACL

Neural Codes to Factor Language in Multilingual Speech Recognition

Author(s): Markus Müller; Sebastian Stüker; Alex Waibel
Published in: In proceedings of the International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2019
Publisher: IEEE

Improving Massively Multilingual Neural Machine Translation and Zero-Shot Translation

Author(s): Zhang, Biao; Williams, Philip; Titov, Ivan; Sennrich, Rico
Published in: Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 25, 2020
Publisher: ACL
DOI: 10.5167/uzh-188226

Findings of the IWSLT 2020 Evaluation Campaign

Author(s): Ebrahim Ansari; Amittai Axelrod; Nguyen Bach; Ondřej Bojar; Roldano Cattoni; Fahim Dalvi; Nadir Durrani; Marcello Federico; Christian Federmann; Jiatao Gu; Fei Huang; Kevin Knight; Xutai Ma; Ajay Nagesh; Matteo Negri; Jan Niehues; Juan Pino; Elizabeth Salesky; Xing Shi; Sebastian Stüker; Marco Turchi; Alex Waibel; Changhan Wang
Published in: 17th International Conference on Spoken Language Translation (IWSLT 2020), 4, 2020
Publisher: Association for Computational Linguistics
DOI: 10.18653/v1/2020.iwslt-1.1

Removing European Language Barriers with Innovative Machine Translation Technology

Author(s): Bojar, Ondřej; Macháček, Dominik; Ha, Thanh-Le; Simonini, Ivan; Franceschini, Dario; Williams, Phil; Stüker, Sebastian; Sagar, Sangeet; Nguyen, Thai-Son; Schneider, Felix; Waibel, Alex; Canton, Chiara; Smrž, Otakar; Glott, Adelheid; Haddow, Barry; Sennrich, Rico; Schweinfurth, Armin
Published in: 1, 2020
Publisher: European Language Resources Association

Efficient Weight factorization for Multilingual Speech Recognition

Author(s): Ngoc-Quan Pham; Tuan-Nam Nguyen; Sebastian Stüker; Alex Waibel
Published in: 1, 2021
Publisher: ISCA
DOI: 10.21437/interspeech.2021-216

ELITR Multilingual Live Subtitling: Demo and Strategy

Author(s): Ondřej Bojar; Dominik Macháček; Sangeet Sagar; Otakar Smrž; Jonáš Kratochvíl; Peter Polák; Ebrahim Ansari; Mohammad Mahmoudi; Rishu Kumar; Dario Franceschini; Chiara Canton; Ivan Simonini; Thai-Son Nguyen; Felix Schneider; Sebastian Stüker; Alex Waibel; Barry Haddow; Rico Sennrich; Philip Williams
Published in: EACL (System Demonstrations), 5, 2021
Publisher: Association for Computational Linguistics
DOI: 10.18653/v1/2021.eacl-demos.32

Improving Deep Transformer with Depth-Scaled Initialization and Merged Attention

Author(s): Biao Zhang; Ivan Titov; Rico Sennrich
Published in: Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP), 12, 2019
Publisher: ACL
DOI: 10.5167/uzh-176330

Context-Aware Monolingual Repair for Neural Machine Translation

Author(s): Elena Voita; Rico Sennrich; Ivan Titov
Published in: Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP), 13, 2019
Publisher: ACL
DOI: 10.5167/uzh-176331

Improving sequence-to-sequence speech recognition training with on-the-fly data augmentation

Author(s): Thai-Son Nguyen; Sebastian Stüker; Jan Niehues; Alex Waibel
Published in: In Proceedings of the 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2020
Publisher: IEEE

Operating a Complex SLT System with Speakers and Human Interpreters

Author(s): Srdečný, Vojtěch; Williams, Phil; Schneider, Felix; Canton, Chiara; Kumar, Rishu; Smrž, Otakar; Haddow, Barry; Bojar, Ondřej
Published in: 16, 2021
Publisher: Association for Machine Translation in the Americas

Towards Automatic Minuting of Meetings

Author(s): Nedoluzhko, Anna; Bojar, Ondřej
Published in: Proceedings of the 19th Conference ITAT 2019: Slovenskočeský NLP workshop (SloNLP 2019), 7, 2019
Publisher: CreateSpace Independent Publishing Platform

Modeling Confidence in Sequence-to-Sequence Models

Author(s): Niehues, Jan; Pham, Ngoc-Quan
Published in: In Proceedings of the 12th International Conference on Natural Language Generation - INLG, 13, 2019
Publisher: ACL

Keyphrase Generation: A Multi-Aspect Survey

Author(s): Erion Çano, Ondřej Bojar
Published in: 2019 25th Conference of Open Innovations Association (FRUCT), 2019, Page(s) 85-94, ISBN 978-952-69244-0-3
Publisher: IEEE
DOI: 10.23919/FRUCT48121.2019.8981519

SAO WMT19 Test Suite: Machine Translation of Audit Reports

Author(s): Tereza Vojtěchová, Michal Novák, Miloš Klouček, Ondřej Bojar
Published in: Proceedings of the Fourth Conference on Machine Translation (Volume 2: Shared Task Papers, Day 1), 2019, Page(s) 481-493
Publisher: Association for Computational Linguistics
DOI: 10.18653/v1/w19-5355

Efficiency Metrics for Data-Driven Models: A Text Summarization Case Study

Author(s): Erion Çano, Ondřej Bojar
Published in: Proceedings of the 12th International Conference on Natural Language Generation, 2019, Page(s) 229-239
Publisher: Association for Computational Linguistics
DOI: 10.18653/v1/w19-8630

Lost in Interpreting: Speech Translation from Source or Interpreter?

Author(s): Dominik Macháček; Matúš Žilinec; Ondřej Bojar
Published in: Proc. Interspeech 2021, 1, 2021
Publisher: ISCA
DOI: 10.21437/interspeech.2021-2232

On Sparsifying Encoder Outputs in Sequence-to-Sequence Models

Author(s): Biao Zhang; Ivan Titov; Rico Sennrich
Published in: Findings of the Association for Computational Linguistics: ACL-IJCNLP 2021 (The Joint Conference of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, Online, Bangkok, Tha), 11, 2021, Page(s) 2888-2900
Publisher: Association for Computational Linguistics
DOI: 10.18653/v1/2021.findings-acl.255

Very Deep Self-Attention Networks for End-to-End Speech Recognition

Author(s): Pham, Ngoc-Quan; Nguyen, Thai-Son; Niehues, Jan; Müller, Markus; Stüker, Sebastian; Waibel, Alexander
Published in: In Proceedings of the 20th Annual Conference of the International Speech Communication Association, INTERSPEECH 2019, 4, 2019
Publisher: International Speech Communication Association

When a Good Translation is Wrong in Context: Context-Aware Machine Translation Improves on Deixis, Ellipsis, and Lexical Cohesion

Author(s): Elena Voita; Rico Sennrich; Ivan Titov
Published in: Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics, 12, 2019
Publisher: ACL
DOI: 10.5167/uzh-172620

Root Mean Square Layer Normalization

Author(s): Zhang, Biao; Sennrich, Rico
Published in: Advances in Neural Information Processing Systems 32, 19, 2019
Publisher: Conference on Neural Information Processing Systems
DOI: 10.5167/uzh-177483

Findings of the IWSLT 2021 Evaluation Campaign

Author(s): Salesky, Elizabeth; Elbayad, Maha; Niehues, Jan; Ma, Xutai; Negri, Matteo; Stüker, Sebastian; Waibel, Alex; Federico, Marcello; Cattoni, Roldano; Bremerman, Jacob; Sudoh, Katsuhito; Turchi, Marco; Wang, Changhan; Nakamura, Satoshi; Anastasopoulos, Antonios; Pino, Juan; Bojar, Ondřej; Wiesner, Matthew
Published in: Proceedings of the 18th International Conference on Spoken Language Translation (IWSLT 2021), 4, 2021
Publisher: Association for Computational Linguistics
DOI: 10.18653/v1/2021.iwslt-1.1

Self-Attentional Models for Lattice Inputs

Author(s): Sperber, Matthias; Neubig, Graham; Pham, Ngoc-Quan; Waibel, Alex
Published in: Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics, 3, 2019, Page(s) 1185–1197
Publisher: ACL

ELITR Non-Native Speech Translation at IWSLT 2020

Author(s): Dominik Macháček; Jonáš Kratochvíl; Sangeet Sagar; Matúš Žilinec; Ondřej Bojar; Thai-Son Nguyen; Felix Schneider; Philip Williams; Yuekun Yao
Published in: Crossref, 2, 2020
Publisher: Association for Computational Linguistics
DOI: 10.18653/v1/2020.iwslt-1.25

A Test Suite and Manual Evaluation of Document-Level NMT at WMT19

Author(s): Rysová, Magdaléna; Poláková, Lucie; Rysová, Kateřina; Bojar, Ondřej; Musil, Tomáš
Published in: Proceedings of the Fourth Conference on Machine Translation (Volume 2: Shared Task Papers, Day 1), 5, 2019
Publisher: ACL

Toward Cross-Domain Speech Recognition with End-to-End Models

Author(s): Thai-Son Nguyen; Sebastian Stüker; Alex Waibel
Published in: In Proceedings of the Life Long Learning for Spoken Language Systems Workshop colocated with ASRU 2019, 2019
Publisher: ASRU

SLTEV: Comprehensive Evaluation of Spoken Language Translation

Author(s): Ebrahim Ansari; Ondřej Bojar; Barry Haddow; Mohammad Mahmoudi
Published in: EACL (System Demonstrations), 6, 2021
Publisher: Association for Computational Linguistics
DOI: 10.18653/v1/2021.eacl-demos.9

KIT's Submission to the IWSLT 2019 Shared Task on Text Translation

Author(s): Felix Schneider; Alex Waibel
Published in: In Proceedings of the 16th International Workshop on Spoken Language Translation 2019 (IWSLT), 2019
Publisher: IWSLT

Improving Zero-shot Translation with Language-Independent Constraints

Author(s): Pham, Ngoc-Quan; Niehues, Jan; Ha, Thanh-Le; Waibel, Alex
Published in: In Proceedings of the 4th Conference on Machine Translation (WMT), ACL 2019, 4, 2019
Publisher: ACL

ELITR Non-Native Speech Translation at IWSLT 2020

Author(s): Macháček, Dominik; Kratochvíl, Jonáš; Sagar, Sangeet; Žilinec, Matúš; Bojar, Ondřej; Nguyen, Thai-Son; Schneider, Felix; Williams, Philip; Yao, Yuekun
Published in: Proceedings of the 17th International Conference on Spoken Language Translation (IWSLT 2020), 11, 2020
Publisher: IWSLT

Paper Length Prediction from the Metadata

Author(s): Erion Çano; Ondřej Bojar
Published in: NLPIR, 2, 2020, ISBN 9781450377607
Publisher: Association for Computing Machinery
DOI: 10.1145/3443279.3443305

Removing European Language Barriers with Innovative Machine Translation Technology

Author(s): Dario Franceschini; Chiara Canton; Ivan Simonini; Armin Schweinfurth; Adelheid Glott; Sebastian Stüker; Thai-Son Nguyen; Felix Schneider; Thanh-Le Ha; Alex Waibel; Barry Haddow; Philip Williams; Rico Sennrich; Ondřej Bojar; Sangeet Sagar; Dominik Macháček; Otakar Smrž
Published in: Proceedings of the 1st International Workshop on Language Technology Platforms, 2020, ISBN 979-10-95546-64-1
Publisher: ELRA

Large Corpus of Czech Parliament Plenary Hearings

Author(s): Polák, Peter; Kratochvíl, Jonáš; Bojar, Ondřej
Published in: 1, 2020
Publisher: European Language Resources Association

Improving Zero-shot Translation with Language-Independent Constraints

Author(s): Ngoc-Quan Pham; Jan Niehues; Thanh-Le Ha; Alex Waibel
Published in: Proceedings of the Fourth Conference on Machine Translation (Volume 1: Research Papers), 2, 2019
Publisher: Association for Computational Linguistics
DOI: 10.18653/v1/w19-5202

Findings of the 2020 Conference on Machine Translation (WMT20)

Author(s): Zampieri, Marcos; Kocmi, Tom; Barrault, Loïc; Ljubešić, Nikola; Morishita, Makoto; Lo, Chi-kiu; Nakazawa, Toshiaki; Pal, Santanu; Joanis, Eric; Costa-Jussà, Marta; Monz, Christof; Grundkiewicz, Roman; Huck, Matthias; Biesialska, Magdalena; Graham, Yvette; Nagata, Masaaki; Haddow, Barry; Koehn, Philipp; Bojar, Ondřej; Post, Matt; Federmann, Christian
Published in: 10, 2020
Publisher: Association for Computational Linguistics

High Performance Sequence-to-Sequence Model for Streaming Speech Recognition

Author(s): Thai-Son Nguyen; Ngoc-Quan Pham; Sebastian Stüker; Alex Waibel
Published in: To appear in Proceedings of the 21st INTERSPEECH, 2020
Publisher: International Speech Communication Association

A Test Suite and Manual Evaluation of Document-Level NMT at WMT19

Author(s): Kateřina Rysová; Magdaléna Rysová; Tomáš Musil; Lucie Poláková; Ondřej Bojar
Published in: WMT (2), 8, 2019
Publisher: Association for Computational Linguistics
DOI: 10.18653/v1/w19-5352

The IWSLT 2019 KIT Speech Translation System

Author(s): Ngoc-Quan Pham; Thai-Son Nguyen; Thanh-Le Ha; Juan Hussain; Felix Schneider; Jan Niehues; Sebastian Stüker; Alexander Waibel
Published in: In Proceedings of the 16th International Workshop on Spoken Language Translation 2019 (IWSLT), 2019
Publisher: IWSLT

Two Huge Title and Keyword Generation Corpora of Research Articles

Author(s): Çano, Erion; Bojar, Ondřej
Published in: Proceedings of The 12th Language Resources and Evaluation Conference, 21, 2020
Publisher: ELRA

Keyphrase Generation: A Text Summarization Struggle

Author(s): Erion Çano, Ondřej Bojar
Published in: Proceedings of the 2019 Conference of the North, 2019, Page(s) 666-672
Publisher: Association for Computational Linguistics
DOI: 10.18653/v1/N19-1070

CUNI Neural ASR with Phoneme-Level Intermediate Step for Non-Native SLT at IWSLT 2020

Author(s): Peter Polák; Sangeet Sagar; Dominik Macháček; Ondřej Bojar
Published in: Crossref, 1, 2020
Publisher: Association for Computational Linguistics
DOI: 10.18653/v1/2020.iwslt-1.24

Presenting Simultaneous Translation in Limited Space

Author(s): Macháček, Dominik; Bojar, Ondřej
Published in: Proceedings of the 20th Conference Information Technologies - Applications and Theory (ITAT 2020), 3, 2020
Publisher: CEUR-WS.org

WMT20 Document-Level Markable Error Exploration

Author(s): Vojtěchová, Tereza; Zouhar, Vilém; Bojar, Ondřej
Published in: 5, 2020
Publisher: Association for Computational Linguistics

Report on the SIGDial 2021 Special Session on Summarization of Dialogues and Multi-Party Meetings (SummDial)

Author(s): Tirthankar Ghosal; Muskaan Singh; Anja Nedoluzhko; Ondřej Bojar
Published in: Crossref, 2, 2022, ISSN 0163-5840
Publisher: Association for Computing Machinery
DOI: 10.1145/3527546.3527561

On Sparsifying Encoder Outputs in Sequence-to-Sequence Models

Author(s): Zhang, Biao; Titov, Ivan; Sennrich, Rico
Published in: arXiv e-print, 2020
Publisher: arXiv e-print

Human or Machine: Automating Human Likeliness Evaluation of NLG Texts

Author(s): Çano, Erion; Bojar, Ondřej
Published in: arXiv e-print, 2020
Publisher: arXiv e-print

Dynamic Masking for Improved Stability in Spoken Language Translation

Author(s): Yao, Yuekun; Haddow, Barry
Published in: arXiv e-prints, 1, 2020
Publisher: arXiv e-prints

Adaptive Feature Selection for End-to-End Speech Translation

Author(s): Biao Zhang; Ivan Titov; Barry Haddow; Rico Sennrich
Published in: arXiv e-print, 2020
Publisher: arXiv e-print

Attention-Passing Models for Robust and Data-Efficient End-to-End Speech Translation

Author(s): Sperber, Matthias; Neubig, Graham; Niehues, Jan; Waibel, Alex
Published in: Transactions of the Association for Computational Linguistics, 7, 2019, ISSN 2307-387X
Publisher: MIT Press

Promoting the Knowledge of Source Syntax in Transformer NMT Is Not Needed

Author(s): Thuong Hai Pham, Dominik Macháček, Ondřej Bojar
Published in: Computación y Sistemas, 23/3, 2019, ISSN 1405-5546
Publisher: Centro de Investigacion en Computacion (CIC) del Instituto Politecnico Nacional (IPN)
DOI: 10.13053/cys-23-3-3265

A Speech Test Set of Practice Business Presentations with Additional Relevant Texts

Author(s): Dominik Macháček, Jonáš Kratochvíl, Tereza Vojtěchová, Ondřej Bojar
Published in: Statistical Language and Speech Processing - 7th International Conference, SLSP 2019, Ljubljana, Slovenia, October 14–16, 2019, Proceedings, 11816, 2019, Page(s) 151-161, ISBN 978-3-030-31371-5
Publisher: Springer International Publishing
DOI: 10.1007/978-3-030-31372-2_13

ParCzech 3.0: A Large Czech Speech Corpus with Rich Metadata

Author(s): Matyáš Kopp; Vladislav Stankov; Jan Oldřich Krůza; Pavel Straňák; Ondřej Bojar
Published in: TSD, 18, 2021
Publisher: Springer
DOI: 10.1007/978-3-030-83527-9_25