CORDIS provides links to public deliverables and publications of HORIZON projects.
Links to deliverables and publications from FP7 projects, as well as links to some specific result types such as dataset and software, are dynamically retrieved from OpenAIRE .
Deliverables
This deliverable consists of initial set of textual data acquired from web and non-web sources, both in monolingual and parallel parts, after cleaning done in WP2.
Free and open-source software will be released on GitHub.
First language models trained (opens in new window)Language models will be made available for download however it may not have all or the cleanest data.
Translation models for select language pairs (opens in new window)Models available for download trained using the pipeline.
Publications
Author(s):
Ona De Gibert, Raúl Vázquez, Mikko Aulamo, Yves Scherrer, Sami Virpioja, Jörg Tiedemann
Published in:
2023, ISBN 978-1-959429-91-3
Publisher:
Association for Computational Linguistics
DOI:
10.18653/V1/2023.AMERICASNLP-1.20
Author(s):
Popel, Martin; Libovický, Jindřich; Helcl, Jindřich
Published in:
2022, ISBN 978-1-959429-29-6
Publisher:
Association for Computational Linguistics
DOI:
10.48550/ARXIV.2212.00486
Author(s):
Ashok Urlana, Pinzhen Chen, Zheng Zhao, Shay Cohen, Manish Shrivastava, Barry Haddow
Published in:
2023, ISBN 979-8-89176-061-5
Publisher:
Association for Computational Linguistics
DOI:
10.18653/V1/2023.FINDINGS-EMNLP.777
Author(s):
Vivek Iyer, Pinzhen Chen, and Alexandra Birch
Published in:
2023, ISBN 979-8-89176-041-7
Publisher:
Association for Computational Linguistics
DOI:
10.18653/V1/2023.WMT-1.44
Author(s):
Luukkonen, Risto; Komulainen, Ville; Luoma, Jouni; Eskelinen, Anni; Kanerva, Jenna; Kupari, Hanna-Mari; Ginter, Filip; Laippala, Veronika; Muennighoff, Niklas; Piktus, Aleksandra; Wang, Thomas; Tazi, Nouamane; Scao, Teven Le; Wolf, Thomas; Suominen, Osma; Sairanen, Samuli; Merioksa, Mikko; Heinonen, Jyrki; Vahtola, Aija; Antao, Samuel; Pyysalo, Sampo
Published in:
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023, ISBN 979-8-89176-060-8
Publisher:
Association for Computational Linguistics
DOI:
10.48550/arxiv.2311.05640
Author(s):
David Samuel and Lilja Øvrelid
Published in:
2023, ISBN 978-1-959429-62-3
Publisher:
Association for Computational Linguistics
DOI:
10.18653/V1/2023.FINDINGS-ACL.890
Author(s):
Yang, Kailai; Ji, Shaoxiong; Zhang, Tianlin; Xie, Qianqian; Kuang, Ziyan; Ananiadou, Sophia
Published in:
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023, ISBN 979-8-89176-060-8
Publisher:
Association for Computational Linguistics
DOI:
10.48550/arxiv.2304.03347
Author(s):
Jörg Tiedemann and Ona de Gibert
Published in:
2023, ISBN 978-1-959429-70-8
Publisher:
Association for Computational Linguistics
DOI:
10.18653/V1/2023.ACL-DEMO.30
Author(s):
Bogoychev, Nikolay and Chen, Pinzhen
Published in:
2023, ISBN 979-8-89176-041-7
Publisher:
Association for Computational Linguistics
DOI:
10.18653/V1/2023.WMT-1.80
Author(s):
Chen, Pinzhen; Ji, Shaoxiong; Bogoychev, Nikolay; Kutuzov, Andrey; Haddow, Barry; Heafield, Kenneth
Published in:
EACL, 2023, ISBN 979-8-89176-088-2
Publisher:
Association for Computational Linguistics
DOI:
10.48550/arxiv.2309.08958
Author(s):
Mikko Aulamo, Ona de Gibert, Sami Virpioja, and Jörg Tiedemann
Published in:
Proceedings of the 24th Annual Conference of the European Association for Machine Translation, 2023, ISBN 978-952-03-2947-1
Publisher:
European Association for Machine Translation
Author(s):
Pinzhen Chen, Gerasimos Lampouras
Published in:
2023, ISBN 978-1-959429-47-0
Publisher:
Association for Computational Linguistics
Author(s):
Muennighoff, Niklas; Rush, Alexander M.; Barak, Boaz; Scao, Teven Le; Piktus, Aleksandra; Tazi, Nouamane; Pyysalo, Sampo; Wolf, Thomas; Raffel, Colin
Published in:
2023, ISSN 2331-8422
Publisher:
NeurIPS'23
DOI:
10.48550/arxiv.2305.16264
Author(s):
Helcl, Jindřich
Published in:
2022, ISBN 978-1-959429-29-6
Publisher:
Association for Computational Linguistics
DOI:
10.48550/ARXIV.2212.00477
Author(s):
Proyag Pal, Kenneth Heafield
Published in:
2023, ISBN 978-1-959429-47-0
Publisher:
Association for Computational Linguistics
DOI:
10.18653/V1/2023.FINDINGS-EACL.120
Author(s):
Nikolay Bogoychev and Pinzhen Chen and Barry Haddow and Alexandra Birch
Published in:
AAAI Workshop on Deployable AI, 2024, ISSN 2331-8422
Publisher:
arXiv
DOI:
10.48550/ARXIV.2311.09709
Author(s):
Tiedemann J.; Aulamo M.; Bakshandaeva D.; Boggia M.; Grönroos S. A.; Nieminen T.; Raganato A.; Scherrer Y.; Vázquez R.; Virpioja S.
Published in:
Springer, 2023, ISSN 2193-1801
Publisher:
Springer Science and Business Media Deutschland GmbH
DOI:
10.48550/ARXIV.2212.01936
Author(s):
Hajič, Jan
Published in:
2023
Publisher:
Oral presentation at Skeikampen, Norway
Author(s):
Chen, Pinzhen and Guo, Zhicheng and Haddow, Barry and Heafield, Kenneth
Published in:
2023, ISSN 2331-8422
Publisher:
arXiv
DOI:
10.48550/ARXIV.2306.03856
Author(s):
Zhanghao Hu and Yijun Yang and Junjie Xu and Yifu Qiu and Pinzhen Chen
Published in:
2024, ISSN 2331-8422
Publisher:
arXiv
DOI:
10.48550/ARXIV.2403.02176
Author(s):
Libovický, Jindřich
Published in:
2023
Publisher:
Talk at FI MUNI, Brno, Czechia
Author(s):
Nikolay Bogoychev and Jelmer van der Linde and Graeme Nail and Barry Haddow and Jaume Zaragoza-Bernabeu and Gema Ramírez-Sánchez and Lukas Weymann and Tudor Nicolae Mateiu and Jindřich Helcl and Mikko Aulamo
Published in:
2023, ISSN 2331-8422
Publisher:
arXiv
DOI:
10.48550/ARXIV.2311.14838
Searching for OpenAIRE data...
There was an error trying to search data from OpenAIRE
No results available