Skip to main content
Go to the home page of the European Commission (opens in new window)
English English
CORDIS - EU research results
CORDIS

Multilingual and Cross-cultural interactions for context-aware, and bias-controlled dialogue systems for safety-critical applications

CORDIS provides links to public deliverables and publications of HORIZON projects.

Links to deliverables and publications from FP7 projects, as well as links to some specific result types such as dataset and software, are dynamically retrieved from OpenAIRE .

Publications

Comparing Data Augmentation Methods for End-to-End Task-Oriented Dialog Systems (opens in new window)

Author(s): Christos Vlachos, Themos Stafylakis, Ion Androutsopoulos
Published in: Findings of the Association for Computational Linguistics ACL 2024, 2024
Publisher: Association for Computational Linguistics
DOI: 10.18653/V1/2024.FINDINGS-ACL.431

BESST Dataset: A Multimodal Resource for Speech-based Stress Detection and Analysis (opens in new window)

Author(s): Jan Pešán, Vojtěch Juřík, Martin Karafiát, Jan Černocký
Published in: Interspeech 2024, 2024
Publisher: ISCA
DOI: 10.21437/INTERSPEECH.2024-42

BUT systems and analyses for the ASVspoof 5 Challenge (opens in new window)

Author(s): Johan Rohdin, Lin Zhang, Plchot Oldřich, Vojtěch Staněk, David Mihola, Junyi Peng, Themos Stafylakis, Dmitriy Beveraki, Anna Silnova, Jan Brukner, Lukáš Burget
Published in: The Automatic Speaker Verification Spoofing Countermeasures Workshop (ASVspoof 2024), 2024
Publisher: ISCA
DOI: 10.21437/ASVSPOOF.2024-4

TokenVerse: Towards Unifying Speech and NLP Tasks via Transducer-based ASR (opens in new window)

Author(s): Shashi Kumar, Srikanth Madikeri, Juan Pablo Zuluaga Gomez, Iuliia Thorbecke, Esaú Villatoro-tello, Sergio Burdisso, Petr Motlicek, Karthik Pandia D S, Aravind Ganapathiraju
Published in: Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024
Publisher: Association for Computational Linguistics
DOI: 10.18653/V1/2024.EMNLP-MAIN.1167

Pretraining End-to-End Keyword Search with Automatically Discovered Acoustic Units (opens in new window)

Author(s): Bolaji Yusuf, Jan Honza Cernocky, Murat Saraçlar
Published in: Interspeech 2024, 2024
Publisher: ISCA
DOI: 10.21437/INTERSPEECH.2024-1713

Parameter-Efficient Transfer Learning of Audio Spectrogram Transformers (opens in new window)

Author(s): Umberto Cappellazzo, Daniele Falavigna, Alessio Brutti, Mirco Ravanelli
Published in: 2024 IEEE 34th International Workshop on Machine Learning for Signal Processing (MLSP), 2024
Publisher: IEEE
DOI: 10.1109/MLSP58920.2024.10734776

Multitask Speech Recognition and Speaker Change Detection for Unknown Number of Speakers (opens in new window)

Author(s): Shashi Kumar, Srikanth Madikeri, Iuliia Nigmatulina, Esaú Villatoro-Tello, Petr Motlicek, Karthik Pandia, S. Pavankumar Dubagunta, Aravind Ganapathiraju
Published in: ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2024
Publisher: IEEE
DOI: 10.1109/ICASSP48485.2024.10446130

Large Language Models are Strong Audio-Visual Speech Recognition Learners (opens in new window)

Author(s): Umberto Cappellazzo, Minsu Kim, Honglie Chen, Pingchuan Ma, Stavros Petridis, Daniele Falavigna, Alessio Brutti, Maja Pantic
Published in: ICASSP 2025 - 2025 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2025
Publisher: IEEE
DOI: 10.1109/ICASSP49660.2025.10889251

Dialog2Flow: Pre-training Soft-Contrastive Action-Driven Sentence Embeddings for Automatic Dialog Flow Extraction (opens in new window)

Author(s): Sergio Burdisso, Srikanth Madikeri, Petr Motlicek
Published in: Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024
Publisher: Association for Computational Linguistics
DOI: 10.18653/V1/2024.EMNLP-MAIN.310

Multimodal Emotion Recognition Using Compressed Graph Neural Networks (opens in new window)

Author(s): Tijana Đurkić, Nikola Simić, Siniša Suzić, Dragana Bajović, Zoran Perić, Vlado Delić
Published in: Lecture Notes in Computer Science, Speech and Computer, 2025
Publisher: Springer Nature Switzerland
DOI: 10.1007/978-3-031-78014-1_9

BUT/JHU System Description for CHiME-8 NOTSOFAR-1 Challenge (opens in new window)

Author(s): Alexander Polok, Dominik Klement, Jiangyu Han, Šimon Sedláček, Bolaji Yusuf, Matthew Maciejewski, Matthew S Wiesner, Lukáš Burget
Published in: 8th International Workshop on Speech Processing in Everyday Environments (CHiME 2024), 2024
Publisher: ISCA
DOI: 10.21437/CHIME.2024-4

Efficient Fine-tuning of Audio Spectrogram Transformers via Soft Mixture of Adapters (opens in new window)

Author(s): Umberto Cappellazzo, Daniele Falavigna, Alessio Brutti
Published in: Interspeech 2024, 2024
Publisher: ISCA
DOI: 10.21437/INTERSPEECH.2024-38

MT-LENS: An all-in-one Toolkit for Better Machine Translation Evaluation (opens in new window)

Author(s): Javier García Gilabert, Carlos Escolano, Audrey Mash, Xixian Liao, Maite Melero
Published in: Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies (System Demonstrations), 2025
Publisher: Association for Computational Linguistics
DOI: 10.18653/V1/2025.NAACL-DEMO.6

Searching for OpenAIRE data...

There was an error trying to search data from OpenAIRE

No results available

My booklet 0 0