Cost-effective, Multilingual, Privacy-driven voice-enabled Services

Results

Initial scientific evaluation

First combined evaluation of baseline speech, dialog, and translation tools.

Platform hardware and software architecture

Platform specification including main requirements, hardware and software architecture

SDK software architecture

SDK specification including main requirements and software architecture

Data protection and GDPR requirements

Guidelines, procedures, and recommendations on how to implement personal data protection in the platform.

Dissemination and communication action plan

Summarises all planned dissemination actions and provides communication material to the partners (logo, graphical chart, public website, short presentation of the project, templates, poster, leaflet)

First dissemination and communication report

Update on the actions conducted and web document defining the COMPRISE knowledge repository on multilingual voice-enabled applications.

Baseline speech and text transformation and model learning library

Design, implementation, and evaluation of baseline transformations that remove the user’s identity and words carrying critical information, together with model learning.

Improved transformation library and initial privacy guarantees

Design, implementation, and evaluation of speech and text transformations addressing more types of private information and initial statistical utility/privacy bounds.

Initial multilingual interaction library

Software components and documentation for speech-to-speech translation and integration of dialog systems in the operating branch.

Initial weakly supervised learning library

Design, implementation, and evaluation of weakly supervised learning for spoken language understanding.

Initial personalised learning library for speech-to-text

Design, implementation, and evaluation of initial model personalisation strategies for speech-to-text.

Publications

Transfer Learning and Distant Supervision for Multilingual Transformer Models: A Study on African Languages

Authors: Michael A. Hedderich; David Ifeoluwa Adelani; Dawei Zhu; Jesujoba O. Alabi; Udia Markus; Dietrich Klakow
Published in: 2020 Conference on Empirical Methods in Natural Language Processing, 16/11/2020, 2020
Publisher: ACL
DOI: 10.18653/v1/2020.emnlp-main.204

Distant supervision and noisy label learning for low resource named entity recognition: A study on Hausa and Yorùbá

Authors: Adelani, David Ifeoluwa; Hedderich, Michael; Zhu, Dawei; Van Den Berg, Esther; Klakow, Dietrich
Published in: AfricaNLP / PML4DC Workshop 2020 @ICLR 2020, 26/04/2020, 2020
Publisher: OpenReview.net

Preventing author profiling through zero-shot multilingual back-translation

Authors: Adelani, David; Zhang, Miaoran; Shen, Xiaoyu; Davody, Ali; Kleinbauer, Thomas; Klakow, Dietrich
Published in: 2021 Conference on Empirical Methods in Natural Language Processing, 07/11/2021, 2021
Publisher: ACL

The effect of domain and diacritics in Yorùbá-English neural machine translation

Authors: Adelani, David; Ruiter, Dana; Alabi, Jesujoba; Adebonojo, Damilola; Ayeni, Adesina; Adeyemi, Mofetoluwa; Awokoya, Ayodele; Espana-Bonet, Cristina
Published in: 18th Biennial Machine Translation Summit, 16/08/2021, 2021
Publisher: AMTA

Benchmarking and challenges in security and privacy for voice biometrics

Authors: Bonastre, Jean-Francois; Delgado, Hector; Evans, Nicholas; Kinnunen, Tomi; Lee, Kong Aik; Liu, Xuechen; Nautsch, Andreas; Noe, Paul-Gauthier; Patino, Jose; Sahidullah, Md; Srivastava, Brij Mohan Lal; Todisco, Massimiliano; Tomashenko, Natalia; Vincent, Emmanuel; Wang, Xin; Yamagishi, Junichi
Published in: 1st ISCA Symposium on Security and Privacy in Speech Communication, 10/11/2021, 2021
Publisher: ISCA

Privacy-Preserving Adversarial Representation Learning in ASR: Reality or Illusion?

Authors: Srivastava, Brij Mohan Lal; Bellet, Aurélien; Tommasi, Marc; Vincent, Emmanuel
Published in: INTERSPEECH 2019, 15/09/2019, 2019, Page(s) 3700-3704
Publisher: ISCA

Introducing the VoicePrivacy initiative

Authors: Tomashenko, Natalia; Srivastava, Brij Mohan Lal; Wang, Xin; Vincent, Emmanuel; Nautsch, Andreas; Yamagishi, Junichi; Evans, Nicholas; Patino, Jose; Bonastre, J.-F; Noé, Paul-Gauthier; Todisco, Massimiliano
Published in: INTERSPEECH 2020, 25/10/2020, 2020
Publisher: ISCA

Evaluating Voice Conversion-based Privacy Protection against Informed Attackers

Authors: Srivastava, Brij Mohan Lal; Vauquier, Nathalie; Sahidullah, Md; Bellet, Aurélien; Tommasi, Marc; Vincent, Emmanuel
Published in: 2020 IEEE International Conference on Acoustics, Speech, and Signal Processing, 04/05/2020, 2020, Page(s) 2802-2806
Publisher: IEEE

Investigating the Impact of Pre-trained Word Embeddings on Memorization in Neural Networks

Authors: Thomas, Aleena; Adelani, David; Davody, Ali; Mogadala, Aditya; Klakow, Dietrich
Published in: 23rd International Conference on Text, Speech and Dialogue, 08/09/2020, 2020
Publisher: Springer

Design Choices for X-vector Based Speaker Anonymization

Authors: Srivastava, Brij Mohan Lal; Tomashenko, Natalia; Wang, Xin; Vincent, Emmanuel; Yamagishi, Junichi; Maouche, Mohamed; Bellet, Aurélien; Tommasi, Marc
Published in: INTERSPEECH 2020, 25/10/2020, 2020
Publisher: ISCA

The COMPRISE Cloud Platform

Authors: Skadiņš, Raivis; Salimbajevs, Askars
Published in: 1st International Workshop on Language Technology Platforms, 16/05/2020, 2020, Page(s) 108–111
Publisher: European Language Resources Association

Using Privacy-Transformed Speech in the Automatic Speech Recognition Acoustic Model Training

Authors: Salimbajevs, Askars
Published in: 9th International Conference on Human Language Technologies – the Baltic Perspective, 22/09/2020, 2020
Publisher: IOS Press

Assessing Unintended Memorization in Neural Discriminative Sequence Models

Authors: Helali, Mossad; Kleinbauer, Thomas; Klakow, Dietrich
Published in: 23rd International Conference on Text, Speech and Dialogue, 08/09/2020, 2020
Publisher: Springer

Private Protocols for U-Statistics in the Local Model and Beyond

Authors: Bell, James; Bellet, Aurélien; Gascón, Adrià; Kulkarni, Tejas
Published in: International Conference on Artificial Intelligence and Statistics, 27/08/2020, 2020, Page(s) 1573-1583
Publisher: Proceedings of Machine Learning Research

Data Augmentation for Pipeline-Based Speech Translation

Authors: Alves, Diego; Salimbajevs, Askars; Pinnis, Mārcis
Published in: 9th International Conference on Human Language Technologies – the Baltic Perspective, 22/09/2020, 2020
Publisher: IOS Press

A Comparative Study of Speech Anonymization Metrics

Authors: Maouche, Mohamed; Srivastava, Brij Mohan Lal; Vauquier, Nathalie; Bellet, Aurélien; Tommasi, Marc; Vincent, Emmanuel
Published in: INTERSPEECH 2020, 25/10/2020, 2020
Publisher: ISCA

On Semi-Supervised LF-MMI Training of Acoustic Models with Limited Data

Authors: Sheikh, Imran; Vincent, Emmanuel; Illina, Irina
Published in: INTERSPEECH 2020, 25/10/2020, 2020
Publisher: ISCA

Achieving Multi-Accent ASR via Unsupervised Acoustic Model Adaptation

Authors: Turan, M. A. Tuğtekin; Vincent, Emmanuel; Jouvet, Denis
Published in: INTERSPEECH 2020, 25/10/2020, 2020
Publisher: ISCA

Privacy Guarantees for De-identifying Text Transformations

Authors: Adelani, David Ifeoluwa; Davody, Ali; Kleinbauer, Thomas; Klakow, Dietrich
Published in: INTERSPEECH 2020, 25/10/2020, 2020
Publisher: ISCA

Who Started this Rumor? Quantifying the Natural Differential Privacy Guarantees of Gossip Protocols

Authors: Bellet, Aurélien; Guerraoui, Rachid; Hendrikx, Hadrien
Published in: 34th International Symposium on Distributed Computing, 12/10/2020, 2020
Publisher: LIPIcs

Fully Decentralized Joint Learning of Personalized Models and Collaboration Graphs

Authors: Zantedeschi, Valentina; Bellet, Aurélien; Tommasi, Marc
Published in: 23rd International Conference on Artificial Intelligence and Statistics, 26/08/2020, 2020
Publisher: PMLR

MasakhaNER: Named Entity Recognition for African Languages

Authors: David Ifeoluwa Adelani; Jade Abbott; Graham Neubig; Daniel D’souza; Julia Kreutzer; Constantine Lignos; Chester Palen-Michel; Happy Buzaaba; Shruti Rijhwani; Sebastian Ruder; Stephen Mayhew; Israel Abebe Azime; Shamsuddeen H. Muhammad; Chris Chinenye Emezue; Joyce Nakatumba-Nabende; Perez Ogayo; Aremu Anuoluwapo; Catherine Gitau; Derguene Mbaye; Jesujoba Alabi; Seid Muhie Yimam; Tajuddeen Rabiu
Published in: Transactions of the ACL, 07/10/2021, 2021, ISSN 2307-387X
Publisher: MIT Press
DOI: 10.1162/tacl_a_00416

How can Private Information Recorded by Voice-Enabled Systems be Identified?

Authors: Moretón Poch, Álvaro; Jaramillo, Ariadna
Published in: European Data Protection Law Review, to appear, 2020, ISSN 2364-2831
Publisher: Lexxion

Monolingual and cross-lingual intent detection without training data in target languages

Authors: Jurgita Kapočiūtė-Dzikienė; Askars Salimbajevs; Raivis Skadiņš
Published in: Electronics, 11/06/2021, 2021, ISSN 2079-9292
Publisher: MDPI
DOI: 10.3390/electronics10121412

Datasets

COMPRISE_Data13_Y-BBCTopics_V1.0

Authors: Adelani, David; Hedderich, Michael
Published in: Zenodo

COMPRISE_Data04_CommonVoice_V1.0

Authors: Srivastava, Brij Mohan Lal
Published in: Zenodo

COMPRISE_Data12_Y-GV-NER_V1.0

Authors: Adelani, David
Published in: Zenodo

COMPRISE_Data03_LibriSpeech_V1.0

Authors: Srivastava, Brij Mohan Lal
Published in: Zenodo

COMPRISE_Data10_MENYO-20K_V1.0

Authors: Adelani, David
Published in: Zenodo

COMPRISE_Data11_M-NER_V1.0

Authors: Adelani, David
Published in: Zenodo

COMPRISE_Data06_YELP_V1.0

Authors: Adelani, David
Published in: Zenodo

COMPRISE_Data09_H-VOA-Topics_V1.0

Authors: Adelani, David; Hedderich, Michael
Published in: Zenodo

COMPRISE_Data08_H-VOA-NER_V1.0

Authors: Adelani, David; Hedderich, Michael
Published in: Zenodo