Cost-effective, Multilingual, Privacy-driven voice-enabled Services

Informazioni relative al progetto

COMPRISE

ID dell’accordo di sovvenzione: 825081

DOI

10.3030/825081

Progetto chiuso

Data della firma CE 24 Ottobre 2018

Data di avvio 1 Dicembre 2018

Data di completamento 30 Novembre 2021

Finanziato da

INDUSTRIAL LEADERSHIP - Leadership in enabling and industrial technologies - Information and Communication Technologies (ICT)

Costo totale

€ 3 201 016,25

Contributo UE

€ 3 201 016,00

3 201 016,00

0,25

Coordinato da

INSTITUT NATIONAL DE RECHERCHE EN INFORMATIQUE ET AUTOMATIQUE
France

CORDIS fornisce collegamenti ai risultati finali pubblici e alle pubblicazioni dei progetti ORIZZONTE.

I link ai risultati e alle pubblicazioni dei progetti del 7° PQ, così come i link ad alcuni tipi di risultati specifici come dataset e software, sono recuperati dinamicamente da .OpenAIRE .

Risultati finali

Initial COMPRISE SDK prototype

First prototype integrating the research results of WP2 WP3 and T41

Final platform demonstrator and updated data protection and GDPR requirements

Final platform demonstrator populated with additional data and trained models Updated version of D51 taking the latest research and legal advances into account

Initial platform demonstrator

Fully functional platform demonstrator implemented populated with a few initial data and trained models deployed in working environment ready for use in other WPs

Final COMPRISE SDK prototype and documentation

Final prototype integrating the research results of WP2 WP3 and T41 and Swagger online documentation

Second dissemination and communication report

Update on the actions conducted lessons learned on the innovation ecosystem proposed in COMPRISE and planned dissemination and communication after the end of the project

Dissemination and communication action plan

Summarises all planned dissemination actions and provides communication material to the partners (logo, graphical chart, public website, short presentation of the project, templates, poster, leaflet)

First dissemination and communication report

Update on the actions conducted and web document defining the COMPRISE knowledge repository on multilingual voice-enabled applications.

Final weakly supervised learning library

Final design implementation and evaluation of weakly supervised learning for all considered tasks

Baseline speech and text transformation and model learning library

Design, implementation, and evaluation of baseline transformations focusing on deleting the user’s identity and words carrying critical information, and model learning.

Improved transformation library and initial privacy guarantees

Design, implementation, and evaluation of speech and text transformations addressing more types of private information and initial statistical utility/privacy bounds.

Final personalised learning library

Final design implementation and evaluation of model personalisation strategies for speechtotext spoken language understanding and dialog management

Initial multilingual interaction library

Software components and documentation for speech-to-speech translation and integration of dialog systems in the operating branch.

Final transformation library and privacy guarantees

Final design implementation and evaluation of speech and text transformations and final statistical utilityprivacy bounds

Data collection and curation features of the platform

Platform with data collection and curation features implemented and deployed in working environment

Final multilingual interaction library

Software components and documentation for speechtospeech translation and integration of dialog systems in both the operating and the training branch

Initial weakly supervised learning library

Design, implementation, and evaluation of weakly supervised learning for spoken language understanding.

Initial personalised learning library for speech-to-text

Design, implementation, and evaluation of initial model personalisation strategies for speech-to-text.

Initial data management plan
Final data management plan

Initial scientific evaluation

First combined evaluation of baseline speech, dialog, and translation tools.

Platform hardware and software architecture

Platform specification including main requirements, hardware and software architecture

SDK software architecture

SDK specification including main requirements and software architecture

Data protection and GDPR requirements

Guidelines, procedures and recommendations how to implement personal data protection in the platform.

Pubblicazioni

Transfer Learning and Distant Supervision for Multilingual Transformer Models: A Study on African Languages

Autori: Michael A. Hedderich; David Ifeoluwa Adelani; Dawei Zhu; Jesujoba O. Alabi; Udia Markus; Dietrich Klakow
Pubblicato in: 2020 Conference on Empirical Methods in Natural Language Processing, Numero 16/11/2020, 2020
Editore: ACL
DOI: 10.18653/v1/2020.emnlp-main.204

Distant supervision and noisy label learning for low resource named entity recognition: A study on Hausa and Yorùbá

Autori: Adelani, David Ifeoluwa; Hedderich, Michael,; Zhu, Dawei; Van Den Berg, Esther; Klakow, Dietrich
Pubblicato in: AfricaNLP / PML4DC Workshop 2020 @ICLR 2020, Numero 26/04/2020, 2020
Editore: OpenReview.net

Preventing author profiling through zero-shot multilingual back-translation

Autori: Adelani, David,; Zhang, Miaoran; Shen, Xiaoyu; Davody, Ali; Kleinbauer, Thomas; Klakow, Dietrich
Pubblicato in: 2021 Conference on Empirical Methods in Natural Language Processing, Numero 07/11/2021, 2021
Editore: ACL

The effect of domain and diacritics in Yorùbá-English neural machine translation

Autori: Adelani, David,; Ruiter, Dana; Alabi, Jesujoba,; Adebonojo, Damilola; Ayeni, Adesina; Adeyemi, Mofetoluwa; Awokoya, Ayodele; Espana-Bonet, Cristina
Pubblicato in: 18th Biennial Machine Translation Summit, Numero 16/08/2021, 2021
Editore: AMTA

Benchmarking and challenges in security and privacy for voice biometrics

Autori: Bonastre, Jean-Francois; Delgado, Hector; Evans, Nicholas; Kinnunen, Tomi; Lee, Kong Aik; Liu, Xuechen; Nautsch, Andreas; Noe, Paul-Gauthier; Patino, Jose; Sahidullah, Md; Srivastava, Brij Mohan Lal; Todisco, Massimiliano; Tomashenko, Natalia; Vincent, Emmanuel; Wang, Xin; Yamagishi, Junichi
Pubblicato in: 1st ISCA Symposium on Security and Privacy in Speech Communication, Numero 10/11/2021, 2021
Editore: ISCA

Privacy-Preserving Adversarial Representation Learning in ASR: Reality or Illusion?

Autori: Srivastava, Brij Mohan Lal; Bellet, Aurélien; Tommasi, Marc; Vincent, Emmanuel
Pubblicato in: INTERSPEECH 2019, Numero 15/09/2019, 2019, Pagina/e 3700-3704
Editore: ISCA

Introducing the VoicePrivacy initiative

Autori: Tomashenko, Natalia; Srivastava, Brij Mohan Lal,; Wang, Xin; Vincent, Emmanuel; Nautsch, Andreas; Yamagishi, Junichi; Evans, Nicholas; Patino, Jose; Bonastre, J.-F; Noé, Paul-Gauthier; Todisco, Massimiliano
Pubblicato in: INTERSPEECH 2020, Numero 25/10/2020, 2020
Editore: ISCA

Evaluating Voice Conversion-based Privacy Protection against Informed Attackers

Autori: Srivastava, Brij Mohan Lal; Vauquier, Nathalie; Sahidullah, Md; Bellet, Aurélien; Tommasi, Marc; Vincent, Emmanuel
Pubblicato in: 2020 IEEE International Conference on Acoustics, Speech, and Signal Processing, Numero 04/05/2020, 2020, Pagina/e 2802-2806
Editore: IEEE

Investigating the Impact of Pre-trained Word Embeddings on Memorization in Neural Networks

Autori: Thomas, Aleena; Adelani, David; Davody, Ali; Mogadala, Aditya; Klakow, Dietrich
Pubblicato in: 23rd International Conference on Text, Speech and Dialogue, Numero 08/09/2020, 2020
Editore: Springer

Design Choices for X-vector Based Speaker Anonymization

Autori: Srivastava, Brij Mohan Lal; Tomashenko, Natalia; Wang, Xin; Vincent, Emmanuel; Yamagishi, Junichi; Maouche, Mohamed; Bellet, Aurélien; Tommasi, Marc
Pubblicato in: INTERSPEECH 2020, Numero 25/10/2020, 2020
Editore: ISCA

The COMPRISE Cloud Platform

Autori: Skadiņš, Raivis; Salimbajevs, Askars
Pubblicato in: 1st International Workshop on Language Technology Platforms, Numero 16/05/2020, 2020, Pagina/e 108–111
Editore: European Language Resources Association

Using Privacy-Transformed Speech in the Automatic Speech Recognition Acoustic Model Training

Autori: Salimbajevs, Askars
Pubblicato in: 9th International Conference on Human Language Technologies – the Baltic Perspective, Numero 22/09/2020, 2020
Editore: IOS Press

Assessing Unintended Memorization in Neural Discriminative Sequence Models

Autori: Helali, Mossad; Kleinbauer, Thomas; Klakow, Dietrich
Pubblicato in: 23rd International Conference on Text, Speech and Dialogue, Numero 08/09/2020, 2020
Editore: Springer

Private Protocols for U-Statistics in the Local Model and Beyond

Autori: Bell, James; Bellet, Aurélien; Gascón, Adrià; Kulkarni, Tejas
Pubblicato in: International Conference on Artificial Intelligence and Statistics, Numero 27/08/2020, 2020, Pagina/e 1573-1583
Editore: Proceedings of Machine Learning Research

Data Augmentation for Pipeline-Based Speech Translation

Autori: Alves, Diego; Salimbajevs, Askars; Pinnis, Mārcis
Pubblicato in: 9th International Conference on Human Language Technologies – the Baltic Perspective, Numero 22/09/2020, 2020
Editore: IOS Press

A Comparative Study of Speech Anonymization Metrics

Autori: Maouche, Mohamed; Srivastava, Brij Mohan Lal; Vauquier, Nathalie; Bellet, Aurélien; Tommasi, Marc; Vincent, Emmanuel
Pubblicato in: INTERSPEECH 2020, Numero 25/10/2020, 2020
Editore: ISCA

On Semi-Supervised LF-MMI Training of Acoustic Models with Limited Data

Autori: Sheikh, Imran; Vincent, Emmanuel; Illina, Irina
Pubblicato in: INTERSPEECH 2020, Numero 25/10/2020, 2020
Editore: ISCA

Achieving Multi-Accent ASR via Unsupervised Acoustic Model Adaptation

Autori: Turan, M. A. Tuğtekin; Vincent, Emmanuel; Jouvet, Denis
Pubblicato in: INTERSPEECH 2020, Numero 25/10/2020, 2020
Editore: ISCA

Privacy Guarantees for De-identifying Text Transformations

Autori: Adelani, David Ifeoluwa; Davody, Ali; Kleinbauer, Thomas; Klakow, Dietrich
Pubblicato in: INTERSPEECH 2020, Numero 25/10/2020, 2020
Editore: ISCA

Who Started this Rumor? Quantifying the Natural Differential Privacy Guarantees of Gossip Protocols

Autori: Bellet, Aurélien; Guerraoui, Rachid; Hendrikx, Hadrien
Pubblicato in: 34th International Symposium on Distributed Computing, Numero 12/10/2020, 2020
Editore: LIPIcs

Fully Decentralized Joint Learning of Personalized Models and Collaboration Graphs

Autori: Zantedeschi, Valentina; Bellet, Aurélien; Tommasi, Marc
Pubblicato in: 23rd International Conference on Artificial Intelligence and Statistics, Numero 26/08/2020, 2020
Editore: PMLR

Anonymisation and re-identification risk for voice data

Autori: Moretón, Alvaro; Jaramillo, Ariadna
Pubblicato in: European Data Protection Law Review, Numero 19/07/2021, 2021, ISSN 2364-284X
Editore: Lexxion

MasakhaNER: Named Entity Recognition for African Languages

Autori: David Ifeoluwa Adelani; Jade Abbott; Graham Neubig; Daniel D’souza; Julia Kreutzer; Constantine Lignos; Chester Palen-Michel; Happy Buzaaba; Shruti Rijhwani; Sebastian Ruder; Stephen Mayhew; Israel Abebe Azime; Shamsuddeen H. Muhammad; Chris Chinenye Emezue; Joyce Nakatumba-Nabende; Perez Ogayo; Aremu Anuoluwapo; Catherine Gitau; Derguene Mbaye; Jesujoba Alabi; Seid Muhie Yimam; Tajuddeen Rabiu
Pubblicato in: Transactions of the ACL, Numero 07/10/2021, 2021, ISSN 2307-387X
Editore: MIT Press
DOI: 10.1162/tacl_a_00416

How can Private Information Recorded by Voice-Enabled Systems be Identified?

Autori: Moretón Poch, Álvaro; Jaramillo, Ariadna
Pubblicato in: European Data Protection Law Review, Numero 12/10/2020, 2020, ISSN 2364-2831
Editore: Lexxion

Monolingual and cross-lingual intent detection without training data in target languages

Autori: Jurgita Kapočiūtė-Dzikienė; Askars Salimbajevs; Raivis Skadiņš
Pubblicato in: Electronics, Numero 11/06/2021, 2021, ISSN 2079-9292
Editore: MDPI
DOI: 10.3390/electronics10121412

The VoicePrivacy 2020 Challenge: Results and findings

Autori: Tomashenko, Natalia; Wang, Xin; Vincent, Emmanuel; Patino, Jose; Srivastava, Brij Mohan Lal; Noé, Paul-Gauthier; Nautsch, Andreas; Evans, Nicholas; Yamagishi, Junichi; O'Brien, Benjamin; Chanclu, Anaïs; Bonastre, Jean-François; Todisco, Massimiliano; Maouche, Mohamed
Pubblicato in: Computer Speech and Language, Numero to appear, 2022, ISSN 1095-8363
Editore: Elsevier

È in corso la ricerca di dati su OpenAIRE...

Risultati finali

Pubblicazioni

Scarica Scarica il contenuto della pagina