Skip to main content

Machine learning to augment shared knowledge in federated privacy-preserving scenarios


Project communication and engagement activities

Details for communication and engagement activities and materials along with the time line and success indicators. It includes a record of communication activities that have been undertaken during the first half of the project, and those planned for the second period.

Architecture design – Initial version

A document describing the first version of the platform architecture, how the architecture meets the (initial) requirements of the federated and privacy-preserving machine learning services designed in WP 4.1, how it addresses the (initial) user stories developed in WP7, and how it aligns with existing Industrial Data Platform standards.

Industrial and technical requirements

Report containing an exhaustive list of domain-specific and business requirements coming from the two scenarios and from other complementary domains, and assessment of available datasets. Report containing a complete specification of technical requirements to drive technical developments in WPs 3,4,5,6 and WP7 integration.

Key performance indicators selection and definition

Detailed description of the technical and domain business-specific KPIs that will be used for validating the MUSKETEER data platform.

Client connectors’ architecture design – Initial version

A document describing the main functionalities of the client connector. It will contain the design of the component and how it interact with services at server side. This is the first version of the report.

Architecture design – Final version

A document describing the final version of the platform architecture, how it meets the final requirements of the federated and privacy-preserving machine learning services designed in WP 4.1, how it addresses the final user stories developed in WP7, how it supports incorporating active security measures against adversarial attacks (data poisoning, evasion), and how it aligns with existing Industrial Data Platform standards.

Investigative overview of targeted architecture and algorithms

A technical report containing the final structure of the privacy operation modes describing how every algorithm will operate over these modes and the details about the SW architecture and design patterns that will facilitate future extensions of the SW library.

Threat analysis for federated machine learning algorithms

A report describing the main threats and vulnerabilities that may be present in federated machine learning algorithms considering both, attacks at training and test time and defining requirements for the design, deployment and testing of federated machine learning algorithms. This report would also form a strong basis from which governance and or legislative input could be drawn.

Dissemination and communication plan

his task aims to detail the strategy for dissemination of project results through appropriate channels and during various stages of the project. The underlying goal of dissemination activities will to ensure public awareness of the project and promote interest. This task will ensure practical dissemination using the different instruments mentioned in WP8.

Assessment Framework design and specification

A document describing the main common evaluation framework. It will contain the design of the different tests and datasets that will be used in the evaluation, as well as the merit performance measurements to be obtained.

Project website and communication material

MUSKETEER public website, to be active and regularly updated during the whole project and maintained for 1 year following the project’s completion. The MUSKETEER factsheet will be an early leaflet for dissemination and communication purposes, including the most relevant information of the project in a nutshell, and will be available from the very beginning.

First prototype of the MUSKETEER platform

A demonstration (and report) of a first prototype of the MUSKETEER platform, demonstrating the end-to-end execution of data sharing and federated machine learning for synthetic data and at least one use case, supporting privacy operating modes POM1-POM3. Includes demonstration of basic dashboard reporting.

Pre-processing, normalization, data alignment and data value estimation algorithms – Initial version

This deliverable is in the form of software will present Version 0 of the library covering a set of pre-processing steps needed before using the machine learning algorithms. Models for the estimation of the data value will also be provided.

Searching for OpenAIRE data...


Increasing trust within an ecosystem with Federated learning

Author(s): S. Bonura, D. Dalle Carbonare, R. Díaz-Morales, A. Navia-Vázquez, M. Purcell and S. Rossello
Published in: BDVA Book Chapter, 2021

Data protection in the era of artificial intelligence. Trends, existing solutions and recommendations for  privacy-preserving technologies

Author(s): T. Timan, Z. Mann (Eds.), R. Araujo, A. Crespo-García, A. Farkash, A. Garnier, A. Vivian-Kiousi, P. Koster, A. Kung, G. Livraga, R. Díaz-Morales, M. Önen, A. Pa-lomares, A. Navia-Vázquez, A. Metzger (contributors).
Published in: BDVA position paper, 2019

Big Data challenges in Smart Manufacturing Industry

Author(s): Gusmeroli S., Dalle Carbonare D. (eds).
Published in: BDVA Whitepaper, 2020

Privacy Preserving Technologies for Trusted Data Spaces

Author(s): S. Bonura, D. Dalle Carbonare, R. Díaz-Morales, M. Fernández-Díaz, L. Morabi-to, L. Muñoz-González,  C. Napione, A. Navia-Vázquez, M. Purcell
Published in: BDVA Book Chapter, 2021

Security and Robustness in Federated Machine Learning

Author(s): A. Rawat, G. Zizzo, M. Zaid Hameed, and L. Muñoz-González
Published in: Federated Learning: A Comprehensive Overview of Methods and Applications, 2021

Recycling weak labels for multiclass classification

Author(s): Miquel Perello-Nieto; Raul Santos-Rodriguez; Dario Garcia-Garcia; Jesús Cid-Sueiro
Published in: Journal of Neurocomputing, 2020, Page(s) 206-215, ISSN 0925-2312
DOI: 10.1016/j.neucom.2020.03.002

Privacy-Preserving Distributed Support Vector Machines

Author(s): Bottoni, Simone, Stefano Braghin, Theodora Brisimi, and Alberto Trombetta
Published in: Heterogeneous Data Management, Polystores, and Analytics for Healthcare: VLDB Workshops, Poly 2021 and DMAH 2021, 2021, Page(s) 85-102

FAT: Federated Adversarial Training

Author(s): G. Zizzo, A. Rawat, M. Sinn, B. Buesser
Published in: NeurIPS 2020 Workshop on Scalability, Privacy, and Security in Federated Learning (SpicyFL), 2020

Defending against poisoning attacks in online learning settings

Author(s): Greg Collinge, Emil C Lupu, Luis Muñoz-González
Published in: ESANN 2019 proceedings, European Symposium on Artificial Neural Networks, Computational Intelligence and Machine Learning, Issue Year., 2019, Page(s) 43-48

Double Confidential Federated Machine Learning Logistic Regression for Industrial Data Platforms

Author(s): A. Navia-Vazquez M. Vazquez-Lopez J. Cid-Sueiro
Published in: FL-ICML 2020 : International Workshop on Federated Learning for User Privacy and Data Confidentiality in Conjunction with ICML 2020, 2020