Periodic Reporting for period 1 - TEADAL (Trustworthy, Energy-Aware federated DAta Lakes along the computing continuum)
Periodo di rendicontazione: 2022-09-01 al 2024-02-29
adopted to assist organizations in making reliable, accurate, and fast decisions. Although the initial approaches to address these issues
saw the data lakes as the evolution of data warehouses to be implemented on-premises, cloud providers are nowadays including in
their offerings platforms able to set up and run them. Nevertheless, the increasing amount of data generated at the edge and the
need to enable data sharing among organizations are posing new challenges in terms of performance, energy efficiency, and
privacy/confidentiality, which can be properly addressed with data lakes which are deployed along the whole computing continuum
as well as building a federation of such data lakes.
The ambition of TEADAL is to provide key cornerstone technologies to create stretched data lakes spanning the cloud-edge
continuum and multi-cloud, providing privacy, confidentiality, and energy-efficient data management. The TEADAL data lake
technologies will enable trusted, verifiable, and energy efficient data flows, both in a stretched data lake and across a trustworthy
mediators federation of them, based on a shared approach for defining, enforcing, and tracking privacy/confidentiality
requirements balanced with the need for energy reduction.
- Requirement collection from the 5 active pilots was conducted.
- Design of the Teadal Node Architecture.
- First (mvp) version of the Teadal Node was developed and integrated.
- All 5 Pilots have the Teadal Node mvp version deployed/installed.
The proposed solution aims to lower the effort required for efficient energy and data analysis with privacy- and confidentiality-by-design data management, thus reducing the barriers to sharing and using data among organizations to provide timely and reliable informed decisions. TEADAL will be beneficial for data lake providers, which can exploit additional functions in terms of definition, enforcement, and tracking of privacy- /confidentiality-based policies, as well as business analysts and data scientists which can make full use of data coming from several organizations without implementing complex systems to ensure privacy/confidentiality requirements.