Periodic Reporting for period 1 - DS2 (DataSpace, DataShare 2.0)
Período documentado: 2024-01-01 hasta 2025-06-30
The overall objective of DS2 is to develop a data space interoperability solution that enables data space members to share data using data space technology with members of another data space, when these two data spaces have agreed to do so. The interoperability solution needs to support the data sovereignty and governance through the complex life cycle of data. Because data spaces and DS2 are novel ways for a company/organisation to get and share data with third parties, it needs to be integrated with their data management systems. The DS2 will also tackle the problems with managing and governing these aspects, and with supporting process where human involvement is needed with advanced AI/LLM technologies.
Data space interoperability is needed in common European data spaces that are essentially sectoral interoperability solutions with data spaces as their members. Data sharing enabled by them and DS2 solution can have significant impacts in enabling the AI/LLM revolution in industrial and commercial settings, where data is not openly available. Development, maintenance and use of AI is fully dependent on the data, and DS2 provides means to add confidential data as an element in these processes. More traditional use cases such as supply network optimisation, asset management, various cyber-physical system, and smart services will benefit from data that is not otherwise accessible. The expected economic impact of data economy is hundreds of billions of Euros.
· definition of data space interoperability problems space,
· the analysis of trust creation and trust enforcement processes,
· analysis of regulatory landscape of data spaces,
· data product and data sharing agreement concepts,
· specification of the three pilot use cases and defining the use cases setups,
· creation of DS2 architecture model, including ideas for deployment, interoperability, and data pipeline systems,
· specification of more than 20 DS2 modules including research related to how the initial ideas of the module functionalities could be implemented,
· implementation of first module prototypes, and
· initial analysis of business model and sustainability of data spaces and interoperability operator.
The main achievements at M18 are
· deep understanding of the various alternatives and their future directions of data spaces, and a scientific paper about the core features of data spaces,
· conference paper on the relationship between European regulation and data spaces’ innovation potential,
· user survey on end-users’ attitudes related to data sharing and data spaces,
· data space interoperability architecture model,
· DS2 module documentation in DS2 GitHub that provides the basic information of all developed modules.
Several DS2 modules contain functionality that goes beyond current state of the art.
· Natural language assistant for configuration of data processing pipelines and DS2 functionalities for supporting user’s data management
· Data analysis and modelling method for supporting and improving the queries to meta-data catalogue and LLM services through improved understanding of the context and language independency
· Natural language assistant for data product search from data product catalogue
· Natural language service for accessing complex data from data sources