Skip to main content
European Commission logo
English English
CORDIS - EU research results
CORDIS
CORDIS Web 30th anniversary CORDIS Web 30th anniversary

Automated end-to-end data life cycle management for FAIR data integration, processing and re-use

Project description

Data-centric business models for data lifecycle governance

Integrating data from multiple sources using AI-powered applications can provide organisations with a competitive edge. However, manual data lifecycle governance often results in data silos. There is a need for a systematic mechanism to collect, combine, and process data to develop new data-centric business models. The EU-funded CyclOps project proposes a new framework designed to handle large volumes of data generated from diverse sources to manage and maintain the complete data lifecycle. The project uses knowledge graphs (KGs) to automate the generation and execution of data processing pipelines while incorporating human input. CyclOps aims to enable organisations to easily share, access, and analyse both machine and human-generated data from various data spaces, thereby facilitating the provision of value-added services.

Objective

The ability to integrate data from multiple sources is nowadays a major competitive advantage for organizations. Data-driven applications using AI techniques are reshaping various industries such as manufacturing, tourism, and mobility. The European Strategy for Data aims to create a single market for data while ensuring Europe's global competitiveness and data sovereignty. This has led to the development of Common European data spaces, yet the governance of the data life cycle in organizations has not kept up with the rapid technology evolution and remains largely manual. This is especially evident in scenarios where tens or hundreds of continuously evolving data sources produce semi-structured data, and create significant challenges when governed manually, causing organizations to end up with data silos. A systematic and standardized mechanism is needed to ingest, integrate, and process data, thus boosting the ability to develop new data-centric business models. However, current research and development efforts typically target one aspect of the end-to-end data lifecycle, such as scalable data management, ML performance, AI explainability, or sharing, while dismissing its governance. To overcome this limitation, CyclOps proposes a new framework for the governance and maintenance of the complete data lifecycle for large-scale volumes of data generated in heterogeneous distributed sources to enable data sharing and exchange. CyclOps intelligently automates, by means of knowledge graphs (KGs) and with a human-in-the-loop approach, the generation and execution of data processing pipelines. KGs are the established formal models to represent data and metadata while providing context and guaranteeing interoperability with other systems adhering to the FAIR Guiding Principles. CyclOps will enable organizations to seamlessly provide, cross and analyze machine- and human-generated data from and for data spaces, thus facilitating the provision of added-value services on top.

Coordinator

NTT DATA SPAIN, SL
Net EU contribution
€ 730 286,25
Address
CAMINO FUENTE DE LA MORA 1
28050 Madrid
Spain

See on map

Region
Comunidad de Madrid Comunidad de Madrid Madrid
Activity type
Private for-profit entities (excluding Higher or Secondary Education Establishments)
Links
Total cost
€ 780 947,50

Participants (26)