Periodic Reporting for period 1 - enRichMyData (Enabling Data Enrichment Pipelines for AI-driven Business Products and Services)
Période du rapport: 2022-10-01 au 2023-09-30
The objective of enRichMyData is to develop an open software toolbox – the enRichMyData toolbox – comprising practical, robust and scalable components to support organizations in enriching their data with reference data they may have limited knowledge of, as well as supporting data providers in making their data reusable and available in data enrichment processes. The aim of the toolbox is to lower the technological entry barriers by providing support for the definition of highly scalable and replicable data enrichment pipelines through a set of tools and infrastructure services related to capabilities needed during the lifecycle of enrichment pipelines. The toolbox will make the data enrichment process accessible to a wider set of stakeholders by reducing the level of expertise required and enhancing the level of tool support.
The process consisted of an interview-based analysis of the Business Cases, paying close attention to current data management processes as a baseline for the improvements and innovations that will be enabled by taking into use the enRichMyData toolbox. The business case requirements have been documented in Deliverable D4.1.
In addition, an extensive analysis of the state of the art in research on data enrichment, as well as an overview of the state of practice regarding tools for data enrichment both within and outside the enRichMyData consortium. The results of these analyses are documented in Deliverable D4.1.
Based on this work, initial versions of the enRichMyData tools and the toolbox have been designed and implemented, resulting in Deliverables D2.1 and D3.1.