Final Report Summary - PRODIMA (PRODIMA: PRObabilistic Data and information Integration with provenance MAnagement)
With probabilistic information integration, PRODIMA has tackled a crucial problem at the core of our knowledge and information society, namely to deal in a meaningful way with huge amounts of distributed and independently created data and information, as available in the Web, but also in many other environments nowadays. Information integration is one of the main current challenges of information technology. It has strong relations to the challenge of managing big data, which are usually stored in a distributed data environment. Solutions are desperately needed, and they have to deal with uncertainty, which e.g. results from the automatic creation of (huge amounts of) mappings. Hence, obviously, the results and insights gained in PRODIMA are very important for our knowledge and information society by contributing to making intelligent access to (integrated) information easier, faster, and better. In addition, the results of the project pave the way to the development of profitable business solutions on an international level. In addition, the results and insights on provenance in general and on provenance in probabilistic information integration have a huge impact on other areas as well. Provenance is a very fundamental concept that is important in many areas (very much like information integration itself) such as digital preservation and scientific data management, to name a few. The application-independent results of PRODIMA affect all these areas as well. For example, consider archives where provenance is the most important principle for organizing archival records. The insights of PRODIMA have the potential to also lead to a significant improvement of digital preservation strategies, and, hence, to better digital archives. In this way, both the preservation of and the access to digital cultural heritage will be enhanced.