Skip to main content
Go to the home page of the European Commission (opens in new window)
English English
CORDIS - EU research results
CORDIS

Data Engineering for Data Science

CORDIS provides links to public deliverables and publications of HORIZON projects.

Links to deliverables and publications from FP7 projects, as well as links to some specific result types such as dataset and software, are dynamically retrieved from OpenAIRE .

Deliverables

Research Plan on Storage and Processing (opens in new window)
Research Plan on Preparation (opens in new window)
State of the Art on Analysis (opens in new window)
Doctoral Training Plan (opens in new window)

Doctoral Training Plan will summarise the objectives of the doctoral training of the ESRs

First Comm., Dissem., and TT Report (opens in new window)

First Comm Dissem and TT Report will provide the results of the dissemination actions during the first reporting period

State of the Art on Preparation (opens in new window)
State of the Art on Data Governance (opens in new window)
Selection Report (opens in new window)

Selection Report will summarise the ESR’s application, selection, and recruitment process, and the procedures for the continuous monitoring and evaluation of the ESRs

Comm., Dissem., and TT Plan (opens in new window)

Comm., Dissem., and TT Plan will describe the actions ensuring a maximal outreach of the project activities and results.

Quality Assement Plan (opens in new window)

Quality Assessment Plan will summarise the procedure for assessing the programme and define an initial set of KPIs.

Research Plan on Analysis (opens in new window)
State of the Art on Storage and Processing (opens in new window)
Research Plan on Data Governance (opens in new window)

Publications

Assessing adversarial attacks in real-world fraud detection (opens in new window)

Author(s): Daniele Lunghi, Alkis Simitsis, Gianluca Bontempi
Published in: 2024 IEEE International Conference on Web Services (ICWS), Issue 1807.01069, 2024, Page(s) 27-34
Publisher: IEEE
DOI: 10.1109/icws62655.2024.00021

Impact of filter feature selection on classification: an empirical study

Author(s): Njoku, Uchechukwu Fortune; Abelló Gamazo, Alberto; Bilalli, Besim; Bontempi, Gianluca
Published in: DOLAP co-located with EDBT/ICDT, Issue 3130, 2022
Publisher: CEUR-WS.org

Adversarial Learning in Real-World Fraud Detection: Challenges and Perspectives (opens in new window)

Author(s): Lunghi, Daniele and Simitsis, Alkis and Caelen, Olivier and Bontempi, Gianluca
Published in: Proceedings of the Second ACM Data Economy Workshop, 2023, Page(s) 27--33
Publisher: Association for Computing Machinery
DOI: 10.1145/3600046.3600051

Performance Analysis of Distributed GPU-Accelerated Task-Based Workflows (opens in new window)

Author(s): Marcos N. L. Carvalho, Anna Queralt, Oscar Romero, Alkis Simitsis, Cristian Tatu, Rosa M. Badia
Published in: EDBT/ICDT 2024, 2024
Publisher: OpenProceedings.org
DOI: 10.48786/edbt.2024.59

Quality of Hybrid GNSS Sampling Methods (opens in new window)

Author(s): Rodrigo Sasse David, Kristian Torp, Anders Zinck Justesen, Mahmoud Sakr, Esteban Zimányi
Published in: 2025 26th IEEE International Conference on Mobile Data Management (MDM), 2025, Page(s) 195-205
Publisher: IEEE
DOI: 10.1109/mdm65600.2025.00045

Uncertainty-Aware Ship Location Estimation using Multiple Cameras in Coastal Areas (opens in new window)

Author(s): Song Wu, Alexandros Troupiotis-Kapeliaris, Dimitris Zissis, Kristian Torp, Esteban Zimányi, Mahmoud Sakr
Published in: 2024 25th IEEE International Conference on Mobile Data Management (MDM), 2024, Page(s) 109-118
Publisher: IEEE
DOI: 10.1109/mdm61037.2024.00034

Wrapper Methods for Multi-Objective Feature Selection (opens in new window)

Author(s): Njoku, Uchechukwu Fortune and Abell{\'o} Gamazo, Alberto and Bilalli, Besim and Bontempi, Gianluca
Published in: 26th International Conference on Extending Database Technology, 2023
Publisher: OpenProceedings
DOI: 10.48786/edbt.2023.58

Effective Ship Trajectory Imputation with Multiple Coastal Cameras (opens in new window)

Author(s): Song Wu, Kristian Torp, Alexandros Troupiotis-Kapeliaris, Dimitris Zissis, Esteban Zimányi, Mahmoud Sakr
Published in: 2025 26th IEEE International Conference on Mobile Data Management (MDM), 2025, Page(s) 145-155
Publisher: IEEE
DOI: 10.1109/mdm65600.2025.00038

Finding Relevant Information in Big Datasets with ML. (opens in new window)

Author(s): Njoku, Uchechukwu Fortune; Abelló Gamazo, Alberto; Bilalli, Besim; Bontempi, Gianluca
Published in: 2024
Publisher: OpenProceedings
DOI: 10.48786/edbt.2024.85

Mitigating Data Sparsity in Integrated Data through Text Conceptualization (opens in new window)

Author(s): Md Ataur Rahman, Sergi Nadal, Oscar Romero, Dimitris Sacharidis
Published in: 2024 IEEE 40th International Conference on Data Engineering (ICDE), 2024, Page(s) 3490-3504
Publisher: IEEE
DOI: 10.1109/icde60146.2024.00269

Mobility Data Science (Dagstuhl Seminar 22021) (opens in new window)

Author(s): Mokbel, Mohamed ; Sakr, Mahmoud ; Xiong, Li ; Züfle, Andreas ; Almeida, Jussara ; Anderson, Taylor ; Aref, Walid ; Andrienko, Gennady ; Andrienko, Natalia ; Cao, Yang ; Chawla, Sanjay ; Cheng, Reynold ; Chrysanthis, Panos ; Fei, Xiqi ; Ghinita, Gabriel ; Graser, Anita ; Gunopulos, Dimitrios ; Jensen, Christian ; Kim, Joon-Sook ; Kim, Kyoung-Sook ; Kröger, Peer ; Krumm, John ; Lauer, Johannes ; M
Published in: Dagstuhl Reports, 2022, ISSN 2192-5283
Publisher: Schloss Dagstuhl -- Leibniz-Zentrum fur Informatik
DOI: 10.4230/dagrep.12.1.1

Semantic Segmentation of AIS Trajectories for Detecting Complete Fishing Activities (opens in new window)

Author(s): Song Wu; Esteban Zimányi; Mahmoud Sakr; Kristian Torp
Published in: 2022 23rd IEEE International Conference on Mobile Data Management (MDM), 2022
Publisher: IEEE
DOI: 10.1109/mdm55031.2022.00092

HYPPO: Using Equivalences to Optimize Pipelines in Exploratory Machine Learning (opens in new window)

Author(s): Antonios Kontaxakis, Dimitris Sacharidis, Alkis Simitsis, Alberto Abelló, Sergi Nadal
Published in: 2024 IEEE 40th International Conference on Data Engineering (ICDE), 2024, Page(s) 221-234
Publisher: IEEE
DOI: 10.1109/icde60146.2024.00024

The Susceptibility of Example-Based Explainability Methods to Class Outliers (opens in new window)

Author(s): Ikhtiyor Nematov, Dimitris Sacharidis, Tomer Sagi, Katja Hose
Published in: 2024
Publisher: Association for Computing Machinery
DOI: 10.48550/arxiv.2407.20678

Evaluation of Vessel CO2 Emissions Methods Using AIS Trajectories (opens in new window)

Author(s): Wu, Song and Torp, Kristian and Sakr, Mahmoud and Zimanyi, Esteban
Published in: Proceedings of the 18th International Symposium on Spatial and Temporal Data, 2023
Publisher: Association for Computing Machinery
DOI: 10.1145/3609956.3609960

Synthesizing Accurate Relational Data under Differential Privacy (opens in new window)

Author(s): Antheas Kapenekakis, Daniele Dell’Aglio, Charles Vesteghem, Laurids Poulsen, Martin Bøgsted, Minos Garofalakis, Katja Hose
Published in: 2024 IEEE International Conference on Big Data (BigData), 2025, Page(s) 433-439
Publisher: IEEE
DOI: 10.1109/bigdata62323.2024.10825515

A Study on Efficient Indexing for Table Search in Data Lakes (opens in new window)

Author(s): Ibraheem Taha, Matteo Lissandrini, Alkis Simitsis, Yannis Ioannidis
Published in: 2024 IEEE 18th International Conference on Semantic Computing (ICSC), 2024, Page(s) 245-252
Publisher: IEEE
DOI: 10.1109/icsc59802.2024.00046

A Framework for Automated Junction Monitoring (opens in new window)

Author(s): Rodrigo Sasse David, Kristian Torp, Mahmoud Sakr, Esteban Zimányi
Published in: Proceedings of the 32nd ACM International Conference on Advances in Geographic Information Systems, 2025, Page(s) 304-313
Publisher: ACM
DOI: 10.1145/3678717.3691322

Roundabouts and the Energy Consumption of Electrical Vehicles (opens in new window)

Author(s): Rodrigo Sasse David, Esteban Zimányi, Kristian Torp, Mahmoud Sakr
Published in: Proceedings of the 16th ACM SIGSPATIAL International Workshop on Computational Transportation Science, 2025, Page(s) 9-18
Publisher: ACM
DOI: 10.1145/3615895.3628165

Database Optimizers in the Era of Learning (opens in new window)

Author(s): Dimitris Tsesmelis and Alkis Simitsis
Published in: 38th {IEEE} International Conference on Data Engineering, {ICDE}, 2022
Publisher: IEEE
DOI: 10.1109/icde53745.2022.00301

Speed and energy consumption for electrical vehicles (opens in new window)

Author(s): Rodrigo Sasse David and Esteban Zim{\'{a}}nyi and Kristian Torp and Mahmoud Attia Sakr
Published in: Proceedings of the 15th {ACM} {SIGSPATIAL} International Workshop on Computational Transportation Science, {IWCTS} 2022,, 2022
Publisher: ACM
DOI: 10.1145/3557991.3567802

A data-science pipeline to enable the Interpretability of Many-Objective Feature Selection

Author(s): Njoku, Uchechukwu Fortune; Abelló Gamazo, Alberto; Bilalli, Besim; Bontempi, Gianluca
Published in: 2024
Publisher: CEUR-WS.org

Document Attribution in Retrieval-Augmented Generation

Author(s): Ikhtiyor Nematov, Tarik Kalai, Elizaveta Kuzmenko, Gabriele Fugagnoli, Dimitris Sacharidis, Katja Hose, Tomer Sagi
Published in: 2025
Publisher: Association for Computing Machinery (ACM)

An Adversary Model of Fraudsters’ Behavior to Improve Oversampling in Credit Card Fraud Detection (opens in new window)

Author(s): Daniele Lunghi, Gian Marco Paldino, Olivier Caelen, Gianluca Bontempi
Published in: IEEE Access, Issue 11, 2023, Page(s) 136666-136679, ISSN 2169-3536
Publisher: Institute of Electrical and Electronics Engineers Inc.
DOI: 10.1109/access.2023.3337635

Intent-Aware Example-Based Explainability (opens in new window)

Author(s): Ikhtiyor Nematov
Published in: Proceedings of the AAAI/ACM Conference on AI, Ethics, and Society, Issue 7, 2025, Page(s) 43-46, ISSN 3065-8365
Publisher: Association for the Advancement of Artificial Intelligence (AAAI)
DOI: 10.1609/aies.v7i2.31906

On many-objective feature selection and the need for interpretability (opens in new window)

Author(s): Uchechukwu F. Njoku, Alberto Abelló, Besim Bilalli, Gianluca Bontempi
Published in: Expert Systems with Applications, Issue 267, 2025, Page(s) 126191, ISSN 0957-4174
Publisher: Pergamon Press Ltd.
DOI: 10.1016/j.eswa.2024.126191

AIDE: Antithetical, Intent-based, and Diverse Example-Based Explanations (opens in new window)

Author(s): Ikhtiyor Nematov, Dimitris Sacharidis, Katja Hose, Tomer Sagi
Published in: Proceedings of the AAAI/ACM Conference on AI, Ethics, and Society, Issue 7, 2024, Page(s) 1051-1062, ISSN 3065-8365
Publisher: Association for the Advancement of Artificial Intelligence (AAAI)
DOI: 10.1609/aies.v7i1.31702

Towards fair machine learning using many-objective feature selection (opens in new window)

Author(s): Uchechukwu F. Njoku, Alberto Abelló, Besim Bilalli, Gianluca Bontempi
Published in: Applied Soft Computing, Issue 181, 2025, Page(s) 113411, ISSN 1568-4946
Publisher: Elsevier BV
DOI: 10.1016/j.asoc.2025.113411

And synopses for all: A synopses data engine for extreme scale analytics-as-a-service (opens in new window)

Author(s): Antonios Kontaxakis; Nikos Giatrakos; Dimitris Sacharidis; Antonios Deligiannakis
Published in: Information Systems, Issue 1, 2023, ISSN 0306-4379
Publisher: Elsevier Science & Technology
DOI: 10.1016/j.is.2023.102221

Scalable Model-Based Management of Massive High Frequency Wind Turbine Data with ModelarDB (opens in new window)

Author(s): Abduvoris Abduvakhobov, Søren Kejser Jensen, Torben Bach Pedersen, Christian Thomsen
Published in: Proceedings of the VLDB Endowment, Issue 17, 2025, Page(s) 4723-4732, ISSN 2150-8097
Publisher: Association for Computing Machinery (ACM)
DOI: 10.14778/3704965.3704978

Workload Placement on Heterogeneous CPU-GPU Systems (opens in new window)

Author(s): Marcos N. L. Carvalho, Alkis Simitsis, Anna Queralt, Oscar Romero
Published in: Proceedings of the VLDB Endowment, Issue 17, 2024, Page(s) 4241-4244, ISSN 2150-8097
Publisher: Association for Computing Machinery (ACM)
DOI: 10.14778/3685800.3685845

Comparative Analysis of Indexing Techniques for Table Search in Data Lakes (opens in new window)

Author(s): Ibraheem Taha, Matteo Lissandrini, Alkis Simitsis, Yannis Ioannidis
Published in: International Journal of Semantic Computing, Issue 19, 2025, Page(s) 173-196, ISSN 1793-351X
Publisher: World Scientific Pub Co Pte Ltd
DOI: 10.1142/s1793351x25420024

Pasteur: Scaling Privacy-Aware Data Synthesis (opens in new window)

Author(s): Antheas Kapenekakis, Daniele Dell’Aglio, Martin Bøgsted, Minos Garofalakis, Katja Hose
Published in: Lecture Notes in Computer Science, Advances in Databases and Information Systems, 2025, Page(s) 164-180
Publisher: Springer Nature Switzerland
DOI: 10.1007/978-3-032-05281-0_11

Evaluating Quality of Disparate Data Sources: A Discord-Driven Approach (opens in new window)

Author(s): Yeasmin Ara Akter, Alberto Abelló, Petar Jovanovic, Tomer Sagi, Katja Hose
Published in: Lecture Notes in Computer Science, Advances in Databases and Information Systems, 2025, Page(s) 147-163
Publisher: Springer Nature Switzerland
DOI: 10.1007/978-3-032-05281-0_10

Searching for OpenAIRE data...

There was an error trying to search data from OpenAIRE

No results available

My booklet 0 0