Skip to main content
European Commission logo print header

Integrated Data Analysis Pipelines for Large-Scale Data Management, HPC, and Machine Learning

Rezultaty

DSL runtime design

Report on the initial design of distribution primitives and existing framework integration

Initial System Architecture

Report on requirements of endtoend data analysis pipelines and design of the initial system architecture

Scheduler design for pipelines and tasks

Report on the initial overall design of the scheduling components scheduling of pipelines and workflows as well as task and data placement

SotA survey of benchmarks from DM, HPC, and ML Sys

Report on the stateoftheart of benchmarks for database systems data management highperformance computing and ML systems

Initial pipeline definition all use cases

Report on use case studies with technical details and the definition of initial pipelines that can be used for testing

Language Design Specification

Report on the language abstractions APIs and DSL as well as the central internal representation

Design of integration HW accelerators

Report on the planned overall design of integration HW accelerators as well as details on accelerated operations and primitives as well as its compiler and runtime support

Report on search space analysis, automatic capability configuration

Report on stateoftheart techniques for computational storage neardata processing and potential side effects as well as an overview of automatically determining the capabilities of a storage configuration

1st Annual Project Report

Public report describing the project progress until M12 achievements and impact as well as a calculation of efforts and costs

Compiler Prototype

Software artifact of the initial compiler prototype

Publikacje

DaphneSched: A Scheduler for Integrated Data Analysis Pipelines

Autorzy: Ahmed Eleliemy, Florina M. Ciorba
Opublikowane w: ISPDC23; IEEE, 2023
Wydawca: ISPDC23; IEEE

I/O Interface Independence with xNVMe

Autorzy: Simon Lund, Philippe Bonnet, Klaus Jensen, Javier Gonzalez
Opublikowane w: Proceedings of the 15th ACM International Systems and Storage Conference, Issue annually, 2022
Wydawca: ACM - Association for Computing Machinery

DAPHNE: An Open and Extensible System Infrastructure for Integrated Data Analysis Pipelines

Autorzy: Patrick Damme, Marius Birkenbach, Constantinos Bitsakos, Matthias Boehm, Philippe Bonnet, Florina Ciorba, Mark Dokter, Pawel Dowgiallo, Ahmed Eleliemy, Christian Faerber, Georgios Goumas, Dirk Habich, Niclas Hedam, Marlies Hofer, Wenjun Huang, Kevin Innerebner, Vasileios Karakostas, Roman Kern, Tomaž Kosar, Daniel Krems, Andreas Laber, Wolfgang Lehner, Eric Mier, Marcus Paradies, Bernhard Peischl
Opublikowane w: Conference on Innovative Data Systems Research, CIDR, Issue 9.1.2022-12.1.2022, 2022
Wydawca: Conference on Innovative Data Systems Research, CIDR

Micro-architectural Analysis of a Learned Index

Autorzy: Mikkel Møller Andersen, Pınar Tözün
Opublikowane w: Proceedings of the International Workshop on Exploiting Artificial Intelligence Techniques for Data Management, Issue annually, 2022
Wydawca: ACM - Association for Computing Machinery
DOI: 10.1145/3533702.3534917

Not your Grandpa's SSD: The Era of Co-Designed Storage Devices

Autorzy: Alberto Lerner, Philippe Bonnet
Opublikowane w: Proceedings of the 2021 International Conference on Management of Data, 2021
Wydawca: ACM

Evaluating Multi-GPU Sorting with Modern Interconnects

Autorzy: Tobias Maltenberger, Ivan Ilic, Ilin Tolovski, Tilmann Rabl
Opublikowane w: Proceedings of the 2022 International Conference on Management of Data (SIGMOD ’22), Issue annually, 2022
Wydawca: ACM - Association for Computing Machinery

Efficient Multi-Model Management

Autorzy: Nils Strassenburg, Dominic Kupfer, Julia Kowal, Tilmann Rabl
Opublikowane w: 26th International Conference on Extending Database Technology (EDBT), Issue annually, 2023
Wydawca: OpenProceedings.org

Enabling Integrated Data Analysis Pipelines on Heterogeneous Hardware through Holistic Extensibility

Autorzy: Patrick Damme, Matthias Boehm
Opublikowane w: 2nd Workshop on Novel Data Management Ideas on Heterogeneous Hardware Architectures (NoDMC), 2023
Wydawca: Gesellschaft für Informatik

A Survey of Big Data, High Performance Computing, and Machine Learning Benchmarks

Autorzy: Nina Ihde, Paula Marten, Ahmed Eleliemy, Gabrielle Poerwawinata, Pedro Silva, Ilin Tolovski, Florina M. Ciorba, Tilmann Rabl
Opublikowane w: Proceedings of the Thirteenth TPC Technology Conference on Performance Evaluation & Benchmarking, 2021
Wydawca: Springer

Delilah: eBPF-offload on computational storage

Autorzy: Niclas Hedam, Morten Tychsen Clausen, Philippe Bonnet, Sangjin Lee, Ken Friis Larsen
Opublikowane w: 19th International Workshop on Data Management on New Hardware (DaMoN), Issue annually, 2023
Wydawca: ACM

Parallelization of benchmarking using HPC: text summarization in natural language processing (NLP), glider piloting in deep-sea missions, and search algorithms in computational intelligence (CI)

Autorzy: Aleš Zamuda
Opublikowane w: Proceedings of the Austrian-Slovenian HPC Meeting 2021 - ASHPC21, 2021, ISBN 978-961-6980-77-7
Wydawca: University of Ljubljana

DeGNN: Improving Graph Neural Networks with Graph Decomposition

Autorzy: Miao, Xupeng; Gürel, Nezihe Merve; id_orcid0000-0002-4747-2406; Zhang, Wentao; Han, Zhichao; Li, Bo; Min, Wei; Rao, Susie; id_orcid0000-0003-2379-1506; Ren, Hansheng; Shan, Yinan; Shao, Yingxia; Wang, Yujie; Wu, Fan; Xue, Hui; Yang, Yaming; Zhang, Zitao; Zhao, Yang; Zhang, Shuai; id_orcid0000-0002-7866-4611; Wang, Yujing; Cui, Bin; Zhang, Ce
Opublikowane w: Proceedings of the 27th ACM SIGKDD Conference on Knowledge Discovery & Data Mining (KDD '21), Issue annually, 2021
Wydawca: ACM

Evaluating In-Memory Hash Joins on Persistent Memory

Autorzy: Tobias Maltenberger, Till Lehmann, Lawrence Benson, Tilmann Rabl
Opublikowane w: 25th International Conference on Extending Database Technology (EDBT), Issue annually, 2022
Wydawca: OpenProceedings.org
DOI: 10.48786/edbt.2022.23

Evaluating SIMD Compiler-Intrinsics for Database Systems

Autorzy: Lawrence Benson, Richard Ebeling and Tilmann Rabl
Opublikowane w: VLDBW -- ADMS 23, 2023
Wydawca: ACM - Association for Computing Machinery

TPCx-AI - An Industry Standard Benchmark for Artificial Intelligence and Machine Learning Systems

Autorzy: Christoph Brücke, Philipp Härtling, Rodrigo D Escobar Palacios, Hamesh Patel and Tilmann Rabl
Opublikowane w: 2023
Wydawca: VLDB23; ACM - Association for Computing Machinery

Desis: Efficient Window Aggregation in Decentralized Networks

Autorzy: Wang Yue, Lawrence Benson, Tilmann Rabl
Opublikowane w: 26th International Conference on Extending Database Technology (EDBT), Issue annually, 2023
Wydawca: OpenProceedings.org

Darwin: Scale-In Stream Processing

Autorzy: Lawrence Benson, Tilmann Rabl
Opublikowane w: Conference on Innovative Data Systems Research, CIDR 22, Issue annually, 2022
Wydawca: Conference on Innovative Data Systems Research, CIDR 22

Considering a Fear and Greed Index in Bitcoin Price Prediction Through Long Short-Term Memory

Autorzy: Nataša Ošep Ferš, Aleš Zamuda
Opublikowane w: IEEE Slovenia Section, Issue annually, 2021
Wydawca: IEEE

Maximizing Persistent Memory Bandwidth Utilization for OLAP Workloads

Autorzy: Björn Daase, Lars Jonas Bollmeier, Lawrence Benson, Tilmann Rabl
Opublikowane w: Proceedings of the 2021 International Conference on Management of Data (SIGMOD 2021), 2021
Wydawca: ACM

DaphneSched: A Scheduler for Integrated Data Analysis Pipelines

Autorzy: Ahmed Eleliemy, Florina M. Ciorba, Jonas H. Müller Korndörfer
Opublikowane w: ISPDC23, 2023
Wydawca: ISPDC23

How Do OS and Application Schedulers Interact? An Investigation with Multithreaded Applications

Autorzy: Jonas H. Müller Korndorfer, Ahmed Eleliemy, Osman Simsek, Thomas Ilsche, Robert Schöne, Florina M. Ciorba
Opublikowane w: Springer, 2023, ISBN 978-3-031-39697-7
Wydawca: Euro-Par 2023

PerMA-Bench: Benchmarking Persistent Memory Access

Autorzy: Benson, Lawrence and Papke, Leon and Rabl, Tilmann
Opublikowane w: Proceedings of the Very Large Data Base Endowment (VLDB) Endowment, Issue annually, 2022
Wydawca: ACM - Association for Computing Machinery
DOI: 10.14778/3551793.3551807

Predicting Ion Beam Tuning in Semiconductor Manufacturing

Autorzy: Andreas Laber, Martin Gebser, Konstantin Schekotihin, Yao Yang
Opublikowane w: IEEE Electron Devices Society, Issue bi-annually, 2022
Wydawca: IEEE

BabelMR: A Polyglot Framework for Serverless MapReduce

Autorzy: Fabian Mahling, Paul Rößler, Thomas Bodner and Tilmann Rabl
Opublikowane w: VLDBW -- SDA 23, 2023
Wydawca: ACM - Association for Computing Machinery

VolcanoML: Speeding up End-to-End AutoML via Scalable Search Space Decomposition

Autorzy: Li, Yang; Shen, Yu; Zhang, Wentao; Jiang, Jiawei; Ding, Bolin; Li, Yaliang; Zhou, Jingren; Yang, Zhi; Wu, Wentao; Zhang, Ce; Cui, Bin
Opublikowane w: Proceedings of the VLDB Endowment, 14 (11), Issue annually, 2021
Wydawca: PVLDB

Ease. ML: A Lifecycle Management System for Machine Learning

Autorzy: Aguilar Melgar, Leonel; id_orcid0000-0001-6864-4492; Dao, David; Gan, Shaoduo; Gürel, Nezihe M.; Hollenstein, Nora; id_orcid0000-0001-7936-4170; Jiang, Jiawei; Karlaš, Bojan; Lemmin, Thomas; id_orcid0000-0001-5705-4964; Li, Tian; Li, Yang; Rao, Susie; id_orcid0000-0003-2379-1506; Rausch, Johannes; Renggli, Cedric; Rimanic, Luka; Weber, Maurice; Zhang, Shuai; id_orcid0000-0002-7866-4611; Zhao, Zh
Opublikowane w: Proceedings of the Annual Conference on Innovative Data Systems Research (CIDR), 2021, Issue 1, 2021
Wydawca: CIDR 2021
DOI: 10.3929/ethz-b-000458916

Drop It In Like It’s Hot: An Analysis of Persistent Memory as a Drop-in Replacement for NVMe SSDs

Autorzy: Maximilian Böther, Otto Kißig, Lawrence Benson, Tilmann Rabl
Opublikowane w: International Workshop on Data Management on New Hardware (DAMON’21), 2021
Wydawca: ACM SIGMOD/PODS

Efficiently Managing Deep Learning Models in a Distributed Environment

Autorzy: Nils Strassenburg, Ilin Tolovski, Tilmann Rabl
Opublikowane w: 25th International Conference on Extending Database Technology (EDBT), Issue annually, 2022
Wydawca: OpenProceedings.org
DOI: 10.48786/edbt.2022.12

DaxVM: Stressing the Limits of Memory as a File Interface

Autorzy: Chloe Averti, Vasileios Karakostas, Nikhita Kunati, Georgios Goumas, Michael Swift
Opublikowane w: MICRO 2022 - 55th IEEE/ACM International Synopsium on Microarchitecture, Issue annually, 2022
Wydawca: ACM/IEEE

A Resourceful Coordination Approach for Multilevel Scheduling

Autorzy: Eleliemy, Ahmed; Ciorba, Florina M.
Opublikowane w: International Conference on High Performance Computing & Simulation (HPCS) 2021, Issue annual, 2021
Wydawca: HPCS

TPCx-AI on NVIDIA Jetsons

Autorzy: Robert Bayer, Jon Voigt Tøttrup, and Pınar Tözün
Opublikowane w: Proceedings of the Fourteenth TPC Technology Conference on Performance Evaluation & Benchmarking, 2022
Wydawca: ACM - Association for Computing Machinery

RMG Sort: Radix-Partitioning-Based Multi-GPU Sorting

Autorzy: Ivan Ilic, Ilin Tolovski, Tilmann Rabl
Opublikowane w: Datenbanksysteme für Business, Technologie und Web (BTW 2023), Issue bi-annually, 2023
Wydawca: Springer

Viper: An Efficient Hybrid PMem-DRAM Key-Value Store

Autorzy: Lawrence Benson, Hendrik Makait, Tilmann Rabl
Opublikowane w: 2021
Wydawca: ACM

Analyzing Vectorized Hash Tables Across CPU Architectures

Autorzy: Maximilian Böther, Lawrence Benson, Ana Klimovic, Tilmann Rabl
Opublikowane w: VLDB23, 2023
Wydawca: ACM - Association for Computer Machinery

DocParser: Hierarchical Document Structure Parsing from Renderings

Autorzy: Rausch, Johannes; Martinez, Octavio; Bissig, Fabian; Zhang, Ce; Feuerriegel, Stefan
Opublikowane w: Proceedings of the AAAI Conference on Artificial Intelligence, 35 (5), 2021, Page(s) 4328-4338, ISSN 2159-5399
Wydawca: AAAI Press
DOI: 10.13039/501100000780

Speeding up Vectorized Benchmarking of Optimization Algorithms

Autorzy: Aleš Zamuda
Opublikowane w: Austrian-Slovenian HPC Meeting 2022 – ASHPC22, Issue annually, 2022
Wydawca: EuroCC Austria

Don’t Compete, Let’s Cooperate: A Cooperative Scheduling Approach

Autorzy: Ahmed Eleliemy, Florina M. Ciorba
Opublikowane w: Platform for Advancing Scientific Computing Conference, 2021
Wydawca: PASC

CleanML: A Study for Evaluating the Impact of Data Cleaning on ML Classification Tasks

Autorzy: Li Peng, Rao Xi, Jennifer Blase, Xu Chu, Yue Zhang, Ce Zhang
Opublikowane w: DeGNN, 2020
Wydawca: ETH Zurich, Institute for Computing Platforms
DOI: 10.13039/501100001711

Single- and Two-Level Dynamic Load Balancing of Scientific Applications

Autorzy: Ahmed Eleliemy, Florina M. Ciorba
Opublikowane w: Platform for Advancing Scientific Computing Conference, 2021
Wydawca: PASC

The urban morphology on our planet – Global perspectives from space

Autorzy: Xiao Xiang Zhu,Chunping, Qiu, Jingliang Hua, Yilei Shi, Yuanyuan Wang, Michael Schmitta, Hannes Taubenböck
Opublikowane w: Remote Sensing of Environment, Issue 16 volumes / year, 2021, ISSN 0034-4257
Wydawca: Elsevier BV
DOI: 10.1016/j.rse.2021.112794

Micro-architectural analysis of in-memory OLTP: Revisited

Autorzy: Utku Sirin, Pınar Tözün, Danica Porobic, Ahmad Yasin, Anastasia Ailamaki
Opublikowane w: The VLDB Journal, Volume 30, Issue every other month, July 2021, 2021, ISSN 1066-8888
Wydawca: Springer Verlag
DOI: 10.1007/s00778-021-00663-8

Better Database Cost/Performance via Batched I/O on Programmable SSD

Autorzy: Jaeyoung Do, Ivan Luiz Picoli, David Lomet, Philippe Bonnet
Opublikowane w: Conference on Very Large Data Bases (VLDB Journal), Issue 18.2.2021, 2021, ISSN 1066-8888
Wydawca: Springer Verlag
DOI: 10.1007/s00778-020-00648-z

LB4OMP: A Dynamic Load Balancing Library for Multithreaded Applications

Autorzy: Jonas H. Müller Korndörfer; Ahmed Eleliemy; Ali Mohammed; Florina M. Ciorba
Opublikowane w: IEEE Transactions on Parallel and Distributed Systems, Volume 33, Issue 4, 2021, Page(s) 830 - 841, ISSN 1045-9219
Wydawca: Institute of Electrical and Electronics Engineers
DOI: 10.1109/tpds.2021.3107775

Automated Scheduling Algorithm Selection and Chunk Parameter Calculation in OpenMP

Autorzy: Ali Mohammed, Jonas H. Müller Kornörfer, Ahmed Eleliemy, Florina M. Ciorba
Opublikowane w: IEEE Transactions on Parallel and Distributed Systems, Issue Volume: 33, Issue: 12, December 2022, 2022, ISSN 1045-9219
Wydawca: Institute of Electrical and Electronics Engineers

Wyszukiwanie danych OpenAIRE...

Podczas wyszukiwania danych OpenAIRE wystąpił błąd

Brak wyników