CORDIS
EU research results

Charting a New Horizon of Big and Fast Data Analysis through Integrated Algorithm Design

Objective

This proposal addresses a pressing need from emerging big data applications such as genomics and data center monitoring: beyond processing at scale, big data systems must also support perpetual, low-latency processing for a broad set of analytical tasks, referred to here as big and fast data analysis. Today’s technology falls severely short of these needs because it lacks support for complex analytics at scale, with low latency, and with strong guarantees on user performance requirements. To bridge the gap, this proposal tackles a grand challenge: “How do we design an algorithmic foundation that enables the development of all necessary pillars of big and fast data analysis?” This proposal considers three pillars:
1) Parallelism: There is a fundamental tension between data parallelism (for scale) and pipeline parallelism (for low latency). We propose new approaches based on intelligent use of memory and workload properties to integrate both forms of parallelism.
2) Analytics: The literature lacks a large body of algorithms for critical order-related analytics to be run under data and pipeline parallelism. We propose new algorithmic frameworks to enable such analytics.
3) Optimization: Today's big data systems run analytics on a best-effort basis only. We transform such systems into a principled optimization framework that suits the new characteristics of big data infrastructure and adapts to meet user performance requirements.

The scale and complexity of the proposed algorithm design make this project high-risk and, at the same time, high-gain: it will lay a solid foundation for big and fast data analysis, enabling a new integrated parallel processing paradigm, algorithms for critical order-related analytics, and a principled optimizer with strong performance guarantees. It will also broadly enable accelerated information discovery in emerging domains such as genomics, as well as the economic benefits of early, well-informed decisions and reduced user payments.

Host institution

ECOLE POLYTECHNIQUE

Address

Route De Saclay
91128 Palaiseau Cedex

France

Activity type

Higher or Secondary Education Establishments

EU Contribution

€ 2 472 752

Beneficiaries (1)

ECOLE POLYTECHNIQUE

France

EU Contribution

€ 2 472 752

Project information

Grant agreement ID: 725561

Status

Ongoing project

  • Start date

    1 September 2017

  • End date

    31 August 2022

Funded under:

H2020-EU.1.1.

  • Overall budget:

    € 2 472 752

  • EU contribution

    € 2 472 752

Hosted by:

ECOLE POLYTECHNIQUE

France