CORDIS - Forschungsergebnisse der EU
CORDIS

Adaptive multi-tier intelligent data manager for Exascale

Projektbeschreibung

Intelligente adaptive Speicherung zur Förderung von Hochleistungsrechnern

Da immer mehr extrem große Datensätze verarbeitet werden müssen, werden Hochleistungsrechner immer notwendiger. Allerdings zeigt sich immer mehr, dass die flachen Datenspeicherungshierarchien mit einem parallelen Dateisystem dem nicht gewachsen sind. Aufstrebende mehrstufige Datenspeicherungshierarchien könnten die Anforderungen datenintensiver Anwendungen erfüllen, allerdings mangelt es derzeit noch an angemessenen Steuerungsmechanismen der verfügbaren Ressourcen. Das EU-finanzierte Projekt ADMIRE plant die Entwicklung eines adaptiven Speicherungssystems, mit dem Hochleistungsrechensysteme einen sehr hohen Durchsatz erreichen und die Anwendungsleistung steigern können. Das Ziel besteht darin, die Laufzeit von Anwendungen in Bereichen wie der Wettervorhersage, der Fernerkundung und Deep Learning erheblich zu verbessern.

Ziel

The growing need to process extremely large data sets is one of the main drivers for building exascale HPC systems today. However, the flat storage hierarchies found in classic HPC architectures no longer satisfy the performance requirements of data-processing applications. Uncoordinated file access in combination with limited bandwidth make the centralised back-end parallel file system a serious bottleneck. At the same time, emerging multi-tier storage hierarchies come with the potential to remove this barrier. But maximising performance still requires careful control to avoid congestion and balance computational with storage performance. Unfortunately, appropriate interfaces and policies for managing such an enhanced I/O stack are still lacking.

The main objective of the ADMIRE project is to establish this control by creating an active I/O stack that dynamically adjusts computation and storage requirements through intelligent global coordination, malleability of computation and I/O, and the scheduling of storage resources along all levels of the storage hierarchy. To achieve this, we will develop a software-defined framework based on the principles of scalable monitoring and control, separated control and data paths, and the orchestration of key system components and applications through embedded control points.

Our software-only solution will allow the throughput of HPC systems and the performance of individual applications to be substantially increased – and consequently energy consumption to be decreased – by taking advantage of fast and power-efficient node-local storage tiers using novel, European ad-hoc storage systems and in-transit/in-situ processing facilities. Furthermore, our enhanced I/O stack will offer quality-of-service (QoS) and resilience. An integrated and operational prototype will be validated with several use cases from various domains, including climate/weather, life sciences, physics, remote sensing, and deep learning.

Koordinator

UNIVERSIDAD CARLOS III DE MADRID
Netto-EU-Beitrag
€ 383 937,50
Adresse
CALLE MADRID 126
28903 Getafe (Madrid)
Spanien

Auf der Karte ansehen

Region
Comunidad de Madrid Comunidad de Madrid Madrid
Aktivitätstyp
Higher or Secondary Education Establishments
Links
Gesamtkosten
€ 767 875,00

Beteiligte (18)