SemaGrow: Data intensive techniques to boost the real-time performance of global agricultural data infrastructures

Project description

Intelligent Information Management

As the trend towards opening up data and publishing them freely on the Internet intensifies, so do the opportunities to create added value by combining and cross-indexing heterogeneous data at large scale. Seizing these opportunities requires infrastructure that is not only efficient, responsive in real time, and scalable, but also flexible and robust enough to accept data in any schema and form, and to transparently delegate and translate queries from a unifying end-point to the multitude of data services that make up the open data cloud. This relies on detailed and accurate data summaries and other data source annotations; as data volumes and heterogeneity increase, managing these annotations itself becomes a challenging data problem.

SemaGrow will (a) develop scalable and robust semantic storage and indexing algorithms that can take advantage of resource naming conventions and other natural groupings of URIs to compress data source annotations about extremely large datasets; (b) develop query decomposition, source selection, and distributed querying methods that take advantage of such algorithms to implement a scalable and robust infrastructure for data service federation; and (c) rigorously test its components and overall architecture over real, complex, interconnected datasets comprising data and document collections, sensor data, and GIS data.

SemaGrow will be rigorously tested on the large-scale and complex agricultural data service ecosystem, which comprises more than 20 currently operating data services that today provide Gigatriples of RDF data, a volume projected to double before SemaGrow ends and to reach Teratriples by 2020. Being able to query across these datasets is a real and present need. SemaGrow aims to develop the scalable, efficient, and robust data services needed to take full advantage of the data-intensive and inter-disciplinary Science of 2020 and to re-shape the way that data analysis techniques are applied to the heterogeneous data cloud.
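Objectives (a) and (b) above rest on one idea: group a data source's URIs by a shared naming convention, keep only a compact annotation per group, and consult those annotations to decide which sources a query needs to reach. The following is a minimal Python sketch of that idea and not SemaGrow's actual design. It assumes the grouping key is simply the scheme and host of the subject URI. The source names, the `example.org` URIs, and the helper functions `uri_prefix`, `summarize`, and `select_sources` are all hypothetical.

```python
from collections import defaultdict
from urllib.parse import urlsplit


def uri_prefix(uri: str) -> str:
    """Return scheme and host of a URI, used here as a coarse grouping key."""
    parts = urlsplit(uri)
    return f"{parts.scheme}://{parts.netloc}/"


def summarize(triples):
    """Compress a source's triples into counts per (subject prefix, predicate)."""
    summary = defaultdict(int)
    for subject, predicate, _obj in triples:
        summary[(uri_prefix(subject), predicate)] += 1
    return dict(summary)


def select_sources(summaries, subject=None, predicate=None):
    """Return the sources whose summary may match a triple pattern.

    None stands for an unbound variable in the pattern.
    """
    wanted_prefix = uri_prefix(subject) if subject else None
    selected = []
    for name, summary in summaries.items():
        for prefix, pred in summary:
            if wanted_prefix not in (None, prefix):
                continue  # subject is bound and lives under another prefix
            if predicate not in (None, pred):
                continue  # predicate is bound and differs
            selected.append(name)
            break
    return sorted(selected)


# Hypothetical sources and triples, for illustration only.
SOURCES = {
    "crops": [
        ("http://crops.example.org/id/1", "http://example.org/vocab#yield", "3.2"),
        ("http://crops.example.org/id/2", "http://example.org/vocab#yield", "2.9"),
    ],
    "sensors": [
        ("http://sensors.example.org/s/7", "http://example.org/vocab#temperature", "21"),
    ],
}

summaries = {name: summarize(triples) for name, triples in SOURCES.items()}
print(select_sources(summaries, predicate="http://example.org/vocab#yield"))
```

In this sketch the summary for `crops` holds a single entry regardless of how many triples share the prefix and predicate, which is the compression effect the abstract alludes to. A pattern with no bound terms still has to be sent to every source.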

Fields of science

Coordinator

UNIVERSIDAD DE ALCALA

EU contribution

€ 462 981.00

Address

PLAZA DE SAN DIEGO
28801 Alcala De Henares/Madrid
Spain

Region

Comunidad de Madrid Comunidad de Madrid Madrid

Activity type

Higher or Secondary Education Establishments

Administrative contact

Miguel-Angel Sicilia (Prof.)

Total cost

No data

Participants (7)

SEMANTIC WEB COMPANY GMBH

Austria

EU contribution

€ 299 650.00

NATIONAL CENTER FOR SCIENTIFIC RESEARCH "DEMOKRITOS"

Greece

EU contribution

€ 554 331.00

PSOCHIOS IOANNIS & SIA OE - AGRO-KNOW TECHNOLOGIES

Greece

EU contribution

€ 185 164.00

UNIVERSITA DEGLI STUDI DI ROMA TOR VERGATA

Italy

EU contribution

€ 321 400.00

THE FOOD AND AGRICULTURE ORGANIZATION OF THE UNITED NATIONS

Italy

EU contribution

€ 187 200.00

STICHTING WAGENINGEN RESEARCH

Netherlands

EU contribution

€ 363 430.00

INSTITUT ZA FIZIKU

Serbia

EU contribution

€ 95 844.00
