With the unprecedented advances in the satellite technology, recent years have led to a significant increase in the volume of Earth observation (EO) data archives. Thus, the development of accurate and scalable systems to discover crucial knowledge from massive EO data archives on the state of our planet Earth have recently emerged. Existing systems allow querying satellite images required for the considered EO applications based on keywords/tags in terms of sensor type, geographical location and data acquisition time of the satellite images stored in the archives. However, in the era of big data, the content of the satellite data is much more relevant than the keywords/tags. In order to keep up with the growing need of automatization, knowledge discovery systems and tools that operate on the content of the satellite images are necessary.
In the ERC BigEarth project, we develop cutting-edge methods for: i) large-scale image representation learning; and ii) large-scale image search and retrieval for an accurate and fast discovery of crucial information for observing Earth from big EO archives. The methods developed in the ERC BigEarth project provide the foundations for knowledge discovery systems that index and query the complex content of large-scale EO data in a scalable and accurate manner. In detail, the BigEarth project consists of five main Aims in total, from which four Aims are associated to the development of novel methodologies and tools on the main challenges of Big EO data and also one Aim is related to the benchmark archive construction to validate the algorithms and the software.
Aim 1: Development of novel methods and tools to characterize and exploit high level semantic and spectral information present in remote sensing (RS) images;
Aim 2: Development of novel feature extraction methods and tools to directly extract features from the compressed RS images;
Aim 3: Development of accurate and scalable RS image indexing and retrieval methods together with associated tools;
Aim 4: Development of methods and tools to integrate feature representations of different RS image sources into a unified form of feature representation;
Aim 5: Construction of a benchmark archive with high number of multi-source RS images.
The methods and algorithms developed in the BigEarth project: 1) address the challenges on knowledge discovery from big data archives for EO, which contributes to the EU’s Artificial Intelligence research and innovation agenda; and 2) ease the information discovery from massive archives based on efficient and effective modelling, indexing and querying the complex content of RS images (which go beyond the simple keywords/tags-based search).