Periodic Reporting for period 2 - NEMESIS (Novel Evolutionary Model for the Early stages of Stars with Intelligent Systems)
Reporting period: 2022-03-01 to 2023-02-28
NEMESIS aims to readjust the current classification scheme and its characteristic timescales so that it is concurrent with the most recent observational and theoretical constraints. To meet these goals NEMESIS will compile the largest, panchromatic dataset comprising of all young stellar objects in nearby star-forming regions, harnessing critical information that resides in data from space missions. It will reprocess and analyze this unique dataset with supervised and unsupervised machine learning algorithms, deep learning neural networks for object detection, clustering and regression analysis of images in order to advance the analysis and interpretation beyond the current state-of-the-art. Ultimately, NEMESIS brings big data techniques and hybrid machine learning methods to systematically analyze and interpret large data volumes in order to answer some of the most persisting questions, paving the path toward data-intensive science applications in modern astrophysics.
Being an important pillar of the project, data compilation was initiated immediately. At a first stage catalogued data for nearby star-forming regions were retrieved. For young stellar objects, infrared wavelengths are particularly important, therefore data from space-borne infrared facilities (e.g. Herschel, Spitzer, AKARI, WISE) were given a priority. Nonetheless, data spanning all over the electromagnetic spectrum either from space (e.g. Hubble, XMM-Newton, Chandra) or ground-based facilities (e.g. ALMA, APEX, JCMT, 2MASS etc) can provide important information on the evolution of YSOs and were therefore retrieved. Part of the data compilation is based on reduced/published data which was retrieved from the literature and/or databases, while data of specific interest are being freshly reduced by the NEMESIS team.
Aiming to accelerate the production of early results, we prioritised the data compilation for a single star-forming region: Orion. The selection was based on both the number of young stellar sources, with Orion being the largest nearby star-forming region, but also on the number of available data, since Orion is one of the best studied star-forming regions. The Orion data compilation allowed us to perform a number of test different machine learning methods on the actual data and evaluate their performance.
With NEMESIS we introduce big data and machine learning techniques in the field of Star Formation to an extent that was never attempted before. Aiming to remain in the center of attention in a swiftly advancing field, we are organizing meetings in the context of the largest conferences taking place this year in Europe. These include the annual meeting of the European Astronomical Society (EAS) and the scientific assembly of the Committee on Space Research (COSPAR) hosting as invited speakers some of the leading figures in the field.