The first half of the project was concerned with the development of a machine-learning model that is able to estimate relevant properties of a given molecular structure. A dataset with geometries and properties of molecules relevant for organic photovoltaics was designed. Further, the SchNet model of Schütt et al. [Schütt et al., JCP 2018, DOI:10.1063/1.5019779] was enhanced with a Set2Set readout unit [Vinyals et al., ICLR 2015, arxiv:1511.06391] and trained with the aforementioned dataset. This work has been presented at the Fall Meeting of the European Materials Research Society in September 2022, and a manuscript titled "Machine Learning for Orbital Energies of Organic Molecules Upwards of 100 Atoms" has been published in Physica Status Solidi B (DOI:10.1002/pssb.202200553).
Later, it was discovered that for practical application of the machine-learning model in a materials-search setup [Figure: "Big picture" of Machine Learning in Materials Discovery], the model should not depend on the exact molecular geometry. Rather, it should be able to handle molecular input data in the form of a molecular graph (i.e. specify just the atoms and the bonds between them, but not the exact bond lengths or angles). Such models have recently become available, and the "Uni-Mol+" model [Lu et al. Nature Comm. 2024, DOI:10.1038/s41467-024-51321-w] was extended for the needs of the MALTOSE project (multitask learning, extended training set, transfer-learning and fine-tuning approaches). Finally, we used the OPEP2 dataset [Greenstein & Hutchison, JPCC 2023, DOI:10.1021/acs.jpcc.3c00267] to design a two-stage machine-learning model that relates first the molecular graphs of donor and acceptor with molecular properties, and then the molecular properties with organic photovoltaic performance. This work has been presented at the Spring Meeting of the European Materials Research Society in May 2024 in Strasbourg in a contribution titled "Machine-Learning Driven Materials Search for Organic Photovoltaics".
The MALTOSE project has participated in the "Science is Wonderful!" competition [https://event.scienceiswonderful.eu] organized by the European Commission in 2022 and 2023. The proposal "Assembling molecules to make up our world" has been developed with Miguel Ángel Abril from CEIP "La Santa Cruz" in Caravaca de la Cruz, Spain, and is designed for pupils from 5th grade onwards (approx 10 years old). The didactic unit addresses some of the key concepts underlying the MALTOSE project, namely (a) the basic rules of chemistry, how to combine atoms to molecules with a ball-and-stick model, (b) to relate the molecular model to actual properties of the materials, and (c) to get an idea about the vast number of different compounds that can be built from just a handful of atoms. Even though the contribution was not selected for the Science Fair, we took the chance to deliver the class to the students of fifth grade of CEIP "La Santa Cruz" in Caravaca de la Cruz, Spain.