Periodic Reporting for period 1 - PATTERNCHEM (Shape and Topology as Descriptors of Chemical and Physical Properties in Functional Organic Materials)
Période du rapport: 2022-06-01 au 2024-11-30
The overarching aim of PATTERNCHEM is to enable the rational design and facile pre-screening of functional organic materials for applications involving their non-covalent interactions with molecular targets in silico. Several families of functional organic materials – graphene derivatives, covalent-organic frameworks, and hyperbranched polymers – provide a unique foundation for developing application-oriented fingerprints of their topological and non-covalent interaction features. After elucidating diverse structural descriptors of atomistic arrangement, substitution patterns, and two- and three-dimensional shapes of these materials, we will establish a scheme for quantifying the propensity for non-covalent interactions and assessing the host-guest complementarity. Using this scheme, chemical and physical performance indicators relevant to targeted applications (e.g. as sensors, filters, and nanocarriers) can be computed. Finally, structure-property relationships between computed performance indicators and developed descriptors will be established and implemented into predictive frameworks for functional organic materials. The key deliverable of PATTERNCHEM will be a unified, all-encompassing framework for designing new candidate architectures and evaluating host-guest complementarity, which will require only basic structural information as an input and will predict the ultimate performance in the targeted application as its output.
In the area of graphene derivatives, we performed an extensive analysis of the published literature and compiled a database of relevant experimentally reported characteristics of graphene-based adsorbents and filters. We have benchmarked a range of computational approaches for predicting the interaction energies of these materials with small molecules and implemented the best-performing approaches together with a structure generator into a high-throughput multiscale computational workflow. We next used this workflow to compute the interaction energies and other related properties for a newly generated dataset of over 5,500 complexes between graphene derivatives and small molecules.
In the area of covalent-organic frameworks (COFs), we have identified reliable computational approaches for simulating and analysing their interactions with drug molecules and applied these to engineer in silico new frameworks for a delayed release of a broader scope of therapeutics. We have developed an in-house tool for identifying the building blocks of diverse COFs and commenced studies on how the properties of the frameworks arise from their constituents and topologies. We also developed a variant of our in-house chemical representation (see below) for machine learning the properties of COFs and are currently testing it in conjunction with several published datasets. We have also performed a critical evaluation of the approaches for assigning the three-dimensional arrangement of the two-dimensional frameworks from experimentally measured X-ray diffraction data, and started investigating how this arrangement impacts the electronic properties of the frameworks.
In the area of chemical representations, we have developed a quantum-inspired representation called MAOC (Matrix of Orthogonalized Atomic Orbital Coefficients), which is uniquely suited to describe charged and open-shell compounds and can be applied to perform kernel-based and deep learning on atoms, molecules, and extended materials. We have also developed a Matrix of Reference Similarity (MRS) technique for simultaneously reducing the dimensionality of the chemical space and representing the entities in it with a drastically reduced fingerprint, leading to tens to hundreds of times speed up in machine learning. To automatically identify the building blocks in topologically complex molecules and materials and to analyse how these building blocks affect the properties of the materials, we developed a Substructural Filter Representation (SFR).
• extensive research on hyperbranched polymers,
• customisation of developed fingerprints towards materials of interest,
• development of the non-covalent interactions descriptor,
• testing the utility of the new tools and descriptors across several materials families,
• an in-depth analysis of the structure-property trends across the generated datasets, and
• collaborations with experimentalists to test our predictions.