The work in the BIGCHEM project was structured into ten interrelated research topics designed to develop new technologies for “Big Data” analysis across the preclinical phases of the drug discovery, starting with the target validation phase up to the lead optimization and profiling phases.
Within the target discovery and validation phase of drugs discovery, BIGCHEM exploited Big Data in machine-learning models for compound activity prediction (ESR1, ESR2), with a special emphasis on model interpretation (ESR1). The work of BIGCHEM on new technologies for chemical high-throughput virtual screening (ESR2, ESR3, ESR4) resulted in an improvement of the screening process efficiency, specifically with respect to time and cost. This work also contributed to a deeper understanding of compound promiscuity, the property of chemical compounds to activate multiple targets, and additionally developed tools to detect and filter promiscuous compounds with unwanted compound activity (ESR9, ESR4, ESR5), providing better interpretation of the results of screening assays.
Furthermore, the BIGCHEM fellows created innovative in silico methods for the visualization and analysis of large-scale datasets of compounds (ESR3, ESR4), facilitating the detection of new chemical leads of high-throughput screening campaigns.
The influence of the BIGCHEM fellow’s scientific outcome on the lead optimization and de novo design phase was also remarkable. The proposal of new generative models for the creation of new compounds with desired properties (ESR6, ESR7, ESR8) and the development of innovative approaches for the planning of synthetic routes (ESR7, ESR9) was key for the transformation and advance of this phase.
BIGCHEM also contributed to the creation of new data sharing methodologies, in particular methods to share ADMETox properties by means of graph convolutional deep neural networks and by using Molecule Matched Pairs (ESR10).
The above-mentioned BIGCHEM outcomes were presented at 65 scientific conferences and events and resulted in 52 publications (see
http://bigchem.eu(s’ouvre dans une nouvelle fenêtre)). The project also co-organised the Strasbourg Summer School on Chemoinformatics in 2018 and International Conference on Neural Networks (ICANN2019) in Munich in 2019 as well as edited a special issue on “Big data in Chemistry” at the Journal of Cheminformatics. The impact of BIGCHEM on the scientific community has been really outstanding: its publications were cited more than 600 times only in 2018 according to Google Scholar and include one “Hot paper” as well as four “highly cited” articles (source Web of Science). All ESRs were enrolled at the PhD programs of the respective Universities. Three fellows have already received their PhD degrees and the others are working towards it.