Periodic Reporting for period 2 - DEEPCEPTION (Visual perception in deep neural networks)
Periodo di rendicontazione: 2018-10-01 al 2019-09-30
In this project, we took the first steps in this direction by building accurate predictive models of human and non-human primate neural and behavioral responses in a demanding visual object recognition task. We focused on three major objectives: (i) establish an extensive benchmark of human visual processing; (ii) using this benchmark, evaluate the quality of machine decisions in relation to human performance; and (iii) using the insights gained from such a comparison, develop new, biologically-informed state-of-the art architectures. We successfully reached these goals, building a large-scale integrative bechmarking platform called Brain-Score, evaluating tens of models on it, and developing CORnet, the current best model of visual system. Going forward, we expect our heavily quantitative and engineering-focused approach to understanding visual system to scale to building the models of the entire brain.
We further evaluated how robust current machine learning models are when presented with images that are unlike the images they have been trained to recognize. Surprisingly, we found that models can generalize better than previously thought under minimal retraining, suggesting that in order to build robust models of visual processing, it may be beneficial to train them on even larger image datasets than currently available. In order to facilitate the generation of such large scale datasets that can be precisely controlled for various research questions, we build a 3D photorealistic virtual environment where virtual agents could interact with the virtual reality and allow a rapid testing of hypotheses how such systems learn.
Our work has been published in top neuroscience and machine learning venues, we presented it a multiple conferences, and it has been made accessible to the general public in various formats, including high-profile museum exhibitions, popular lectures and publications on the topics of science and AI.
By making this platform and all these tools open to all, we hope to lead a shift in the approach to neuroscience. While up until recently most research would focus on phenomenological descriptions, we want to encourage researchers to ask quantitative questions and rigorously quantify progress in predicting how the brain will respond. Such shift is long overdue, in our opinion, and could be best likened to the shift that happened in astronomy 500 years ago. Until Kepler’s laws of planetary motion and subsequent mathematical formulations by Newton, most astronomers were busy documenting planetary and star positions but struggled to see clear patterns. Kepler’s laws concentrated that empirical knowledge into a formal predictive model. We are hoping that our benchmarking tools can contribute to such much broader cultural shift in neuroscience as well.
On the other hand, our virtual environment will help researchers to accelerate their experimental cycle. What took months in collecting and processing data, can come now at much lower cost now in a simulated reality where all attributes of every single entity are under experimenter’s control. This platform is being prepared to public access.