Two key steps were necessary to achieve the objectives of COMPLEX ML. In the first one, with collaborators in Jerusalem, we developed new theoretical tools to analyse complex systems. We established a connection between the traditional formal language in which concepts of theoretical physics are expressed, the field theory, and the language of theory of data compression from computer science, part of a broader field of information theory [Phys.Rev.Lett. 126, 240601 (2021)]. We showed that what is "relevant" in data from the point of view of constructing a physical theory is exactly the same as what is "relevant" from the point of view of lossy data compression. This connection is a "Rosetta Stone", allowing to translate physical concepts and quantities into a form more compatible with computational methods of Machine Learning.
In the second step, we used this theoretical result to build a numerical algorithm extracting "relevant quantities" from complex system data based on compression theory. To this end, with collaborators in Zurich, we developed the RSMI-NE code package, written in Python/Tensorflow, and made it publicly available in a GitHub repository [https://github.com/RSMI-NE/RSMI-NE] and as a Python package. We applied this algorithm, based on our theory and state-of-art results in Machine Learning, to an important statistical mechanical model [Phys.Rev.Lett. 127, 240603 (2021)], demonstrating that full theoretical understanding of a system can be obtained in semi-automatic fashion using RSMI-NE. We showed that our methods also provide insights into the symmetry properties of the system [Phys.Rev. E 104, 064106 (2021)].
Having achieved the first part of our objective, implementing novel compression-theory based methods for regular and ordered systems, in the second phase the effort has been to extend these tools to the challenging disordered and inhomogenous cases. To this end a new version of the RSMI-NE code has been implemented. Together with collaborators in Cardiff, Oxford, Jerusalem and Zurich we used it to study a strongly correlated system on a quasicrystal, discovering novel emergent phenomena [arXiv:2301.11934]. This achieved a major goal for COMPLEX ML, providing information about physical systems beyond what is already known and accessible with other methods. We further demonstrated how information-theoretic approaches can be used to construct simplified models of dynamics, and in precision numerics in 3D lattice gauge theories, which are of theoretical importance. Publications summarising these results are in preparation.
A complementary component of the COMPLEX ML project was derivation of new insights to improve ML algorithms. We focused on the question of gradient-free training of Binarized Neural Networks (BNNs), which are of great practical interest due to the potential of orders-of-magnitude savings on the cost (and energy use) of training, compared to their full-precision counterparts. Together with collaborators in Ukraine, working in industry, we investigated the idea of applying physical Monte Carlo methods to this problem and presented proof-of-concept results in the "Binary Networks for Computer Vision" workshop at the CVPR 2021 conference in Machine Learning.
Our results were published in peer-reviewed journals and presented in conferences, workshops and invited talks in Europe and USA. Outreach activities involving primary and high-school students were performed within the "Science is wonderful" programme.