Periodic Reporting for period 1 - SymSim (Symmetry and Similarity)
Reporting period: 2022-10-01 to 2025-03-31
Similarity of graphs has many different facets. We will identify the common core of different approaches to similarity, but also exhibit their differences. We will design methods for comparing different similarity measures and for obtaining a semantic understanding of similarity. We will develop criteria for the suitability of various similarity measures for different types of applications.
A particular focus of our research will be on efficient algorithms for computing similarity. A perfect similarity measure is of little use if we do not have an efficient way of determining how similar two graphs are.
A classic algorithmic problem in this context is the graph isomorphism problem, which involves deciding whether two graphs are structurally identical. Determining the precise computational complexity of this problem, or of the equivalent problem of computing all symmetries of a graph, is regarded as one of the most important open questions in theoretical computer science. Building on recent progress, we will design new algorithms that break barriers towards a polynomial-time algorithm for the isomorphism problem.
A natural declarative approach to similarity is comparing the frequencies of patterns in the two graphs. Technically, this leads to similarities based on homomorphism embeddings, which have been a focus of our attention. By developing a novel mathematical machinery drawing from areas such as representation theory and functional analysis, we were able to connect them to natural operational similarity measures based on matrix norms.
In practice, we typically learn features and similarities of graphs from data. Graph Neural Networks (GNNs) are the method of choice. It is known that they are related to homomorphism embeddings. We studied the expressivity and generalisation properties of GNNs. Our main result is a precise characterisation of GNNs in terms of logic and classical circuit complexity.
Towards the graph isomorphism problem, we gave a new isomorphism algorithm for the class of tournaments, a specific graph class that has played a very interesting role for the graph isomorphism problem. The runtime of our algorithm is parameterised in terms of the twin width of the input tournaments and is very efficient as long as the twin width is small.
The combinatorial Weisfeiler-Leman algorithm plays a central role in both theory and practice of the graph isomorphism problem, and it has applications beyond that, most notably in machine learning. Going back to Fürer (2001), it was a long-standing open question how many iterations the k-dimensional WL algorithm requires in the worst case? We settled this question by establishing a strong lower bound, complemented by a non-trivial upper bound. Our novel proof technique for the lower bound has also found other applications since then.
Two results stand out in my mind.
The first is the lower bound on the number of iterations of the Weisfeiler-Leman algorithm because it solved a long-standing and well-known open problem by a novel technique that has also found other applications since then.
The second is the characterisation of the expressiveness of graph neural networks in terms of logic and circuit complexity. It establishes a surprising and very clean connection between the “analogue” computation model of graph neural networks, operating with real numbers of unbounded precision, and classical computation models based on Boolean logic.