Periodic Reporting for period 4 - SMART (Strong Modular proof Assistance: Reasoning across Theories)
Reporting period: 2021-09-01 to 2022-08-31
In the project we have aimed to develop stronger systems combining automated reasoning with artificial intelligence. For this, we worked on combining and reuse of several hammer components. The for main work packages of the project aimed to develop and improve: (a) uniform learning methods, (b) reusable ATP encoding components for different foundational aspects, (c) integration of proof reconstruction, and (d) methods for knowledge extraction, reuse and content merging. The combination of these methods have improved the efficiency of AI and automated reasoning in several ITP systems.
When it comes to objective (a) of the proposal, we have developed a first version of the CIC0 logic combining data from the various type theoretic systems. We have investigated the various machine learning tasks for theorem proving and created various benchmarks. We have also worked on the characterization of mathematical knowledge that is more appropriate for machine learning methods and improved neural network methods tailored for theorem proving data and tasks.
For objective (b), we have developed a first version of the CoqHammer translation along with its improvements for several different Coq libraries. Its performance is much better than expected on Coq's standard library, but we achieved much less than what we were expecting on the Mathematical Components Library and other developments that rely on similar foundations. For hammering set theory, we have developed the Isabelle/Mizar object logic and moves important parts of Mizar knowledge to that foundation as well as created isomorphisms between the concepts in the object logics. We have developed a number of benchmarks for proof assistant ATP methods, including formalizations of category theory in Coq and game theory in Isabelle. Sledgehammer for Isabelle/HOL has been improved by the the integration of the learned ATP Enigma.
For objective (c), we have developed first proof certification methods in Coq by extending the Ben-Yelles term synthesis algorithm and adding heuristic rewriting. This constitutes the first general proof reconstruction mechanism for intuitionistic type theory. We have not however managed to create any reconstruction mechanism that would make use of any further information from the found proofs beyond the used premises or the unsatisfiable core as reported by SMT solvers. Furthermore, we have developed tactic-prediction models and their integration in several ITPs.
For objective (d), we have extended the alignments between proof assistant libraries from statistical ones to neural ones, in particular constructing an alignment of 6 proof assistants. We have also developed first statistical and deep auto-formalization systems.