CORDIS - Forschungsergebnisse der EU
CORDIS

Quantum Information Access and Retrieval Theory

Periodic Reporting for period 2 - QUARTZ (Quantum Information Access and Retrieval Theory)

Berichtszeitraum: 2019-01-01 bis 2020-12-31

All of us are users of Information Access and Retrieval (IAR) systems such as Web search engines. The peculiar difficulty of IAR is the fact that relevance cannot be precisely and exhaustively described by the data itself. Quantum Mechanics (QM) shares with IAR the intrinsic and unavoidable uncertainty of measurement. As a quantum mechanical system such as particle cannot be fully known through measurement because of incompatibility between observables, the user’s relevance of an information object to his/her information need cannot be fully known through keywords, clicks, or views because of incompatibility between context and data. The quantum mechanical framework aims, at a more foundational level, to study corpuses of documents in order to identify their underlying quantum structures. This will not only enable to better understand optimal mathematical structures to model semantic spaces, thus consistently and efficiently capturing the more abstract level of meaning conveyed by the documents and texts of human language, but also to use quantum structures to enhance the effectiveness of the different models in IAR applications. QUARTZ started from the idea of transferring the scientific research results and expertise from senior researchers in the utilisation of the theory of QM in IAR to the junior researchers, thus stimulating the birth and growth of a networked European community of scholars with a larger, stronger, and deeper expertise in IAR. Thanks to previous research work started from van Rijsbergen’s seminal book and recently culminated with another book by the coordinator, QUARTZ is based on the main pillars of the quantum mechanical framework to build a new approach to IAR. The senior researchers involved in this project as organisation leaders or independent advisory committee members are leading researchers in multidisciplinary fields of QM and IAR.
We investigated and evaluated the geometry of IAR at the level of high-dimensional word embedding vectors and at the level of high-level conceptual relevance dimensions.
* We investigated polyrepresentation, which consists of using the geometry of abstract vector spaces to embed a variety of contextual dimensions, information objects and information needs.
* We proposed a geometric model inspired by the quantum mechanical framework to capture the user's importance given to each dimension of relevance.
* We discussed our methodology to construct incompatible basis for documents from real world query log data, the experiments to test Bell inequalities on this dataset and possible reasons for the lack of violation.
* We put forward that quantum structures can be useful in the decision-making processes that follow from the retrieval of relevant information.
* We proposed a methodology for carrying out such an investigation in large scale and real world data using query logs of a web search engine, and device a test to detect the presence of an irrational user behavior in relevance judgment of documents.
* We further validated this behavior through a Quantum Cognitive explanation of the order and context effects.
* We aimed to understand the extent in which Information Scent, Information Patch and Information Diet, features of Information Foraging Theory, address the conceptual issues of information behaviour research by reviewing approaches to information interaction in the context of information seeking and retrieval.
* We proposed a plan for the application of IFT to information access and retrieval in the process of describing key IFT concepts in an information environment.
* We investigated a query expansion framework on the Quantum Language Modeling (QLM) basis.
* We proposed a binary classification model inspired by quantum signal detection theory in an effort to investigate how much benefit it brings as compared to classical models.
* We gave a few examples of irrational behavior and use a generalized probabilistic model inspired by the quantum mechanical frameworkto model and explain such behavior.
* We examined a multimodal fusion scenario that might be similar to that encountered in physics by firstly measuring two observables (i.e. text-based relevance and image-based relevance) of a multi-modal document without counting on an ensemble of multi-modal documents already labeled in terms of these two variables.
* We investigated the existence of non-classical correlations between pairs of multi-modal documents.
* We investigated the existence of this kind of non-classical correlation through the Bell inequality violation.
* We experimentally tested several novel association methods in a small-scale experiment.
* We presented a series of interesting discussions, which may provide theoretical and empirical insights and inspirations for future development of this direction.

For the final period we included an overview of the results and their exploitation and dissemination in technical annex.

We performed the following training activities:

QUARTZ Winter School (February 7-14, 2018, Padova, Italy) - School Report available in the SIGIR Forum December 2018 Issue
QUARTZ Summer School (June 20-22, 2018, Luton, UK)
QUARTZ Autumn School (September 24-26, 2018, Cottbus, Germany)
QUARTZ Workshop 1: February 7-14, 2018, Padova, Italy (held in conjuction with the QUARTZ Winter School)
QUARTZ Workshop 2
W2a: June 20-22, 2018, Luton, UK (held in conjuction with the QUARTZ Summer School)
W2b: September 24-26, 2018, Cottbus, Germany (held in conjuction with the QUARTZ Autumn School)
QUARTZ Workshop 3: April 3-5, 2019, North Campus of the University of Copenhagen in Denmark
QUARTZ Workshop 4: February 12, 2020, London, UK.

QUARTZ Workshop 3: April 3-5, 2019, North Campus of the University of Copenhagen in Denmark
QUARTZ Workshop 4: February 12, 2020, London, UK.

We also provided the following complementary training:

TS1: Academic writing (June 22, 2018, Luton, UK)
TS2: Research Data Management (June 20, 2018, Luton, UK)
TS3: Presentation techniques and academic writing in English
TS3a: Presentation techniques in English (June 21, 2018, Luton, UK)
TS3b: Academic writing (September 24, 2018, Cottbus, Germany)
TS3c: Academic writing (April 3, 2019, Copenhagen, Denmark)
TS4: Impact and research communication skills (April 5, 2019, Copenhagen, Denmark)
TS5: Career Management for Researchers
TS5a: How to write a competitive proposal for H2020 (April 4, 2019, Copenhagen, Denmark)
TS5b: Employability and knowledge transfer (February 12, 2020, London, UK)
TS6: Technology Transfer: from Research Prototypes to Innovative Product (February 12, 2020, London, UK)
* To replace the conventional IAR paradigm based on keywords and the classical probabilistic and statistical models, by a geometry of IAR, in which non-classical methods are based on complex and heterogeneous feature vectors in abstract vector spaces.

* To extend the usual interaction models, logic and languages utilised by IAR systems to model and make non-distributive nor commutative operators for complex features and interaction which require a different logic from a classical set-based logic, in particular, represented by subspaces and relations thereof.

* To generalise the probability theory adopted by current IAR systems.
Quantum-inspired classification and decision