Project description
Just the facts, please!
Journalism is about disseminating facts. As such, a reporter’s job is to check the facts being presented to the public. In recent years, the verification of social media content has become increasingly important to journalists and news organisations. The proliferation of social media means a larger volume of claims to verify; therefore, automated verification methods could help journalists assess the truthfulness of claims. The EU-funded AVeriTeC project will use machine learning approaches to develop an automated verification system that can process complex claims that require multiple pieces of evidence to cross-check. AVeriTeC’s ultimate goal is to establish the verification of textual claims as a real-world challenge to stimulate progress in natural language processing, machine learning and related fields.
Objective
Verification of textual claims is the task of assessing the truthfulness of a statement in natural language. It is commonly conducted manually by journalists on claims made by public figures such as politicians, with the aim of reducing misinformation. However, the proliferation of social media has created the need to apply verification to a larger volume of claims coming from a greater variety of sources, thus calling for automation.
Research in automated verification of textual claims is at an early stage. The methods developed either assess the truthfulness of the claim without considering evidence, or handle very simple claims such as “UK has 3.2 million EU immigrants” that requires the retrieval of a single factoid from a knowledge base. While useful, claims are often more complex, and taking evidence into account is necessary for the verdicts to be credible.
AVeriTeC will transform automated verification by enabling the verification of more complex claims than previously attempted, such as “the United Kingdom has ten times Italy’s number of immigrants”, which require multiple pieces of evidence. We will achieve this by developing methods able to generate multiple questions per claim, retrieve answers from both knowledge bases and textual sources, and combine them into verdicts. As these tasks are interdependent, we will develop novel machine learning approaches able to handle them jointly so that the verdicts are accompanied by suitable justifications in the form of questions and answers. The latter will be formulated in natural language, thus the process followed by the models developed will be explainable to the users, while the evidence itself can be useful even if the overall verdict is incorrect.
Beyond developing novel methods and creating publicly available evaluation resources, AVeriTeC will establish verification of textual claims as a real-world challenge to stimulate progress in natural language processing, machine learning and related fields.
Fields of science
Not validated
Not validated
- social sciencesmedia and communicationsjournalism
- natural sciencescomputer and information sciencesdata sciencenatural language processing
- social sciencessociologyindustrial relationsautomation
- natural sciencescomputer and information sciencesartificial intelligencemachine learning
- social sciencessociologydemographyhuman migrations
Programme(s)
Funding Scheme
ERC-COG - Consolidator GrantHost institution
CB2 1TN Cambridge
United Kingdom