CORDIS - Resultados de investigaciones de la UE
CORDIS

Formal lexically informed logics for searching the web

Descripción del proyecto

Un razonamiento consistente para los motores de búsqueda semántica

No siempre es fácil realizar una sencilla búsqueda de información en internet. A menudo implica cribar resultados de búsqueda irrelevantes. Uno de los problemas de los motores de búsqueda semántica actuales es que parten de la suposición poco realista de que todos los hechos reales sobre un dominio determinado están explícitamente recogidos en su base de conocimientos o en la web. La información relevante suele estar distribuida en varias bases de conocimiento incoherentes y las teorías del dominio rara vez son completas. El equipo del proyecto FLEXILOG, financiado por el Consejo Europeo de Investigación, pretende mejorar los motores de búsqueda semántica introduciendo una familia de lógicas para tener un razonamiento consistente con conocimiento del mundo real. Utilizará representaciones vectoriales de términos del lenguaje natural para estimar la verosimilitud de los modelos lógicos, lo que permitirá diversas formas de razonamiento de sentido común.

Objetivo

Semantic search engines use structured knowledge to improve traditional web search, e.g. by directly answering questions from users. Current approaches to semantic search rely on the unrealistic assumption that all true facts about a given domain are explicitly stated in their knowledge base or on the web. To reach their full potential, semantic search engines need the ability to reason about known facts. However, existing logics cannot adequately deal with the imperfect nature of knowledge from the web. One problem is that relevant information tends to be distributed over several heterogeneous knowledge bases that are inconsistent with each other. Moreover, domain theories are seldom complete, which means that a form of so-called plausible reasoning is needed. Finally, as relevant logical theories do not exist for many domains, reasoning may need to rely on imperfect probabilistic theories that have been learned from the web.

To overcome these challenges, FLEXILOG will introduce a family of logics for robust reasoning with messy real-world knowledge, based on vector-space representations of natural language terms (i.e. of lexical knowledge). In particular, we will use lexical knowledge to estimate the plausibility of logical models, using conceptual simplicity as a proxy for plausibility (i.e. Occam’s razor). This will enable us to implement various forms of commonsense reasoning, equipping classical logic with the ability to draw plausible conclusions based on regularities that are observed in a knowledge base. We will then generalise our approach to probabilistic logics, and show how we can use the resulting lexically informed probabilistic logics to learn accurate and comprehensive domain theories from the web. This project will enable a robust data-driven approach to logic-based semantic search, and more generally lead to fundamental progress in a variety of knowledge-intensive applications for which logical inference has traditionally been too brittle.

Régimen de financiación

ERC-STG - Starting Grant

Institución de acogida

CARDIFF UNIVERSITY
Aportación neta de la UEn
€ 1 451 656,00
Dirección
NEWPORT ROAD 30 36
CF24 0DE Cardiff
Reino Unido

Ver en el mapa

Región
Wales East Wales Cardiff and Vale of Glamorgan
Tipo de actividad
Higher or Secondary Education Establishments
Enlaces
Coste total
€ 1 451 656,00

Beneficiarios (1)