European Commission logo
español español
CORDIS - Resultados de investigaciones de la UE
CORDIS

Computing Answers to Complex Questions in Broad Domains

Descripción del proyecto

Un marco para obtener respuestas informáticas a preguntas complejas y amplias

Hoy en día, es más fácil acceder a la información en todo el mundo. No obstante, al transmitirse a través de sistemas automatizados, el acceso a ella se ve limitado por la capacidad de estos sistemas de comprender el lenguaje. Los sistemas actuales no pueden responder a preguntas complejas en ámbitos amplios, por lo que no pueden dividir las preguntas en partes y encontrar la información relevante en diferentes fuentes. El proyecto DELPHI, financiado con fondos europeos, tiene como objetivo mostrar que los ordenadores pueden responder a preguntas intrincadas y amplias que requieren que se razone sobre múltiples modalidades. Para ello, propone un marco en el que las preguntas complejas se descomponen simbólicamente en subpreguntas. Se responde a cada una con una red neuronal y la respuesta final se obtiene a partir de toda la información recopilada. Este trabajo puede cambiar las interacciones futuras entre humanos y máquinas.

Objetivo

The explosion of information around us has democratized knowledge and transformed its availability for
people around the world. Still, since information is mediated through automated systems, access is bounded
by their ability to understand language.
Consider an economist asking “What fraction of the top-5 growing countries last year raised their co2 emission?”.
While the required information is available, answering such complex questions automatically is
not possible. Current question answering systems can answer simple questions in broad domains, or complex
questions in narrow domains. However, broad and complex questions are beyond the reach of state-of-the-art.
This is because systems are unable to decompose questions into their parts, and find the relevant information
in multiple sources. Further, as answering such questions is hard for people, collecting large datasets to train
such models is prohibitive.
In this proposal I ask: Can computers answer broad and complex questions that require reasoning over
multiple modalities? I argue that by synthesizing the advantages of symbolic and distributed representations
the answer will be “yes”. My thesis is that symbolic representations are suitable for meaning composition, as
they provide interpretability, coverage, and modularity. Complementarily, distributed representations (learned
by neural nets) excel at capturing the fuzziness of language. I propose a framework where complex questions
are symbolically decomposed into sub-questions, each is answered with a neural network, and the final answer
is computed from all gathered information.
This research tackles foundational questions in language understanding. What is the right representation
for reasoning in language? Can models learn to perform complex actions in the face of paucity of data?
Moreover, my research, if successful, will transform how we interact with machines, and define a role for
them as research assistants in science, education, and our daily life.

Régimen de financiación

ERC-STG - Starting Grant

Institución de acogida

TEL AVIV UNIVERSITY
Aportación neta de la UEn
€ 1 499 375,00
Coste total
€ 1 499 375,00

Beneficiarios (1)