Project description
Facilitating information retrieval for domain-specific texts
The Internet hosts an overwhelming amount of textual data. Many existing text mining methods do not produce satisfactory results on domain-specific text because they cannot handle its complex language. Funded under the Marie Skłodowska-Curie programme, the DoSSIER project brings together experts, academia and industry to provide deeper insight into how users comprehend, formulate and access information in professional environments. The project's results will form the basis for a new generation of information access systems that will accelerate innovation in academia and industry. Questions that currently cannot be answered (for example, what is the key innovation difference between these two patents?) will be answered either directly by an information retrieval system or with cognitive tools.
Objective
DoSSIER (Domain Specific Systems for Information Extraction and Retrieval) will elucidate, model, and address the different information needs of professional users. It mobilizes an excellent and highly synergistic team of world-leading Information Retrieval (IR) experts from 5 EU States who, together with 3 academic partners (universities in the US, Japan, and Australia) and 11 industrial partners (dynamic SMEs and large corporations), will produce fundamental insights into how users comprehend, formulate, and access information in professional environments. For this, DoSSIER takes a highly innovative intersectoral and multidisciplinary approach, addresses fundamental questions about the nature and representation of information needs, engages in novel qualitative and quantitative evaluation, and provides training towards a structured, rigorous, and practical approach to search systems. It connects premier universities and outstanding industrial partners to provide unique opportunities to young researchers. The research is structured in three areas: 1. fundamental models of users and domain specificity, 2. contextual and personalized search, and 3. workflow, task, and interface. Each area, individually and through cross-field fertilisation, will produce breakthroughs in our understanding of computer-supported human information search workflows. The result will be a new generation of information access systems, which will accelerate innovation cycles in EU academia and industry, as well as in society as a whole. To be both concrete and generic, DoSSIER consists of 8 projects identifying a target domain and 7 projects acting horizontally across domains. Three vital domains are used: science & technology innovation, law, and healthcare. Questions currently unanswerable (e.g. What is the key innovation difference between these two patents?) will be answerable either directly by a system, or by the development of cognition-enhancing instruments for interacting with information.
Field of science
Keywords
Programme(s)
Funding scheme
MSCA-ITN - Marie Skłodowska-Curie Innovative Training Networks (ITN)
Coordinator
1040 Wien
Austria