CORDIS - EU research results

Domain Specific Systems for Information Extraction and Retrieval

Project description

Facilitating information retrieval for domain-specific texts

The Internet hosts an overwhelming amount of textual data. Many existing text mining methods fail to produce satisfactory results on domain-specific text because they cannot handle complex domain language. Funded under the Marie Skłodowska-Curie programme, the DoSSIER project brings together experts from academia and industry to gain further insight into how users comprehend, formulate, and access information in professional environments. Project outcomes will form the basis for a new generation of information access systems that will accelerate innovation in academia and industry. Questions that are currently unanswerable (e.g. What is the key innovation difference between these two patents?) will be answered either directly by an information retrieval system or with the help of cognitive tools.

Objective

DoSSIER (Domain Specific Systems for Information Extraction and Retrieval) will elucidate, model, and address the different information needs of professional users. It mobilizes an excellent and highly synergistic team of world-leading Information Retrieval (IR) experts from 5 EU states who, together with 3 academic partners (universities in the US, Japan, and Australia) and 11 industrial partners (dynamic SMEs and large corporations), will produce fundamental insights into how users comprehend, formulate, and access information in professional environments. To this end, DoSSIER takes a highly innovative intersectoral and multidisciplinary approach, addresses fundamental questions about the nature and representation of information needs, engages in novel qualitative and quantitative evaluation, and provides training towards a structured, rigorous, and practical approach to search systems. It connects premier universities and outstanding industrial partners to provide unique opportunities to young researchers. The research is structured in three areas: 1. fundamental models of users and domain specificity, 2. contextual and personalized search, and 3. workflow, task and the interface. Each area, individually and through cross-fertilisation with the others, will produce breakthroughs in our understanding of computer-supported human information search workflows. The result will be a new generation of information access systems, which will accelerate innovation cycles in EU academia and industry, as well as in society as a whole. To be both concrete and generic, DoSSIER consists of 8 projects that each identify a target domain and 7 projects that act horizontally across domains. Three vital domains are used: science & technology innovation, law, and healthcare. Questions that are currently unanswerable (e.g. What is the key innovation difference between these two patents?) will become answerable either directly by a system or through the development of cognition-enhancing instruments for interacting with information.

Fields of science (EuroSciVoc)

CORDIS classifies projects with EuroSciVoc, a multilingual taxonomy of fields of science, through a semi-automatic process based on NLP techniques. See: The European Science Vocabulary.

Keywords

Project’s keywords as indicated by the project coordinator. Not to be confused with the EuroSciVoc taxonomy (Fields of science)

Programme(s)

Multi-annual funding programmes that define the EU’s priorities for research and innovation.

Topic(s)

Calls for proposals are divided into topics. A topic defines a specific subject or area for which applicants can submit proposals. The description of a topic comprises its specific scope and the expected impact of the funded project.

Funding Scheme

Funding scheme (or “Type of Action”) inside a programme with common features. It specifies: the scope of what is funded; the reimbursement rate; specific evaluation criteria to qualify for funding; and the use of simplified forms of costs like lump sums.

MSCA-ITN - Marie Skłodowska-Curie Innovative Training Networks (ITN)

Call for proposal

Procedure for inviting applicants to submit project proposals, with the aim of receiving EU funding.

H2020-MSCA-ITN-2019

Coordinator

TECHNISCHE UNIVERSITAET WIEN
Net EU contribution

Net EU financial contribution: the sum of money that the participant receives, minus the EU contribution to its linked third parties. It reflects how the EU financial contribution is distributed between the direct beneficiaries of the project and other types of participants, such as third-party participants.

€ 528 414,48
Address
KARLSPLATZ 13
1040 Wien
Austria

Region
Ostösterreich Wien Wien
Activity type
Higher or Secondary Education Establishments
Total cost

The total costs incurred by this organisation to participate in the project, including direct and indirect costs. This amount is a subset of the overall project budget.

€ 528 414,48

Participants (7)