Discovery of content descriptors for documents

Objective

Research objectives and content
In the project, methods for discovering content descriptors, based on the text of documents, are developed and evaluated. Typical examples of content descriptors are keywords. Representative content descriptors are useful, even required, in many application areas, particularly in information retrieval. Extracting of keywords has been studied for a long time. However, the recent development has changed the situation: the appearance of huge document collections for the use of everyone sets new requirements for the methods. In this project, so-called data mining methods are studied and a special objective is to discover complex, structured content descriptors, e.g. phrases, and evaluate their usability in information retrieval.
Training content (objective, benefit and expected impact)
The home department has active Data Mining and Document Management research groups, whereas there is no tradition in information retrieval research. Hence, the training in standard and state-of-the-art information retrieval techniques and in conducting experiments is the key benefit. This combined expertise is invaluable to the new, emerging field of data mining in text.
Links with industry / industrial relevance (22)
The results of the project will be utilized within a project called Structured and Intelligent Documents at the University of Helsinki. The industrial partners of the project include major publishing and media houses in Finland, e.g. Aamulehti Group and Edita.

Fields of science (EuroSciVoc)

CORDIS classifies projects with EuroSciVoc, a multilingual taxonomy of fields of science, through a semi-automatic process based on NLP techniques. See: The European Science Vocabulary.

natural sciences computer and information sciences data science data mining

Programme(s)

Multi-annual funding programmes that define the EU’s priorities for research and innovation.

FP4-TMR - Specific research and technological development programme in the field of the training and mobility of researchers, 1994-1998

Topic(s)

Calls for proposals are divided into topics. A topic defines a specific subject or area for which applicants can submit proposals. The description of a topic comprises its specific scope and the expected impact of the funded project.

Call for proposal

Procedure for inviting applicants to submit project proposals, with the aim of receiving EU funding.

Data not available

Funding Scheme

Funding scheme (or “Type of Action”) inside a programme with common features. It specifies: the scope of what is funded; the reimbursement rate; specific evaluation criteria to qualify for funding; and the use of simplified forms of costs like lump sums.

RGI - Research grants (individual fellowships)

Coordinator

Eberhard-Karls-Universität Tübingen

EU contribution

No data

Address

10,Auf der Morgenstelle
72076 Tübingen
Germany

Total cost

No data

Participants (1)

Not available

Finland

EU contribution

No data

Address

Total cost

No data

Objective

Fields of science (EuroSciVoc)

CORDIS classifies projects with EuroSciVoc, a multilingual taxonomy of fields of science, through a semi-automatic process based on NLP techniques. See: The European Science Vocabulary.

Programme(s)

Multi-annual funding programmes that define the EU’s priorities for research and innovation.

Topic(s)

Calls for proposals are divided into topics. A topic defines a specific subject or area for which applicants can submit proposals. The description of a topic comprises its specific scope and the expected impact of the funded project.

Call for proposal

Procedure for inviting applicants to submit project proposals, with the aim of receiving EU funding.

Funding Scheme

Funding scheme (or “Type of Action”) inside a programme with common features. It specifies: the scope of what is funded; the reimbursement rate; specific evaluation criteria to qualify for funding; and the use of simplified forms of costs like lump sums.

Coordinator

Participants (1)

Share this page Share this page on social networks

Download Download the content of the page

Discovery of content descriptors for documents

Objective

Fields of science (EuroSciVoc) CORDIS classifies projects with EuroSciVoc, a multilingual taxonomy of fields of science, through a semi-automatic process based on NLP techniques. See: The European Science Vocabulary.

Programme(s) Multi-annual funding programmes that define the EU’s priorities for research and innovation.

Topic(s) Calls for proposals are divided into topics. A topic defines a specific subject or area for which applicants can submit proposals. The description of a topic comprises its specific scope and the expected impact of the funded project.

Call for proposal Procedure for inviting applicants to submit project proposals, with the aim of receiving EU funding.

Funding Scheme Funding scheme (or “Type of Action”) inside a programme with common features. It specifies: the scope of what is funded; the reimbursement rate; specific evaluation criteria to qualify for funding; and the use of simplified forms of costs like lump sums.

Coordinator

Participants (1)

Share this page Share this page on social networks

Download Download the content of the page

Fields of science (EuroSciVoc)

CORDIS classifies projects with EuroSciVoc, a multilingual taxonomy of fields of science, through a semi-automatic process based on NLP techniques. See: The European Science Vocabulary.

Programme(s)

Multi-annual funding programmes that define the EU’s priorities for research and innovation.

Topic(s)

Calls for proposals are divided into topics. A topic defines a specific subject or area for which applicants can submit proposals. The description of a topic comprises its specific scope and the expected impact of the funded project.

Call for proposal

Procedure for inviting applicants to submit project proposals, with the aim of receiving EU funding.

Funding Scheme

Funding scheme (or “Type of Action”) inside a programme with common features. It specifies: the scope of what is funded; the reimbursement rate; specific evaluation criteria to qualify for funding; and the use of simplified forms of costs like lump sums.