Meta-data extraction techniques for automatic indexing.

Objective

An important research activity within Oce Group Research is the development of document analysis and processing algorithms. These algorithms can be used to develop products and services that automatically manage very large flows of documents. The research on meta-data extraction techniques for automatic indexing is divided into the following stages:
1. Preprocessing. In this stage scanning artifacts are removed and the document image is positioned upright.
2. Layout analysis. In this stage characters, tables and figures are recognized. Characters are recognized using OCR techniques.
3. Genre classification. Information from earlier steps is used to classify the document as a book, report, business letter, etc.
4. Logical analysis. In this stage the functional meaning of the individual text blocks is determined, as well as the logical structure and reading order of the document.

The goal of this research is to automate the classification, archiving and retrieval of large collections of documents. The research must lead to more knowledge of the structure and meta-data of various kinds of documents. At a later stage, new software products in the document management field will be developed based on this research. The fellows will be part of the research group and will participate in creating new algorithms and strategies and in validating these algorithms. Building new contacts and extending existing ones with other research groups in the field is also part of the training.
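The four stages listed above can be read as a sequential pipeline in which each stage enriches the result of the previous one. The following is only a minimal sketch of that control flow, not the project's algorithms: the `Document` class, the function names and the toy rules inside each stage are all hypothetical stand-ins (plain text lines replace the scanned image, and a keyword rule replaces a real genre classifier).

```python
from dataclasses import dataclass, field
from typing import List, Tuple


@dataclass
class Document:
    """Hypothetical container that each stage fills in turn."""
    lines: List[str]  # stand-in for the scanned page content
    blocks: List[str] = field(default_factory=list)
    genre: str = ""
    roles: List[Tuple[str, str]] = field(default_factory=list)


def preprocess(doc: Document) -> Document:
    # Stage 1 stand-in: strip stray whitespace and drop empty lines,
    # in place of artifact removal and deskewing of a real image.
    doc.lines = [ln.strip() for ln in doc.lines if ln.strip()]
    return doc


def layout_analysis(doc: Document) -> Document:
    # Stage 2 stand-in: treat every cleaned line as one text block,
    # in place of OCR and table/figure detection.
    doc.blocks = list(doc.lines)
    return doc


def genre_classification(doc: Document) -> Document:
    # Stage 3 stand-in: a toy keyword rule (an assumption, not a real model).
    text = " ".join(doc.blocks).lower()
    doc.genre = "business letter" if "dear" in text else "report"
    return doc


def logical_analysis(doc: Document) -> Document:
    # Stage 4 stand-in: label the first block "heading" and the rest "body";
    # the list order doubles as the reading order.
    doc.roles = [
        ("heading" if i == 0 else "body", block)
        for i, block in enumerate(doc.blocks)
    ]
    return doc


# Stages run in the order given in the project description.
STAGES = [preprocess, layout_analysis, genre_classification, logical_analysis]


def run_pipeline(doc: Document) -> Document:
    for stage in STAGES:
        doc = stage(doc)
    return doc


if __name__ == "__main__":
    page = Document(lines=["  Dear Sir,  ", "", "Please find the invoice attached."])
    result = run_pipeline(page)
    print(result.genre)
    print(result.roles)
```

Keeping the stages as separate functions over one shared record mirrors the description's point that later stages use information from earlier steps; any real implementation would replace each stand-in with image processing, OCR and trained classifiers.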

Fields of science (EuroSciVoc)

CORDIS classifies projects with EuroSciVoc, a multilingual taxonomy of fields of science, through a semi-automatic process based on NLP techniques. See: The European Science Vocabulary.

Programme(s)

Multi-annual funding programmes that define the EU’s priorities for research and innovation.

FP5-HUMAN POTENTIAL - Programme for research, technological development and demonstration on "Improving the human research potential and the socio-economic knowledge base" (1998-2002)

Topic(s)

Calls for proposals are divided into topics. A topic defines a specific subject or area for which applicants can submit proposals. The description of a topic comprises its specific scope and the expected impact of the funded project.

Data not available

Call for proposal

Procedure for inviting applicants to submit project proposals, with the aim of receiving EU funding.

Data not available

Funding Scheme

Funding scheme (or “Type of Action”) inside a programme with common features. It specifies: the scope of what is funded; the reimbursement rate; specific evaluation criteria to qualify for funding; and the use of simplified forms of costs like lump sums.

BUR - Bursaries, grants, fellowships

Coordinator

OCE TECHNOLOGIES B.V.

EU contribution

No data

Address

St. Urbanusweg 43
5900 MA VENLO
Netherlands

Total cost

No data
