CORDIS
EU research results

CORDIS

English EN

IMProving ACcess to Text

Project information

Grant agreement ID: 215064

Status

Closed project

  • Start date

    1 January 2008

  • End date

    30 June 2012

Funded under:

FP7-ICT

  • Overall budget:

    € 16 563 683

  • EU contribution

    € 12 163 911

Coordinated by:

KONINKLIJKE BIBLIOTHEEK

Netherlands

Project description

Digital libraries and technology-enhanced learning IMPACT will push innovation in OCR and language technologies for historical document processing and retrieval and build digitisation capacity in Europe

Text that is not digital is virtually invisible. Today's readers search the internet for electronically accessible texts rather than visit the reading room of a library. Born-digital and digitised contemporary materials contain the richness that allows tools such as text mining and the semantic web to offer superior accessibility but the story is very different for historic documents. A vital part of the European heritage, encompassing more than four centuries of historic books and bound periodicals is becoming less and less visible to the public at large.

With the i2010 vision of a European Digital Library, the EU has launched an ambitious plan for large scale digitisation projects transforming Europe's printed heritage into digitally available resources. However, lack of institutional knowledge and expertise slows down the pace with which this vision can be realised. The state of the art in OCR performance and machine understanding of the original document is inadequate, especially for historically important material with archaic fonts and spellings, newspapers with complex layouts, bound volumes, microfilm or typescript.

The IMPACT project will remove many of these barriers. The project will push innovation in OCR technology and language technology for historical document processing and retrieval, and share expertise to build capacity in digitisation across Europe. During the project, a Centre of Competence will be set up in order to provide a central service entry point for all libraries, archives and museums involved in the digitisation of textual material.

The consortium brings together twenty-six national and regional libraries, research institutions and commercial suppliers who will share their know-how and best practices, develop innovative tools to enhance the capabilities of OCR engines and the accessibility of digitised text and lay down the foundations for the mass-digitisation programmes that will take place over the next decade.

Leaflet | Map data © OpenStreetMap contributors, Credit: EC-GISCO, © EuroGeographics for the administrative boundaries

Coordinator

KONINKLIJKE BIBLIOTHEEK

Address

Prins Willem Alexanderhof 5
2595 Be Den Haag

Netherlands

Activity type

Research Organisations

EU Contribution

€ 1 367 128

Administrative Contact

Hildelies Balk (Mrs)

Participants (29)

UNIVERSITAET INNSBRUCK

Austria

EU Contribution

€ 818 358

Österreichische Nationalbank

Austria

EU Contribution

€ 619 995

INSTITUTE OF INFORMATION AND COMMUNICATION TECHNOLOGIES

Bulgaria

NACIONALNA BIBLIOTEKA SV SV CYRIL I METODIJ (St. St. Cyril and Methodius National Library)

Bulgaria

EU Contribution

€ 30 000

INSTITUTE FOR PARALLEL PROCESSING OF THE BULGARIAN ACADEMY OF SCIENCES

Bulgaria

EU Contribution

€ 66 800

UNIVERZITA KARLOVA

Czechia

EU Contribution

€ 32 600

NARODNI KNIHOVNA CESKE REPUBLIKY

Czechia

EU Contribution

€ 35 345

LUDWIG-MAXIMILIANS-UNIVERSITAET MUENCHEN

Germany

EU Contribution

€ 882 536

GEORG-AUGUST-UNIVERSITAT GOTTINGENSTIFTUNG OFFENTLICHEN RECHTS

Germany

EU Contribution

€ 738 800

DEUTSCHE NATIONALBIBLIOTHEK

Germany

EU Contribution

€ 310 325

Bavarian State Library

Germany

EU Contribution

€ 437 600

FUNDACION BIBLIOTECA VIRTUAL MIGUELDE CERVANTES SAAVEDRA

Spain

EU Contribution

€ 9 000

UNIVERSIDAD DE ALICANTE

Spain

EU Contribution

€ 70 350

BIBLIOTECA NACIONAL DE ESPANA

Spain

EU Contribution

€ 28 640

CENTRE NATIONAL DE LA RECHERCHE SCIENTIFIQUE CNRS

France

EU Contribution

€ 84 003

BIBLIOTHEQUE NATIONALE DE FRANCE

France

EU Contribution

€ 209 172

"NATIONAL CENTER FOR SCIENTIFIC RESEARCH ""DEMOKRITOS"""

Greece

EU Contribution

€ 971 493

IBM ISRAEL - SCIENCE AND TECHNOLOGY LTD

Israel

EU Contribution

€ 1 262 250

STICHTING INSTITUUT VOOR DE NEDERLANDSE TAAL

Netherlands

EU Contribution

€ 1 292 300

INSTYTUT CHEMII BIOORGANICZNEJ POLSKIEJ AKADEMII NAUK

Poland

EU Contribution

€ 66 550

UNIWERSYTET WARSZAWSKI

Poland

EU Contribution

€ 58 080

ABBYY PRODUCTION LLC

Russia

EU Contribution

€ 512 812

NARODNA IN UNIVERZITETNA KNJIZNICA

Slovenia

EU Contribution

€ 50 100

INSTITUT JOZEF STEFAN

Slovenia

EU Contribution

€ 74 300

UNIVERSITY OF BATH

United Kingdom

EU Contribution

€ 504 720

THE BRITISH LIBRARY BOARD

United Kingdom

EU Contribution

€ 586 815

THE UNIVERSITY OF SALFORD

United Kingdom

EU Contribution

€ 1 043 839

UNIVERSITE DE NANCY 2

France

UNIVERSITE DE LORRAINE

France

Project information

Grant agreement ID: 215064

Status

Closed project

  • Start date

    1 January 2008

  • End date

    30 June 2012

Funded under:

FP7-ICT

  • Overall budget:

    € 16 563 683

  • EU contribution

    € 12 163 911

Coordinated by:

KONINKLIJKE BIBLIOTHEEK

Netherlands