Community Research and Development Information Service - CORDIS

IMPACT

Project reference: 215064
Funded under: FP7-ICT

IMProving ACcess to Text

From 2008-01-01 to 2012-06-30

Project details

Total cost: EUR 16 563 683
EU contribution: EUR 12 163 911
Coordinated in: Netherlands
Call for proposal: FP7-ICT-2007-1
Funding scheme: CP - Collaborative project (generic)

IMPACT will push innovation in OCR and language technologies for historical document processing and retrieval, and build digitisation capacity in Europe.

Text that is not digital is virtually invisible. Today's readers search the internet for electronically accessible texts rather than visit the reading room of a library. Born-digital and digitised contemporary materials contain the richness that allows tools such as text mining and the semantic web to offer superior accessibility, but the story is very different for historic documents. A vital part of the European heritage, encompassing more than four centuries of historic books and bound periodicals, is becoming less and less visible to the public at large.

With the i2010 vision of a European Digital Library, the EU has launched an ambitious plan for large-scale digitisation projects transforming Europe's printed heritage into digitally available resources. However, a lack of institutional knowledge and expertise slows the pace at which this vision can be realised. The state of the art in OCR performance and machine understanding of the original document is inadequate, especially for historically important material with archaic fonts and spellings, newspapers with complex layouts, bound volumes, microfilm or typescript.

The IMPACT project will remove many of these barriers. The project will push innovation in OCR technology and language technology for historical document processing and retrieval, and share expertise to build capacity in digitisation across Europe. During the project, a Centre of Competence will be set up in order to provide a central service entry point for all libraries, archives and museums involved in the digitisation of textual material.

The consortium brings together twenty-six national and regional libraries, research institutions and commercial suppliers who will share their know-how and best practices, develop innovative tools to enhance the capabilities of OCR engines and the accessibility of digitised text, and lay down the foundations for the mass-digitisation programmes that will take place over the next decade.

Objective

Text that is not digital is virtually invisible. Today's readers search the internet for electronically accessible texts rather than visit the reading room of a library. Born-digital and digitised contemporary materials contain the richness that allows tools such as text mining and the semantic web to offer superior accessibility, but the story is very different for historic documents. A vital part of the European heritage, encompassing more than four centuries of historic books and bound periodicals, is becoming less and less visible to the public at large.

With the i2010 vision of a European Digital Library, the EU has launched an ambitious plan for large-scale digitisation projects transforming Europe's printed heritage into digitally available resources. However, a lack of institutional knowledge and expertise slows the pace at which this vision can be realised. The state of the art in OCR performance and machine understanding of the original document is inadequate, especially for historically important material with archaic fonts and spellings, newspapers with complex layouts, bound volumes, microfilm or typescript.

The IMPACT project will remove many of these barriers. It brings together fifteen national and regional libraries, research institutions and commercial suppliers, all centres of competence with unequalled experience of large-scale text digitisation processes and technologies. The project will let them share their know-how and best practices, develop innovative tools to enhance the capabilities of OCR engines and the accessibility of digitised text, and lay down the foundations for the mass-digitisation programmes that will take place over the next decade. This project will facilitate a more collaborative approach to mass-digitisation. It will build capacity and lower the barriers to entry for organisations in the early stages of their own digitisation activity.

Related information

Coordinator

KONINKLIJKE BIBLIOTHEEK
Netherlands
PRINS WILLEM ALEXANDERHOF 5
DEN HAAG, Netherlands
Administrative contact: Hildelies Balk
Tel.: +31 70 3140138
Fax: +31 70 3140 427
E-mail

Participants

OESTERREICHISCHE NATIONALBIBLIOTHEK
Austria
JOSEFSPLATZ 1
WIEN, Austria
Administrative contact: Maria Bernklau
Tel.: +43-1-53410-221
Fax: +43-1-53410-211
E-mail
UNIVERSITAET INNSBRUCK
Austria
INNRAIN
INNSBRUCK, Austria
Administrative contact: Kurt Habitzel
Tel.: +43-512-5079051
Fax: +43-512-5072607
E-mail
INSTITUTE OF INFORMATION AND COMMUNICATION TECHNOLOGIES
Bulgaria
Acad. G. Bonchev
Sofia, Bulgaria
Administrative contact: Stoyan Mihov
Tel.: +359 896 616 366
E-mail
NACIONALNA BIBLIOTEKA SV SV CYRIL I METODIJ (St. St. Cyril and Methodius National Library)
Bulgaria
Vasil Levski blvd
Sofia, Bulgaria
Administrative contact: Tatyana Dimitrova
Tel.: +359 2 846 7480
Fax: +359 2 843 54 95
E-mail
UNIVERZITA KARLOVA V PRAZE
Czech Republic
Ovocny trh
PRAHA 1, Czech Republic
Administrative contact: Karel Kučera
Tel.: +420224241490
Fax: +420 224 491 895
E-mail
NARODNI KNIHOVNA CESKE REPUBLIKY
Czech Republic
KLEMENTINUM 190
PRAHA 1, Czech Republic
Administrative contact: Tomas Foltyn
Tel.: +420 281 013 320
Fax: +420 281 013 107
E-mail
GEORG-AUGUST-UNIVERSITAET GOETTINGEN STIFTUNG OEFFENTLICHEN RECHTS
Germany
WILHELMSPLATZ
GOETTINGEN, Germany
Administrative contact: Hiltraud Casper-Hehne
Tel.: +49 551 39 7157
Fax: +49 551 39 13383
E-mail
BAYERISCHE STAATSBIBLIOTHEK
Germany
LUDWIGSTRASSE 16
MUNCHEN, Germany
Administrative contact: Markus Brantl
Tel.: +49-89-28638244
Fax: +49-89-2863826
E-mail
DEUTSCHE NATIONALBIBLIOTHEK
Germany
ADICKESALLEE 1
FRANKFURT, Germany
Administrative contact: Rolf Flormann
Tel.: 069-1525-2100
Fax: 069-1525-2002
E-mail
LUDWIG-MAXIMILIANS-UNIVERSITAET MUENCHEN
Germany
Geschwister-Scholl-Platz
MUENCHEN, Germany
Administrative contact: Klaus U. Schulz
Tel.: +49-89-21809700
Fax: +49-89-2180970
E-mail
FUNDACION BIBLIOTECA VIRTUAL MIGUEL DE CERVANTES SAAVEDRA
Spain
PASEO DE LA CASTELLANA
MADRID, Spain
Administrative contact: Rossanna Pinna
Tel.: +34 965 909 718
Fax: +34 965 909715
E-mail
BIBLIOTECA NACIONAL DE ESPANA
Spain
Paseo de Recoletos
Madrid, Spain
Administrative contact: Belén Llera
Tel.: +34 91 580 78 00
Fax: +34 91 516 89 28
E-mail
UNIVERSIDAD DE ALICANTE
Spain
CAMPUS DE SAN VICENTE RASPEIG
ALICANTE, Spain
Administrative contact: Carolina Fernández Lopez
Tel.: +34 965903802
Fax: +34 965 903803
E-mail
BIBLIOTHEQUE NATIONALE DE FRANCE
France
QUAI FRANCOIS MAURIAC 11
PARIS, France
Administrative contact: Gérald Grunberg
Tel.: +33 153 795 910
Fax: +331 537 94 737
E-mail
CENTRE NATIONAL DE LA RECHERCHE SCIENTIFIQUE
France
Rue Michel-Ange
PARIS, France
Administrative contact: Anne Florence Remy
Tel.: +33 383 856 042
Fax: +33 383 324 592
E-mail
NATIONAL CENTER FOR SCIENTIFIC RESEARCH "DEMOKRITOS"
Greece
Patriarchou Gregoriou Str.
AGHIA PARASKEVI, Greece
Administrative contact: Marina Fontara
Tel.: +302106503096
Fax: +302106503139
E-mail
IBM ISRAEL - SCIENCE AND TECHNOLOGY LTD
Israel
94 DERECH EM-HAMOSHAVOT
PETACH TIKVA, Israel
Administrative contact: Efrat Shalom-Hillman
Tel.: +972-4-828-1101
Fax: +972-4-829-6111
E-mail
INSTITUUT VOOR NEDERLANDSE LEXICOLOGIE
Netherlands
Matthias de Vrieshof 2-3, 2311 BZ
LEIDEN, Netherlands
Administrative contact: Petra Belt
Tel.: +31 71 5272474
Fax: +31 71 5272115
E-mail
UNIWERSYTET WARSZAWSKI
Poland
Krakowskie Przedmiescie
WARSAW, Poland
Administrative contact: Diana Pustula
Tel.: +48225524201
Fax: +48225524200
E-mail
INSTYTUT CHEMII BIOORGANICZNEJ PAN
Poland
NOSKOWSKIEGO 12-14
POZNAN, Poland
Administrative contact: Agnieszka Dominiak
Tel.: +48 61 858 21 04
Fax: +48 61 852 59 54
E-mail
ABBYY PRODUCTION LLC
Russia
21 BRATISLAVSKAYA STREET, BUILDING 1
MOSCOW, Russia
Administrative contact: Andrey Isaev
Tel.: +7 495 7833700
Fax: +7 495 7832663
E-mail
NARODNA IN UNIVERZITETNA KNJIZNICA
Slovenia
TURJASKA
LJUBLJANA, Slovenia
Administrative contact: Alenka Kavčič-Čolić
Tel.: +3862001131
Fax: +386 1 4257 293
E-mail
INSTITUT JOZEF STEFAN
Slovenia
Jamova
LJUBLJANA, Slovenia
Administrative contact: Jadran Lenarcic
Tel.: +386 1 4773 513
Fax: +38612519385
E-mail
THE BRITISH LIBRARY BOARD
United Kingdom
96 EUSTON ROAD
LONDON, United Kingdom
Administrative contact: Aly Conteh
Tel.: +442074127478
E-mail
THE UNIVERSITY OF SALFORD
United Kingdom
THE CRESCENT
SALFORD, United Kingdom
Administrative contact: Apostolos Antonacopoulos
Tel.: +44 161 2952653
Fax: +44 161 2955575
E-mail
UNIVERSITY OF BATH
United Kingdom
CLAVERTON DOWN
BATH, United Kingdom
Administrative contact: Hazel Wallis
Tel.: 00441225386822
Fax: 00441225383276
E-mail
Université de Lorraine
France
cours Léopold
Nancy, France
Administrative contact: N/A
Tel.: +33 169 08 90 9
Fax: +33 169 08 21 9
E-mail
Record Number: 85383 / Last updated on: 2014-11-10