Description du projet
Digital libraries and technology-enhanced learning
IMPACT will push innovation in OCR and language technologies for historical document processing and retrieval and build digitisation capacity in Europe
Text that is not digital is virtually invisible. Today's readers search the internet for electronically accessible texts rather than visit the reading room of a library. Born-digital and digitised contemporary materials contain the richness that allows tools such as text mining and the semantic web to offer superior accessibility but the story is very different for historic documents. A vital part of the European heritage, encompassing more than four centuries of historic books and bound periodicals is becoming less and less visible to the public at large.
With the i2010 vision of a European Digital Library, the EU has launched an ambitious plan for large scale digitisation projects transforming Europe's printed heritage into digitally available resources. However, lack of institutional knowledge and expertise slows down the pace with which this vision can be realised. The state of the art in OCR performance and machine understanding of the original document is inadequate, especially for historically important material with archaic fonts and spellings, newspapers with complex layouts, bound volumes, microfilm or typescript.
The IMPACT project will remove many of these barriers. The project will push innovation in OCR technology and language technology for historical document processing and retrieval, and share expertise to build capacity in digitisation across Europe. During the project, a Centre of Competence will be set up in order to provide a central service entry point for all libraries, archives and museums involved in the digitisation of textual material.
The consortium brings together twenty-six national and regional libraries, research institutions and commercial suppliers who will share their know-how and best practices, develop innovative tools to enhance the capabilities of OCR engines and the accessibility of digitised text and lay down the foundations for the mass-digitisation programmes that will take place over the next decade.
Text that is not digital is virtually invisible. Today's readers search the internet for electronically accessible texts rather than visit the reading room of a library. Born-digital and digitised contemporary materials contain the richness that allows tools such as text mining and the semantic web to offer superior accessibility but the story is very different for historic documents. A vital part of the European heritage, encompassing more than four centuries of historic books and bound periodicals is becoming less and less visible to the public at large.With the i2010 vision of a European Digital Library, the EU has launched an ambitious plan for large scale digitisation projects transforming Europe's printed heritage into digitally available resources. However, lack of institutional knowledge and expertise slows down the pace with which this vision can be realised. The state-of-the-art in OCR performance and machine understanding of the original document is inadequate, especially for historically important material with archaic fonts and spellings, newspapers with complex layouts, bound volumes, microfilm or typescript.The IMPACT project will remove many of these barriers. It brings together fifteen national and regional libraries, research institutions and commercial suppliers - all centres of competence with unequalled experience of large-scale text digitisation processes and technologies. The project will let them share their know-how and best practices, develop innovative tools to enhance the capabilities of OCR engines and the accessibility of digitised text and lay down the foundations for the mass-digitisation programmes that will take place over the next decade. This project will facilitate a more collaborative approach to mass-digitisation. It will build capacity and lower the barriers to entry for organisations in the early stages of their own digitisation activity.
Champ scientifique
Appel à propositions
FP7-ICT-2007-1
Voir d’autres projets de cet appel
Régime de financement
CP - Collaborative project (generic)Coordinateur
2595 BE DEN HAAG
Pays-Bas
Voir sur la carte
Participants (29)
6020 Innsbruck
Voir sur la carte
1015 Wien
Voir sur la carte
1113 Sofia
Voir sur la carte
1037 Sofia
Voir sur la carte
Participation terminée
1113 SOFIA
Voir sur la carte
116 36 Praha 1
Voir sur la carte
110 01 PRAHA 1
Voir sur la carte
80539 MUNCHEN
Voir sur la carte
37073 Gottingen
Voir sur la carte
60322 Frankfurt
Voir sur la carte
80539 MUNCHEN
Voir sur la carte
28046 MADRID
Voir sur la carte
03690 Alicante
Voir sur la carte
28071 Madrid
Voir sur la carte
75794 Paris
Voir sur la carte
75013 Paris
Voir sur la carte
15341 Agia Paraskevi
Voir sur la carte
49527 Petach Tikva
Voir sur la carte
2311 GJ Leiden
Voir sur la carte
61 704 Poznan
Voir sur la carte
00-927 WARSZAWA
Voir sur la carte
109451 MOSCOW
Voir sur la carte
1000 LJUBLJANA
Voir sur la carte
1000 Ljubljana
Voir sur la carte
BA2 7AY Bath
Voir sur la carte
NW1 2DB London
Voir sur la carte
M5 4WT Salford
Voir sur la carte
Participation terminée
54001 NANCY
Voir sur la carte
54052 Nancy Cedex
Voir sur la carte