Descripción del proyecto
Digital libraries and technology-enhanced learning
IMPACT will push innovation in OCR and language technologies for historical document processing and retrieval and build digitisation capacity in Europe
Text that is not digital is virtually invisible. Today's readers search the internet for electronically accessible texts rather than visit the reading room of a library. Born-digital and digitised contemporary materials contain the richness that allows tools such as text mining and the semantic web to offer superior accessibility but the story is very different for historic documents. A vital part of the European heritage, encompassing more than four centuries of historic books and bound periodicals is becoming less and less visible to the public at large.
With the i2010 vision of a European Digital Library, the EU has launched an ambitious plan for large scale digitisation projects transforming Europe's printed heritage into digitally available resources. However, lack of institutional knowledge and expertise slows down the pace with which this vision can be realised. The state of the art in OCR performance and machine understanding of the original document is inadequate, especially for historically important material with archaic fonts and spellings, newspapers with complex layouts, bound volumes, microfilm or typescript.
The IMPACT project will remove many of these barriers. The project will push innovation in OCR technology and language technology for historical document processing and retrieval, and share expertise to build capacity in digitisation across Europe. During the project, a Centre of Competence will be set up in order to provide a central service entry point for all libraries, archives and museums involved in the digitisation of textual material.
The consortium brings together twenty-six national and regional libraries, research institutions and commercial suppliers who will share their know-how and best practices, develop innovative tools to enhance the capabilities of OCR engines and the accessibility of digitised text and lay down the foundations for the mass-digitisation programmes that will take place over the next decade.
Text that is not digital is virtually invisible. Today's readers search the internet for electronically accessible texts rather than visit the reading room of a library. Born-digital and digitised contemporary materials contain the richness that allows tools such as text mining and the semantic web to offer superior accessibility but the story is very different for historic documents. A vital part of the European heritage, encompassing more than four centuries of historic books and bound periodicals is becoming less and less visible to the public at large.With the i2010 vision of a European Digital Library, the EU has launched an ambitious plan for large scale digitisation projects transforming Europe's printed heritage into digitally available resources. However, lack of institutional knowledge and expertise slows down the pace with which this vision can be realised. The state-of-the-art in OCR performance and machine understanding of the original document is inadequate, especially for historically important material with archaic fonts and spellings, newspapers with complex layouts, bound volumes, microfilm or typescript.The IMPACT project will remove many of these barriers. It brings together fifteen national and regional libraries, research institutions and commercial suppliers - all centres of competence with unequalled experience of large-scale text digitisation processes and technologies. The project will let them share their know-how and best practices, develop innovative tools to enhance the capabilities of OCR engines and the accessibility of digitised text and lay down the foundations for the mass-digitisation programmes that will take place over the next decade. This project will facilitate a more collaborative approach to mass-digitisation. It will build capacity and lower the barriers to entry for organisations in the early stages of their own digitisation activity.
Ámbito científico
Convocatoria de propuestas
FP7-ICT-2007-1
Consulte otros proyectos de esta convocatoria
Régimen de financiación
CP - Collaborative project (generic)Coordinador
2595 BE DEN HAAG
Países Bajos
Ver en el mapa
Participantes (29)
6020 Innsbruck
Ver en el mapa
1015 Wien
Ver en el mapa
1113 Sofia
Ver en el mapa
1037 Sofia
Ver en el mapa
La participación finalizó
1113 SOFIA
Ver en el mapa
116 36 Praha 1
Ver en el mapa
110 01 PRAHA 1
Ver en el mapa
80539 MUNCHEN
Ver en el mapa
37073 Gottingen
Ver en el mapa
60322 Frankfurt
Ver en el mapa
80539 MUNCHEN
Ver en el mapa
28046 MADRID
Ver en el mapa
03690 Alicante
Ver en el mapa
28071 Madrid
Ver en el mapa
75794 Paris
Ver en el mapa
75013 Paris
Ver en el mapa
15341 Agia Paraskevi
Ver en el mapa
49527 Petach Tikva
Ver en el mapa
2311 GJ Leiden
Ver en el mapa
61 704 Poznan
Ver en el mapa
00-927 WARSZAWA
Ver en el mapa
109451 MOSCOW
Ver en el mapa
1000 LJUBLJANA
Ver en el mapa
1000 Ljubljana
Ver en el mapa
BA2 7AY Bath
Ver en el mapa
NW1 2DB London
Ver en el mapa
M5 4WT Salford
Ver en el mapa
La participación finalizó
54001 NANCY
Ver en el mapa
54052 Nancy Cedex
Ver en el mapa