Skip to main content
Aller à la page d’accueil de la Commission européenne (s’ouvre dans une nouvelle fenêtre)
français français
CORDIS - Résultats de la recherche de l’UE
CORDIS
Contenu archivé le 2024-06-18

tranScriptorium

Description du projet


ICT for access to cultural resources
The aim of tranScriptorium is to develop innovative, cost-effective solutions for the indexing, search and full transcription of historical handwritten document images, using modern, holistic HTR tech

tranScriptorium aims to develop innovative, efficient and cost-effective solutions for the indexing, search and full transcription of historical handwritten document images, using modern, holistic Handwritten Text Recognition (HTR) technology. TranScriptorium will turn HTR into a mature technology by addressing the following objectives:

  • Enhancing HTR technology for efficient transcription
    Departing from state-of-the-art HTR approaches, tranScriptorium will capitalize on interactive-predictive techniques for effective and user-friendly computer-assisted transcrition.
  • Bringing the HTR technology to users
    Expected users of the HTR technology belong mainly to two groups: a) individual reserachers with experience in handwritten documents transcription interested in transcribing specific documents. b) volunteers which collaborate in large transcription projects.
  • Integrating the HTR results in public web portals
    The HTR technology will become a support in the digitization of the handwritten materials. The outcomes of the tranScriptorium tools will be attached to the published handwritten document images. This includes not only full, correct transcriptions, but also partially correct transcription and other kinds of automatically produced metadata, useful for indexing and searching.

Huge amounts of handwritten historical documents are being published by on-line digital libraries world wide. However, for these raw digital images to be really useful, they need be annotated with informative content. The tranScriptorium project aims to develop innovative, efficient and cost-effective solutions for the indexing, search and full transcription of historical handwritten document images, using modern, holistic Handwritten Text Recognition (HTR) technology. For typical handwritten text images of historical documents, currently available text image recognition technologies are not suitable. Traditional Optical Character Recognition (OCR) is simply not usable since characters can not be isolated automatically in these images. Therefore, holistic, segmentation-free HTR techniques, often borrowed from the field of Automatic Speech Recognition are needed. Yet, state-of-the-art holistic HTR approaches still lack the required accuracy, mainly due to the usual poor quality, degradations and writing style variability of historical document images. To cope with this lack of recognition accuracy for handwritten text images of historical documents, three actions are planned in tranScriptorium: i) improve basic image preprocessing and holistic HTR techniques; ii) develop novel indexing and keyword searching approaches, mainly based on byproducts of holistic HTR decoding and word spotting techniques; and iii) capitalize on new, user-friendly interactive-predictive HTR approaches for computer-assisted operation, which minimize the user intervention needed to achieve full, high quality transcripts. HTR tools based on tranScriptorium techniques will be incorporated into HTR web platforms that will be accessible to users through two different means: i) a content provider portal that provides access to handwritten historical documents for casual, individual researchers; and b) a specialized HTR web portal for structured crowd-sourcing transcription projects.

Champ scientifique (EuroSciVoc)

CORDIS classe les projets avec EuroSciVoc, une taxonomie multilingue des domaines scientifiques, grâce à un processus semi-automatique basé sur des techniques TLN. Voir: https://op.europa.eu/fr/web/eu-vocabularies/euroscivoc.

Vous devez vous identifier ou vous inscrire pour utiliser cette fonction

Programme(s)

Programmes de financement pluriannuels qui définissent les priorités de l’UE en matière de recherche et d’innovation.

Thème(s)

Les appels à propositions sont divisés en thèmes. Un thème définit un sujet ou un domaine spécifique dans le cadre duquel les candidats peuvent soumettre des propositions. La description d’un thème comprend sa portée spécifique et l’impact attendu du projet financé.

Appel à propositions

Procédure par laquelle les candidats sont invités à soumettre des propositions de projet en vue de bénéficier d’un financement de l’UE.

FP7-ICT-2011-9
Voir d’autres projets de cet appel

Régime de financement

Régime de financement (ou «type d’action») à l’intérieur d’un programme présentant des caractéristiques communes. Le régime de financement précise le champ d’application de ce qui est financé, le taux de remboursement, les critères d’évaluation spécifiques pour bénéficier du financement et les formes simplifiées de couverture des coûts, telles que les montants forfaitaires.

CP - Collaborative project (generic)

Coordinateur

UNIVERSITAT POLITECNICA DE VALENCIA
Contribution de l’UE
€ 513 836,00
Adresse
CAMINO DE VERA SN EDIFICIO 3A
46022 VALENCIA
Espagne

Voir sur la carte

Région
Este Comunitat Valenciana Valencia/València
Type d’activité
Higher or Secondary Education Establishments
Liens
Coût total

Les coûts totaux encourus par l’organisation concernée pour participer au projet, y compris les coûts directs et indirects. Ce montant est un sous-ensemble du budget global du projet.

Aucune donnée

Participants (5)

Mon livret 0 0