CORDIS - EU research results
Content archived on 2023-11-13

CEntral and South-east europeAn Resources

Project description


Open linguistic infrastructure

Human language technologies crucially depend on language resources and tools that are usable, useful and available. However, even where language resources and the corresponding tools are available, they have mostly been developed sporadically, in response to specific project needs, with relatively little regard for their long-term sustainability, IPR status, interoperability, reusability in different contexts, or potential deployment in multilingual applications.

The CESAR project, in close harmony with META-NET, intends to address this issue by enhancing, upgrading, standardising and cross-linking a wide variety of language resources and tools and making them available, thus contributing to an open linguistic infrastructure.

Partners in the CESAR consortium are key players in their respective language communities, with a proven track record in European language technology projects, including infrastructure initiatives such as TELRI in the past and notably CLARIN at present.

The project will make available a comprehensive set of language resources and tools covering the Hungarian, Polish, Croatian, Serbian, Bulgarian and Slovak languages. Resources will include interoperable monolingual and multilingual speech databases, corpora, dictionaries and wordnets, together with relevant language technology processing tools such as tokenisers, lemmatisers, taggers and parsers. A comprehensive initial list of tools and resources identified as potential targets of project activity can be found in Section 3.2. Depending on local capacities and efficiency considerations, the resources may either be made available as web services at partners' sites or contributed to the META-SHARE digital exchange facility.

In addition to the technical work required, great effort will be made to ensure sustainability by mobilising the LT community and raising awareness of the fundamental role of language resources among R&D policy makers, the media and the general public.


Programme(s)

Multi-annual funding programmes that define the EU's priorities for research and innovation.

Topic(s)

Calls for proposals are divided into topics. A topic defines a specific subject or area for which applicants can submit proposals. The description of a topic comprises its specific scope and the expected impact of the funded project.

Call for proposals

Procedure for inviting applicants to submit project proposals, with the aim of receiving EU funding.

CIP-ICT-PSP-2010-4

Funding scheme

Funding scheme (or "type of action") within a programme that shares common features. The funding scheme specifies the scope of what is funded, the reimbursement rate, the specific evaluation criteria to qualify for funding, and the simplified forms of cost coverage, such as lump sums.

PB - Pilot Type B

Coordinator

HUN REN NYELVTUDOMANYI KUTATOKOZPONT
EU contribution
€ 323 428,00
Address
BENCZUR UTCA 33
1068 Budapest
Hungary


Region
Közép-Magyarország Budapest Budapest
Activity type
Other
Total cost

The total costs incurred by this organisation to participate in the project, including direct and indirect costs. This amount is a subset of the overall project budget.

No data

Participants (8)
