Project description
Linking data sets for increased language technologies
Language technologies that rely on large amounts of data and better access to language resources permit the delivery of multilingual solutions to support Europe’s Digital Single Market. However, language technology specialists spend 80 % of their time cleaning, organising and collecting data sets because data is not ‘ready-to-use’. Although an essential part of the extract-transform-load process requires linking data sets to existing designs, linked data technologies remain unexploited. The EU-funded Pret-a-LLOD project will increase the use of language technologies to create ready-to-use multilingual data. The project will combine linked data sets with language technologies that are Linguistic Linked Open Data (LLOD) and develop innovative tools for the transformation and linking of data sets.
Objective
Language technologies increasingly rely on large amounts of data and better access and usage of language resources will enable to provide multilingual solutions that would support the emerging Digital Single Market in Europe. However, data is rarely ‘ready-to-use’ and language technology specialists spend over 80% of their time on cleaning, organizing and collecting datasets. Reducing this effort promises huge cost savings for all sectors where language technologies are required. An essential part of the Extract-Transform-Load process involves linking datasets to existing schemas, yet few specialists take advantage of linked data technologies to perform this task. In this project we aim to increase the uptake of language technologies by exploiting the combination of linked data and language technologies, that is Linguistic Linked Open Data (LLOD), to create ready-to-use multilingual data. Prêt-à-LLOD aims to achieve this by creating a new methodology for building data value chains applicable to a wide-range of sectors and applications and based around language resources and language technologies that can be integrated by means of semantic technologies, in particular the usage of Linguistic Linked Open Data (LLOD). The project will develop novel tools for the transformation and linking of datasets, and apply these to both data and metadata in order to provide multi-portal access to heterogeneous data repositories. We will study how we can automatically analyze licenses in order to deduce how data may be lawfully used and sold by language resource providers. Finally, we will provide tools to combine language services and resources into complex pipelines by use of semantic technologies. This will lead to sustainable data offers and services that can be deployed to many platforms, including as-yet-unknown platforms, and can be self-described with linked data semantics. This toolkit will be validated in four pilots, where novel data value chains will be built for pharma
Programme(s)
Multi-annual funding programmes that define the EU’s priorities for research and innovation.
Multi-annual funding programmes that define the EU’s priorities for research and innovation.
-
H2020-EU.2.1.1. - INDUSTRIAL LEADERSHIP - Leadership in enabling and industrial technologies - Information and Communication Technologies (ICT)
MAIN PROGRAMME
See all projects funded under this programme
Topic(s)
Calls for proposals are divided into topics. A topic defines a specific subject or area for which applicants can submit proposals. The description of a topic comprises its specific scope and the expected impact of the funded project.
Calls for proposals are divided into topics. A topic defines a specific subject or area for which applicants can submit proposals. The description of a topic comprises its specific scope and the expected impact of the funded project.
Funding Scheme
Funding scheme (or “Type of Action”) inside a programme with common features. It specifies: the scope of what is funded; the reimbursement rate; specific evaluation criteria to qualify for funding; and the use of simplified forms of costs like lump sums.
Funding scheme (or “Type of Action”) inside a programme with common features. It specifies: the scope of what is funded; the reimbursement rate; specific evaluation criteria to qualify for funding; and the use of simplified forms of costs like lump sums.
RIA - Research and Innovation action
See all projects funded under this funding scheme
Call for proposal
Procedure for inviting applicants to submit project proposals, with the aim of receiving EU funding.
Procedure for inviting applicants to submit project proposals, with the aim of receiving EU funding.
(opens in new window) H2020-ICT-2018-20
See all projects funded under this callCoordinator
Net EU financial contribution. The sum of money that the participant receives, deducted by the EU contribution to its linked third party. It considers the distribution of the EU financial contribution between direct beneficiaries of the project and other types of participants, like third-party participants.
H91 Galway
Ireland
The total costs incurred by this organisation to participate in the project, including direct and indirect costs. This amount is a subset of the overall project budget.