CORDIS - EU research results
Content archived on 2024-06-18

Language Resource Pool for Sentiment Analysis in European Languages

Project description


SME initiative on Digital Content and Languages
Interoperable pool of shared language resources for Sentiment Analysis services

In recent years, the use of social networks and blogs has grown sharply, and citizens and consumers now widely express their opinions through these channels on topics such as politics, society and media. However, the development of systems for sentiment analysis of these opinions is hampered by the difficulty of accessing and obtaining the necessary language resources, for several reasons:
(i) language resource owners fear losing competitiveness;
(ii) there is no agreed language resource schema for sentiment analysis, and the magnitudes used to measure sentiment strength are not normalised;
(iii) adapting existing language resources for sentiment analysis is costly;
(iv) the language resources have reduced visibility, accessibility and interoperability.
The project aims to develop a large shared data pool of language resources for use by sentiment analysis systems, bundling together resources that are currently scattered. One goal is to extend the WordNet Domain to sentiment analysis. The project will also specify a schema for sentiment analysis and normalise the metrics used for sentiment strength. The sharing of resources will be supported by a self-sustainable and profitable framework based on a community governance model, offering contributors the possibility of commercially exploiting the resources they provide.
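As an illustration of what normalising sentiment strength metrics could involve, the sketch below linearly rescales scores from two differently scaled resources onto a common range. It is only a minimal example under stated assumptions: the target range [-1, 1], the linear mapping, and the names `SourceScale` and `normalise_strength` are hypothetical and are not taken from the project's schema.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class SourceScale:
    """Score range used by one hypothetical language resource."""
    low: float
    high: float


def normalise_strength(score: float, scale: SourceScale) -> float:
    """Linearly map a score from its source range onto [-1.0, 1.0].

    The target range is an assumption made for this example only.
    """
    if scale.high <= scale.low:
        raise ValueError("scale.high must be greater than scale.low")
    if not scale.low <= score <= scale.high:
        raise ValueError("score lies outside the declared source range")
    return 2.0 * (score - scale.low) / (scale.high - scale.low) - 1.0


# Two hypothetical lexicons that rate sentiment on different scales.
five_star = SourceScale(low=1.0, high=5.0)
signed_hundred = SourceScale(low=-100.0, high=100.0)

print(normalise_strength(4.0, five_star))        # 0.5
print(normalise_strength(50.0, signed_hundred))  # 0.5
```

Once both scores are on the same range, a rating of 4 out of 5 stars and a score of 50 on a -100 to 100 scale can be compared directly.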
The project is structured around the following steps:
- definition of a common schema to ensure interoperability;
- acquisition and clean-up of language resources;
- deployment of the resources;
- validation through opinion mining demonstrators in the hotel and electronics domains.
The targeted users are business-to-business (B2B) users, including service developers, content providers and language resource (LR) owners. The data pool will cover six languages: English, Catalan, German, Italian, Portuguese and Spanish.

Call for proposal

FP7-ICT-2011-SME-DCL

Coordinator Contact

Daniel Molina (Mr.)

Coordinator

PARADIGMA DIGITAL SL
EU contribution
€ 472 466,00
Address
VIA DE LAS DOS CASTILLAS 33 ATICA 4 PLANTA 2
28224 Pozuelo De Alarcon
Spain

Region
Comunidad de Madrid, Madrid
Administrative Contact
Fernando Jiménez (Mr.)
Total cost
No data

Participants (5)