CORDIS - EU research results

Cost-effective, Multilingual, Privacy-driven voice-enabled Services

Project description

The future of voice interaction technology is now

Voice-operated technologies have made great strides in recent years. In fact, voice is fast replacing touch and text as the main means of interaction with modern devices. In this context, the EU-funded COMPRISE project will develop next-generation voice interaction technology. Its aim is to reduce the cost of collecting and labelling voice data across a variety of languages, making the technology more inclusive. Currently, voice interaction technologies are strongly biased towards English and other languages with a large user base. The sustainability of the resulting ecosystem, which will serve speakers of under-resourced languages or accented speakers, will be demonstrated in three sectors with high commercial impact: smart consumer apps, e-commerce, and e-health.


Alongside visual and tactile interaction, the Next Generation Internet will rely more and more on voice interaction. This technology requires huge amounts of speech and language data in every language to reach state-of-the-art performance. The standard today is to store the voices of end users in the cloud and label them manually. This approach raises critical privacy concerns, limits the number of deployed languages, and has led to market and data concentration in the hands of big non-European companies such as Google and Facebook.

COMPRISE defines a fully private-by-design methodology and tools that will reduce the cost and increase the inclusiveness of voice interaction technology through research advances on privacy-driven data transformations, personalised learning, automatic labelling, and integrated translation. This leads to a holistic, easy-to-use software development kit that interoperates with a cloud-based resource platform. The sustainability of this new ecosystem will be demonstrated in three sectors with high commercial impact: smart consumer apps, e-commerce, and e-health.

COMPRISE will address the mission-oriented challenges of privacy-by-design, inclusiveness, and cost-effectiveness in a sector-agnostic way; allow virtually unlimited collection of real-life, non-private, quality speech and language data; enable businesses in the Digital Single Market to quickly develop voice-enabled services in many languages; allow all citizens to transparently access content and services available in other languages by voice interaction in their own language; and result in cost savings for both technology providers and users.

COMPRISE will find application in many sectors beyond those demonstrated, e.g. e-government, e-justice, e-learning, tourism, culture, and media. It will have a huge societal impact in terms of unprecedented verifiable privacy guarantees, service to speakers of under-resourced languages or accented speakers, and overall user experience.

Net EU contribution
€ 789 790,00
78153 Le Chesnay Cedex

Ile-de-France Ile-de-France Yvelines
Activity type
Research Organisations
Total cost
€ 881 740,00

Participants (7)