Skip to main content

Scalable Understanding of Multilingual Media

Objective

Media monitoring enables the global news media to be viewed in terms of emerging trends, people in the news, and the evolution of story-lines. The massive growth in the number of broadcast and Internet media channels means that current approaches can no longer cope with the scale of the problem.

The aim of SUMMA is to significantly improve media monitoring by creating a platform to automate the analysis of media streams across many languages, to aggregate and distil the content, to automatically create rich knowledge bases, and to provide visualisations to cope with this deluge of data.

SUMMA has six objectives: (1) Development of a scalable and extensible media monitoring platform; (2) Development of high-quality and richer tools for analysts and journalists; (3) Extensible automated knowledge base construction; (4) Multilingual and cross-lingual capabilities; (5) Sustainable, maintainable platform and services; (6) Dissemination and communication of project results to stakeholders and user group.

Achieving these aims will require advancing the state of the art in a number of technologies: multilingual stream processing including speech recognition, machine translation, and story identification; entity and relation extraction; natural language understanding including deep semantic parsing, summarisation, and sentiment detection; and rich visualisations based on multiple views and dealing with many data streams.

The project will focus on three use cases: (1) External media monitoring - intelligent tools to address the dramatically increased scale of the global news monitoring problem; (2) Internal media monitoring - managing content creation in several languages efficiently by ensuring content created in one language is reusable by all other languages; (3) Data journalism.

The outputs of the project will be field-tested at partners BBC and DW, and the platform will be further validated through innovation intensives such as the BBC NewsHack.

Field of science

  • /natural sciences/computer and information sciences/internet
  • /natural sciences/computer and information sciences/data science/natural language processing
  • /social sciences/media and communications/journalism
  • /humanities/languages and literature/linguistics/phonetics

Call for proposal

H2020-ICT-2015
See other projects for this call

Funding Scheme

RIA - Research and Innovation action

Coordinator

THE UNIVERSITY OF EDINBURGH
Address
Old College, South Bridge
EH8 9YL Edinburgh
United Kingdom
Activity type
Higher or Secondary Education Establishments
EU contribution
€ 1 999 112,50

Participants (7)

PRIBERAM INFORMATICA SA
Portugal
EU contribution
€ 799 140
Address
Alameda D Afonso Henriques 41 2
1000-123 Lisboa
Activity type
Private for-profit entities (excluding Higher or Secondary Education Establishments)
UNIVERSITY COLLEGE LONDON
United Kingdom
EU contribution
€ 390 802,50
Address
Gower Street
WC1E 6BT London
Activity type
Higher or Secondary Education Establishments
FONDATION DE L'INSTITUT DE RECHERCHE IDIAP
Switzerland
EU contribution
€ 0
Address
Rue Marconi 19
1920 Martigny
Activity type
Research Organisations
LETA
Latvia
EU contribution
€ 1 166 250
Address
Marijas Iela 2
1050 Riga
Activity type
Private for-profit entities (excluding Higher or Secondary Education Establishments)
BRITISH BROADCASTING CORPORATION
United Kingdom
EU contribution
€ 1 142 500
Address
Portland Place Broadcasting House
W1A 1AA London
Activity type
Public bodies (excluding Research Organisations and Secondary or Higher Education Establishments)
DEUTSCHE WELLE
Germany
EU contribution
€ 535 000
Address
Kurt Schumacher Strasse 3
53113 Bonn
Activity type
Public bodies (excluding Research Organisations and Secondary or Higher Education Establishments)
THE UNIVERSITY OF SHEFFIELD
United Kingdom
EU contribution
€ 160 556,25
Address
Firth Court Western Bank
S10 2TN Sheffield
Activity type
Higher or Secondary Education Establishments