European Commission logo
English English
CORDIS - EU research results
CORDIS

Cross-lingual Event-centric Open Analytics Research Academy

Deliverables

Cross-lingual alignment, validation, contextualisation and OEKG (V2)

This report includes 1 A report on the crosslingual alignment validation and contextualisation methods developed in WP4 V22 A release of the corresponding software components V23 A report on The Open Event Knowledge Graph V2 released at MS4

Event-centric cross-lingual analytics, cross-cultural studies and CKPP (V2)

A report including information regarding 1 The methods for eventcentric crosslingual analytics developed in WP6 V22 The crosscultural studies conducted in WP6 V23 A release of the corresponding software components V24 A report on the Cleopatra Knowledge Processing Pipeline V2 released at MS5

Interactive access to multilingual information and hybrid computation (V2)

A report on the methods for interactive access to multilingual information and hybrid computation over crosslingual data developed in WP5 V2

Publications

Check_square at CheckThat! 2020: Claim Detection in Social Media via Fusion of Transformer and Syntactic Features

Author(s): Gullal S. Cheema, Sherzod Hakimov, Ralph Ewerth
Published in: Proceedings of 11th International Conference of the CLEF Association, CLEF 2020, 2020
Publisher: CEUR

EveOut: Reproducible Event Dataset for Studying andAnalyzing the Complex Event-Outlet Relationship

Author(s): Swati, Tomaž Erjavec, Dunja Mladenić
Published in: SiKDD 2020, Issue Vol C, 2020, Page(s) 17-20, ISBN 978-961-264-192-4
Publisher: JSI

Croatian Film Review Dataset (Cro-FiReDa): A Sentiment Annotated Dataset of Film Reviews

Author(s): G. Thakkar, N. Mikelić Preradović, M. Tadić
Published in: Slavic NLP 2023 The 9th Workshop on Slavic Natural Language Processing, 2023, Page(s) 25, ISBN 978-1-959429-57-9
Publisher: Association for Computational Linguistics
DOI: 10.48550/arxiv.2305.08173

CroSentiNews 2.0: A Sentence-Level News Sentiment Corpus

Author(s): G. Thakkar, N. Mikelić Preradović, M. Tadić
Published in: 10th Language & Technology Conference: Human Language Technologies as a Challenge for Computer Science and Linguistics, 2023, Page(s) 94
Publisher: Wydawnictwo Naukowe UAM
DOI: 10.48550/arxiv.2305.08187

User Access Models to Event-Centric Information

Author(s): Sara Abdollahi
Published in: Companion Proceedings of the Web Conference 2022, 2022, Page(s) 329–333
Publisher: Association for Computing Machinery
DOI: 10.1145/3487553.3524193

Building and Evaluating Universal Named-Entity Recognition English corpus

Author(s): Diego Alves, Gaurish Thakkar, Marko Tadić
Published in: CEUR Workshop Proceedings, CLEOPATRA Workshop 2021 co-located with TheWebConf, Issue vol 2829, 2021, Page(s) 2-16
Publisher: CEUR

A Fair and Comprehensive Comparison of Multimodal Tweet Sentiment Analysis Methods

Author(s): Gullal S. Cheema, Sherzod Hakimov, Eric Müller-Budack, Ralph Ewerth
Published in: Workshop on Multi-ModalPre-Training for Multimedia Understanding (MMPT 2021), 2021
Publisher: ACM
DOI: 10.1145/3463945.3469058

MLM - A Benchmark Dataset for Multitask Learning with Multiple Languages and Modalities

Author(s): Jason Armitage, Endri Kacupaj, Golsa Tahmasebzadeh, Swati, Maria Maleshkova, Ralph Ewerth, Jens Lehmann
Published in: Proceedings of the 29th ACM International Conference on Information & Knowledge Management, 2020, Page(s) 2967-2974, ISBN 9781450368599
Publisher: ACM
DOI: 10.1145/3340531.3412783

Comprehensive Event Representations using Event Knowledge Graphs and Natural Language Processing

Author(s): Tin Kuculo
Published in: 2022, ISBN 978-1-4503-9130-6
Publisher: Association for Computing Machinery
DOI: 10.1145/3487553.3524199

Natural Language Processing Chains Inside a Cross-lingual Event-Centric Knowledge Pipeline for European Union Under-Resourced Languages

Author(s): Diego Alves, Gaurish Thakkar, Marko Tadić
Published in: LREC2020 SLTU-CCURL Workshop, 2020, Page(s) 153-158, ISBN 979-10-95546-35-1
Publisher: ELRA
DOI: 10.6084/m9.figshare.13142756.v1

QuoteKG: A Multilingual Knowledge Graph of Quotes

Author(s): Tin Kuculo, Simon Gottschalk, Elena Demidova
Published in: Lecture Notes in Computer Science, 2022, ISBN 978-3-031-06980-2
Publisher: Springer
DOI: 10.1007/978-3-031-06981-9_21

Multi-task Learning for Cross-Lingual Sentiment Analysis

Author(s): Gaurish Thakkar, Nives Mikelić Preradović, Marko Tadić
Published in: CEUR Workshop Proceedings, CLEOPATRA Workshop 2021 co-located with TheWebConf, Issue vol 2829, 2021, Page(s) 76-84
Publisher: CEUR

Typological Approach to Improve Dependency Parsing for Croatian Language

Author(s): Diego Alves, Božo Bekavac, Marko Tadić
Published in: Proceedings of the 20th International Workshop on Treebanks and Linguistic Theories (TLT, SyntaxFest 2021), 2021, Page(s) 1-11
Publisher: Association for Computational Linguistics

Negation Detection Using NooJ

Author(s): Gaurish Thakkar; Nives Mikelić Preradović;; Marko Tadić
Published in: 44th International Convention on Information, Communication and Electronic Technology (MIPRO), Issue 1, 2021, Page(s) 263
Publisher: IEEE
DOI: 10.23919/MIPRO52101.2021.9597013

A Dataset for Information Spreading over the News

Author(s): Abdul Sittar, Dunja Mladenić, Tomaž Erjavec
Published in: SiKDD 2020, Issue vol c, 2020, Page(s) 5-8, ISBN 978-961-264-192-4
Publisher: JSI

Quotations, Coreference Resolution, and Sentiment Annotations in Croatian News Articles: An Exploratory Study

Author(s): Jelena Sarajlić, Gaurish Thakkar, Diego Alves, Nives Mikelic Preradovic
Published in: Proceedings of the Conference on Digital Curation Technologies (Qurator 2021), Issue Vol 2836, 2021
Publisher: CEUR

OEKG: The Open Event Knowledge Graph

Author(s): Simon Gottschalk, Endri Kacupaj, Sara Abdollahi, Diego Alves, Gabriel Amaral, Elisavet Koutsiana, Tin Kuculo, Daniela Major, Caio Mello, Gullal S. Cheema, Abdul Sittar, Swati, Golsa Tahmasebzadeh and Gaurish Thakkar
Published in: CEUR Workshop Proceedings, CLEOPATRA Workshop 2021 co-located with TheWebConf, Issue vol 2829, 2021, Page(s) 61-75
Publisher: CEUR

Classification of Cross-cultural News Events

Author(s): Abdul Sittar, Dunja Mladenić
Published in: SiKDD 2021, 2021
Publisher: JSI

TIB's Visual Analytics Group at MediaEval '20: Detecting Fake News on Corona Virus and 5G Conspiracy

Author(s): Cheema, Gullal S.; Hakimov, Sherzod; Ewerth, Ralph
Published in: Proceedings of MediaEval 2020 Workshop, Issue 11, 2020
Publisher: CEUR
DOI: 10.48550/arxiv.2101.03529

Training Multimodal Systems for Classification with Multiple Objectives

Author(s): Jason Armitage, Shramana Thakur, Rishi Tripathi, Maria Maleshkova and Jens Lehmann
Published in: CEUR Workshop Proceedings, CLEOPATRA Workshop 2020 co-located with ESWC, Issue vol 2611, 2020, Page(s) 57-71
Publisher: CEUR

Context Transformer with Stacked Pointer Networks for Conversational Question Answering over Knowledge Graphs

Author(s): Joan Plepi, Endri Kacupaj, Kuldeep Singh, Harsh Thakka, Jens Lehmann
Published in: ESWC, 2021, Page(s) 356--371
Publisher: Springer
DOI: 10.1007/978-3-030-77385-4_21

TIB-VA at SemEval-2022 Task 5: A Multimodal Architecture for the Detection and Classification of Misogynous Memes

Author(s): Sherzod Hakimov, Gullal S. Cheema, Ralph Ewerth
Published in: SemEval Task 5 co-located with NAACL 2022, 2022
Publisher: Association for Computational Linguistics

UNER: Universal Named-Entity Recognition Framework

Author(s): Diego Fernando Válio Antunes Alves, Tin Kuculo, Gabriel Amaral, Gaurish Thakkar and Marko Tadic
Published in: CEUR Workshop Proceedings, CLEOPATRA Workshop 2020 co-located with ESWC, Issue vol 2611, 2020, Page(s) 72-79
Publisher: CEUR

MM-Claims: A Dataset for Multimodal Claim Detection in Social Media

Author(s): Gullal S. Cheema, Sherzod Hakimov, Abdul Sittar, Eric Müller-Budack, Christian Otto, Ralph Ewerth
Published in: NAACL Findings, 2022
Publisher: Association for Computational Linguistics

Multilingual Comparative Analysis of Deep-Learning Dependency Parsing Results Using Parallel Corpora

Author(s): D. Alves, B. Bekavac, M. Tadić
Published in: Workshop on Building and Using Comparable Corpora (BUCC 2022) @LREC2022, 2022, Page(s) 33-42
Publisher: Association for Computational Linguistics

Improving Generalization for Multimodal Fake News Detection

Author(s): Sahar Tahmasebi, Sherzod Hakimov, Ralph Ewerth, Eric Müller-Budack
Published in: ICMR 2023 - International Conference on Multimedia Retrieval, 2023, ISBN 979-8-4007-0178-8
Publisher: ACM
DOI: 10.1145/3591106.3592230

Analysis of Corpus-based Word-Order Typological Methods

Author(s): Diego Alves; Božo Bekavac; Daniel Zeman; Marko Tadić
Published in: Sixth Workshop on Universal Dependencies, Issue 20, 2023, Page(s) 36-46
Publisher: Association for Computational Linguistics
DOI: 10.5281/zenodo.7947920

EventKG+Click: A Dataset of Language-specific Event-centric User Interaction Traces

Author(s): Sara Abdollahi, Simon Gottschalk and Elena Demidova
Published in: CEUR Workshop Proceedings, CLEOPATRA Workshop 2020 co-located with ESWC, Issue vol 2611, 2020, Page(s) 32-42
Publisher: CEUR

Understanding the Impact of Geographical Bias on NewsSentiment: A Case Study on London and Rio Olympics

Author(s): Swati, Dunja Mladenić
Published in: SiKDD 2021, 2021
Publisher: JSI

On the Role of Images for Analyzing Claims in Social Media

Author(s): Cheema, Gullal S.; Hakimov, Sherzod; Müller-Budack, Eric; Ewerth, Ralph
Published in: CEUR Workshop Proceedings, CLEOPATRA Workshop 2021 co-located with TheWebConf, Issue vol 2829, 2021, Page(s) 32-46
Publisher: CEUR

A Feature Analysis for Multimodal News Retrieval

Author(s): Golsa Tahmasebzadeh, Sherzod Hakimov, Eric Müller-Budack and Ralph Ewerth
Published in: CEUR Workshop Proceedings, CLEOPATRA Workshop 2020 co-located with ESWC, Issue vol 2611, 2020, Page(s) 43-56
Publisher: CEUR

Evaluating Language Tools for Fifteen EU-official Under-resourced Languages

Author(s): Diego Alves, Gaurish Thakkar, Marko Tadić
Published in: LREC2020, 2020, Page(s) 1859-1866
Publisher: ELRA
DOI: 10.6084/m9.figshare.13142747.v1

GeoWINE: Geolocation based Wiki, Image, News and Event Retrieval

Author(s): Golsa Tahmasebzadeh; Endri Kacupaj; Eric Müller-Budack; Sherzod Hakimov; Jens Lehmann; Ralph Ewerth
Published in: SIGIR, Issue 7, 2021
Publisher: ACM
DOI: 10.1145/3404835.3462786

Corpus-based Syntactic Typological Methods for Dependency Parsing Improvement

Author(s): D. Alves, B. Bekavac, D. Zeman, M. Tadić
Published in: 5th Workshop on Research in Computational Linguistic Typology and Multilingual NLP (SIGTYP 2023), 2023, Page(s) 76-88
Publisher: Association for Computational Linguistics

"""Using the profile of publishers to predict barriers across news articles"""

Author(s): Abdul Sittar, Dunja Mladenić
Published in: CEUR Workshop Proceedings, CLEOPATRA Workshop 2021 co-located with TheWebConf, Issue vol 2829, 2021
Publisher: CEUR

Are You Following the Right News-Outlet? A MachineLearning based approach to outlet prediction

Author(s): Swati, Dunja Mladenić
Published in: SiKDD 2020, Issue vol c, 2020, Page(s) 33-36, ISBN 978-961-264-192-4
Publisher: JSI

MM-Locate-News: Multimodal Focus Location Estimation in News

Author(s): Golsa Tahmasebzadeh, Eric Müller-Budack, Sherzod Hakimov, Ralph Ewerth
Published in: MMM 2023 - International Conference on MultiMedia Modeling, Issue 2023, 2023, Page(s) 204-216
Publisher: Springer
DOI: 10.1007/978-3-031-27077-2_16

Multimodal Geolocation Estimation of News Photos

Author(s): Golsa Tahmasebzadeh, Sherzod Hakimov, Ralph Ewerth, Eric Müller-Budack
Published in: ECIR 2023 - European Conference on Information Retrieval, 2023, Page(s) 204–220, ISBN 978-3-031-27077-2
Publisher: Springer
DOI: 10.1007/978-3-031-28238-6_14

The Optimization of Portuguese Named-Entity Recognition and Classification by Combining Local Grammars and Conditional Random Fields Trained with a Parsed Corpus

Author(s): Diego Alves; Božo Bekavac; Marko Tadić
Published in: NooJ2020 conference, Issue vol 1389, 2021, Page(s) 196-205
Publisher: Springer
DOI: 10.1007/978-3-030-70629-6_17

Building Multilingual Corpora for a Complex Named Entity Recognition and Classification Hierarchy using Wikipedia and DBpedia

Author(s): Diego Alves, Gaurish Thakkar, Gabriel Amaral, Tin Kuculo, Marko Tadić
Published in: Proceedings of the Conference on Digital Curation Technologies (Qurator 2021), Issue Vol 2836, 2021
Publisher: CEUR

ParaQA: A Question Answering Dataset with Paraphrase Responses for Single-Turn Conversation

Author(s): Endri Kacupaj, Barshana Banerjee, Kuldeep Singh, Jens Lehmann
Published in: ESWC, 2021, Page(s) 598-613
Publisher: Springer
DOI: 10.1007/978-3-030-77385-4_36

Data Augmentation for Pipeline-Based Speech Translation

Author(s): Diego Alves, Askars Salimbajevs, Mārcis Pinnis
Published in: Human Language Technologies – The Baltic Perspective - Proceedings of the Ninth International Conference Baltic HLT 2020, 2020, Page(s) 73-79, ISBN 9781643681160
Publisher: IOS Press
DOI: 10.3233/faia200605

VQuAnDa: Verbalization QUestion ANswering DAtaset

Author(s): Endri Kacupaj, Hamid Zafar, Jens Lehmann, Maria Maleshkova
Published in: The Semantic Web - 17th International Conference, ESWC 2020, Heraklion, Crete, Greece, May 31–June 4, 2020, Proceedings, Issue 12123, 2020, Page(s) 531-547, ISBN 978-3-030-49460-5
Publisher: Springer International Publishing
DOI: 10.1007/978-3-030-49461-2_31

Pretraining and Fine-Tuning Strategies for Sentiment Analysis of Latvian Tweets

Author(s): Gaurish Thakkar, Mārcis Pinnis
Published in: Human Language Technologies – The Baltic Perspective - Proceedings of the Ninth International Conference Baltic HLT 2020, 2020, Page(s) 55 - 61, ISBN 9781643681160
Publisher: IOS Press
DOI: 10.3233/faia200602

EveOut: an Event-centric News Dataset to Analyze an Outlet’s Event Selection

Author(s): - Swati, Dunja Mladenić and Tomaž Erjavec
Published in: Informatica - Slovenia, Issue Vol. 45, No. 7, 2021, Page(s) 25-30, ISSN 0350-5596
Publisher: Slovensko Drustvo Informatika
DOI: 10.31449/inf.v45i7.3410

Combining sentiment analysis classifiers to explore multilingual news articles covering London 2012 and Rio 2016 Olympics

Author(s): Caio Mello, Gullal S. Cheema, Gaurish Thakkar
Published in: International Journal of Digital Humanities, 2022, ISSN 2524-7840
Publisher: Springer
DOI: 10.1007/s42803-022-00052-9

LaSER: Language-specific event recommendation

Author(s): Sara Abdollahi, Simon Gottschalk, Elena Demidova
Published in: Journal of Web Semantics, Issue 15708268, 2022, ISSN 1570-8268
Publisher: Elsevier BV
DOI: 10.1016/j.websem.2022.100759

Assessing the Quality of Sources in Wikidata Across Languages: A Hybrid Approach

Author(s): Gabriel Amaral, Alessandro Piscopo, Lucie-Aimée Kaffee, Odinaldo Rodrigues, Elena Simperl
Published in: Journal of Data and Information Quality, Issue 19361955, 2021, Page(s) 1–35, ISSN 1936-1955
Publisher: Association for Computing Machinary, Inc.
DOI: 10.1145/3484828

Political and Economic Patterns in COVID-19 News: From Lockdown to Vaccination

Author(s): Abdul Sittar, Dunja Mladenić, Marko Grobelnik
Published in: IEEE Access, Issue 21693536, 2022, Page(s) 40036-40050, ISSN 2169-3536
Publisher: Institute of Electrical and Electronics Engineers Inc.
DOI: 10.1109/access.2022.3164692

Analysis of information cascading and propagation barriers across distinctive news events

Author(s): Abdul Sittar; Dunja Mladenic; Marko Grobelnik
Published in: Journal of Intelligent Information Systems, Issue 58, 2021, Page(s) 119–152, ISSN 1573-7675
Publisher: Springer
DOI: 10.48550/arxiv.2212.07742

Searching for OpenAIRE data...

There was an error trying to search data from OpenAIRE

No results available