European Commission logo
français français
CORDIS - Résultats de la recherche de l’UE
CORDIS

Cross-lingual Event-centric Open Analytics Research Academy

Livrables

Cross-lingual alignment, validation, contextualisation and OEKG (V2)

This report includes 1 A report on the crosslingual alignment validation and contextualisation methods developed in WP4 V22 A release of the corresponding software components V23 A report on The Open Event Knowledge Graph V2 released at MS4

Event-centric cross-lingual analytics, cross-cultural studies and CKPP (V2)

A report including information regarding 1 The methods for eventcentric crosslingual analytics developed in WP6 V22 The crosscultural studies conducted in WP6 V23 A release of the corresponding software components V24 A report on the Cleopatra Knowledge Processing Pipeline V2 released at MS5

Interactive access to multilingual information and hybrid computation (V2)

A report on the methods for interactive access to multilingual information and hybrid computation over crosslingual data developed in WP5 V2

Publications

Check_square at CheckThat! 2020: Claim Detection in Social Media via Fusion of Transformer and Syntactic Features

Auteurs: Gullal S. Cheema, Sherzod Hakimov, Ralph Ewerth
Publié dans: Proceedings of 11th International Conference of the CLEF Association, CLEF 2020, 2020
Éditeur: CEUR

EveOut: Reproducible Event Dataset for Studying andAnalyzing the Complex Event-Outlet Relationship

Auteurs: Swati, Tomaž Erjavec, Dunja Mladenić
Publié dans: SiKDD 2020, Numéro Vol C, 2020, Page(s) 17-20, ISBN 978-961-264-192-4
Éditeur: JSI

Croatian Film Review Dataset (Cro-FiReDa): A Sentiment Annotated Dataset of Film Reviews

Auteurs: G. Thakkar, N. Mikelić Preradović, M. Tadić
Publié dans: Slavic NLP 2023 The 9th Workshop on Slavic Natural Language Processing, 2023, Page(s) 25, ISBN 978-1-959429-57-9
Éditeur: Association for Computational Linguistics
DOI: 10.48550/arxiv.2305.08173

CroSentiNews 2.0: A Sentence-Level News Sentiment Corpus

Auteurs: G. Thakkar, N. Mikelić Preradović, M. Tadić
Publié dans: 10th Language & Technology Conference: Human Language Technologies as a Challenge for Computer Science and Linguistics, 2023, Page(s) 94
Éditeur: Wydawnictwo Naukowe UAM
DOI: 10.48550/arxiv.2305.08187

User Access Models to Event-Centric Information

Auteurs: Sara Abdollahi
Publié dans: Companion Proceedings of the Web Conference 2022, 2022, Page(s) 329–333
Éditeur: Association for Computing Machinery
DOI: 10.1145/3487553.3524193

Building and Evaluating Universal Named-Entity Recognition English corpus

Auteurs: Diego Alves, Gaurish Thakkar, Marko Tadić
Publié dans: CEUR Workshop Proceedings, CLEOPATRA Workshop 2021 co-located with TheWebConf, Numéro vol 2829, 2021, Page(s) 2-16
Éditeur: CEUR

A Fair and Comprehensive Comparison of Multimodal Tweet Sentiment Analysis Methods

Auteurs: Gullal S. Cheema, Sherzod Hakimov, Eric Müller-Budack, Ralph Ewerth
Publié dans: Workshop on Multi-ModalPre-Training for Multimedia Understanding (MMPT 2021), 2021
Éditeur: ACM
DOI: 10.1145/3463945.3469058

MLM - A Benchmark Dataset for Multitask Learning with Multiple Languages and Modalities

Auteurs: Jason Armitage, Endri Kacupaj, Golsa Tahmasebzadeh, Swati, Maria Maleshkova, Ralph Ewerth, Jens Lehmann
Publié dans: Proceedings of the 29th ACM International Conference on Information & Knowledge Management, 2020, Page(s) 2967-2974, ISBN 9781450368599
Éditeur: ACM
DOI: 10.1145/3340531.3412783

Comprehensive Event Representations using Event Knowledge Graphs and Natural Language Processing

Auteurs: Tin Kuculo
Publié dans: 2022, ISBN 978-1-4503-9130-6
Éditeur: Association for Computing Machinery
DOI: 10.1145/3487553.3524199

Natural Language Processing Chains Inside a Cross-lingual Event-Centric Knowledge Pipeline for European Union Under-Resourced Languages

Auteurs: Diego Alves, Gaurish Thakkar, Marko Tadić
Publié dans: LREC2020 SLTU-CCURL Workshop, 2020, Page(s) 153-158, ISBN 979-10-95546-35-1
Éditeur: ELRA
DOI: 10.6084/m9.figshare.13142756.v1

QuoteKG: A Multilingual Knowledge Graph of Quotes

Auteurs: Tin Kuculo, Simon Gottschalk, Elena Demidova
Publié dans: Lecture Notes in Computer Science, 2022, ISBN 978-3-031-06980-2
Éditeur: Springer
DOI: 10.1007/978-3-031-06981-9_21

Multi-task Learning for Cross-Lingual Sentiment Analysis

Auteurs: Gaurish Thakkar, Nives Mikelić Preradović, Marko Tadić
Publié dans: CEUR Workshop Proceedings, CLEOPATRA Workshop 2021 co-located with TheWebConf, Numéro vol 2829, 2021, Page(s) 76-84
Éditeur: CEUR

Typological Approach to Improve Dependency Parsing for Croatian Language

Auteurs: Diego Alves, Božo Bekavac, Marko Tadić
Publié dans: Proceedings of the 20th International Workshop on Treebanks and Linguistic Theories (TLT, SyntaxFest 2021), 2021, Page(s) 1-11
Éditeur: Association for Computational Linguistics

Negation Detection Using NooJ

Auteurs: Gaurish Thakkar; Nives Mikelić Preradović;; Marko Tadić
Publié dans: 44th International Convention on Information, Communication and Electronic Technology (MIPRO), Numéro 1, 2021, Page(s) 263
Éditeur: IEEE
DOI: 10.23919/MIPRO52101.2021.9597013

A Dataset for Information Spreading over the News

Auteurs: Abdul Sittar, Dunja Mladenić, Tomaž Erjavec
Publié dans: SiKDD 2020, Numéro vol c, 2020, Page(s) 5-8, ISBN 978-961-264-192-4
Éditeur: JSI

Quotations, Coreference Resolution, and Sentiment Annotations in Croatian News Articles: An Exploratory Study

Auteurs: Jelena Sarajlić, Gaurish Thakkar, Diego Alves, Nives Mikelic Preradovic
Publié dans: Proceedings of the Conference on Digital Curation Technologies (Qurator 2021), Numéro Vol 2836, 2021
Éditeur: CEUR

OEKG: The Open Event Knowledge Graph

Auteurs: Simon Gottschalk, Endri Kacupaj, Sara Abdollahi, Diego Alves, Gabriel Amaral, Elisavet Koutsiana, Tin Kuculo, Daniela Major, Caio Mello, Gullal S. Cheema, Abdul Sittar, Swati, Golsa Tahmasebzadeh and Gaurish Thakkar
Publié dans: CEUR Workshop Proceedings, CLEOPATRA Workshop 2021 co-located with TheWebConf, Numéro vol 2829, 2021, Page(s) 61-75
Éditeur: CEUR

Classification of Cross-cultural News Events

Auteurs: Abdul Sittar, Dunja Mladenić
Publié dans: SiKDD 2021, 2021
Éditeur: JSI

TIB's Visual Analytics Group at MediaEval '20: Detecting Fake News on Corona Virus and 5G Conspiracy

Auteurs: Cheema, Gullal S.; Hakimov, Sherzod; Ewerth, Ralph
Publié dans: Proceedings of MediaEval 2020 Workshop, Numéro 11, 2020
Éditeur: CEUR
DOI: 10.48550/arxiv.2101.03529

Training Multimodal Systems for Classification with Multiple Objectives

Auteurs: Jason Armitage, Shramana Thakur, Rishi Tripathi, Maria Maleshkova and Jens Lehmann
Publié dans: CEUR Workshop Proceedings, CLEOPATRA Workshop 2020 co-located with ESWC, Numéro vol 2611, 2020, Page(s) 57-71
Éditeur: CEUR

Context Transformer with Stacked Pointer Networks for Conversational Question Answering over Knowledge Graphs

Auteurs: Joan Plepi, Endri Kacupaj, Kuldeep Singh, Harsh Thakka, Jens Lehmann
Publié dans: ESWC, 2021, Page(s) 356--371
Éditeur: Springer
DOI: 10.1007/978-3-030-77385-4_21

TIB-VA at SemEval-2022 Task 5: A Multimodal Architecture for the Detection and Classification of Misogynous Memes

Auteurs: Sherzod Hakimov, Gullal S. Cheema, Ralph Ewerth
Publié dans: SemEval Task 5 co-located with NAACL 2022, 2022
Éditeur: Association for Computational Linguistics

UNER: Universal Named-Entity Recognition Framework

Auteurs: Diego Fernando Válio Antunes Alves, Tin Kuculo, Gabriel Amaral, Gaurish Thakkar and Marko Tadic
Publié dans: CEUR Workshop Proceedings, CLEOPATRA Workshop 2020 co-located with ESWC, Numéro vol 2611, 2020, Page(s) 72-79
Éditeur: CEUR

MM-Claims: A Dataset for Multimodal Claim Detection in Social Media

Auteurs: Gullal S. Cheema, Sherzod Hakimov, Abdul Sittar, Eric Müller-Budack, Christian Otto, Ralph Ewerth
Publié dans: NAACL Findings, 2022
Éditeur: Association for Computational Linguistics

Multilingual Comparative Analysis of Deep-Learning Dependency Parsing Results Using Parallel Corpora

Auteurs: D. Alves, B. Bekavac, M. Tadić
Publié dans: Workshop on Building and Using Comparable Corpora (BUCC 2022) @LREC2022, 2022, Page(s) 33-42
Éditeur: Association for Computational Linguistics

Improving Generalization for Multimodal Fake News Detection

Auteurs: Sahar Tahmasebi, Sherzod Hakimov, Ralph Ewerth, Eric Müller-Budack
Publié dans: ICMR 2023 - International Conference on Multimedia Retrieval, 2023, ISBN 979-8-4007-0178-8
Éditeur: ACM
DOI: 10.1145/3591106.3592230

Analysis of Corpus-based Word-Order Typological Methods

Auteurs: Diego Alves; Božo Bekavac; Daniel Zeman; Marko Tadić
Publié dans: Sixth Workshop on Universal Dependencies, Numéro 20, 2023, Page(s) 36-46
Éditeur: Association for Computational Linguistics
DOI: 10.5281/zenodo.7947920

EventKG+Click: A Dataset of Language-specific Event-centric User Interaction Traces

Auteurs: Sara Abdollahi, Simon Gottschalk and Elena Demidova
Publié dans: CEUR Workshop Proceedings, CLEOPATRA Workshop 2020 co-located with ESWC, Numéro vol 2611, 2020, Page(s) 32-42
Éditeur: CEUR

Understanding the Impact of Geographical Bias on NewsSentiment: A Case Study on London and Rio Olympics

Auteurs: Swati, Dunja Mladenić
Publié dans: SiKDD 2021, 2021
Éditeur: JSI

On the Role of Images for Analyzing Claims in Social Media

Auteurs: Cheema, Gullal S.; Hakimov, Sherzod; Müller-Budack, Eric; Ewerth, Ralph
Publié dans: CEUR Workshop Proceedings, CLEOPATRA Workshop 2021 co-located with TheWebConf, Numéro vol 2829, 2021, Page(s) 32-46
Éditeur: CEUR

A Feature Analysis for Multimodal News Retrieval

Auteurs: Golsa Tahmasebzadeh, Sherzod Hakimov, Eric Müller-Budack and Ralph Ewerth
Publié dans: CEUR Workshop Proceedings, CLEOPATRA Workshop 2020 co-located with ESWC, Numéro vol 2611, 2020, Page(s) 43-56
Éditeur: CEUR

Evaluating Language Tools for Fifteen EU-official Under-resourced Languages

Auteurs: Diego Alves, Gaurish Thakkar, Marko Tadić
Publié dans: LREC2020, 2020, Page(s) 1859-1866
Éditeur: ELRA
DOI: 10.6084/m9.figshare.13142747.v1

GeoWINE: Geolocation based Wiki, Image, News and Event Retrieval

Auteurs: Golsa Tahmasebzadeh; Endri Kacupaj; Eric Müller-Budack; Sherzod Hakimov; Jens Lehmann; Ralph Ewerth
Publié dans: SIGIR, Numéro 7, 2021
Éditeur: ACM
DOI: 10.1145/3404835.3462786

Corpus-based Syntactic Typological Methods for Dependency Parsing Improvement

Auteurs: D. Alves, B. Bekavac, D. Zeman, M. Tadić
Publié dans: 5th Workshop on Research in Computational Linguistic Typology and Multilingual NLP (SIGTYP 2023), 2023, Page(s) 76-88
Éditeur: Association for Computational Linguistics

"""Using the profile of publishers to predict barriers across news articles"""

Auteurs: Abdul Sittar, Dunja Mladenić
Publié dans: CEUR Workshop Proceedings, CLEOPATRA Workshop 2021 co-located with TheWebConf, Numéro vol 2829, 2021
Éditeur: CEUR

Are You Following the Right News-Outlet? A MachineLearning based approach to outlet prediction

Auteurs: Swati, Dunja Mladenić
Publié dans: SiKDD 2020, Numéro vol c, 2020, Page(s) 33-36, ISBN 978-961-264-192-4
Éditeur: JSI

MM-Locate-News: Multimodal Focus Location Estimation in News

Auteurs: Golsa Tahmasebzadeh, Eric Müller-Budack, Sherzod Hakimov, Ralph Ewerth
Publié dans: MMM 2023 - International Conference on MultiMedia Modeling, Numéro 2023, 2023, Page(s) 204-216
Éditeur: Springer
DOI: 10.1007/978-3-031-27077-2_16

Multimodal Geolocation Estimation of News Photos

Auteurs: Golsa Tahmasebzadeh, Sherzod Hakimov, Ralph Ewerth, Eric Müller-Budack
Publié dans: ECIR 2023 - European Conference on Information Retrieval, 2023, Page(s) 204–220, ISBN 978-3-031-27077-2
Éditeur: Springer
DOI: 10.1007/978-3-031-28238-6_14

The Optimization of Portuguese Named-Entity Recognition and Classification by Combining Local Grammars and Conditional Random Fields Trained with a Parsed Corpus

Auteurs: Diego Alves; Božo Bekavac; Marko Tadić
Publié dans: NooJ2020 conference, Numéro vol 1389, 2021, Page(s) 196-205
Éditeur: Springer
DOI: 10.1007/978-3-030-70629-6_17

Building Multilingual Corpora for a Complex Named Entity Recognition and Classification Hierarchy using Wikipedia and DBpedia

Auteurs: Diego Alves, Gaurish Thakkar, Gabriel Amaral, Tin Kuculo, Marko Tadić
Publié dans: Proceedings of the Conference on Digital Curation Technologies (Qurator 2021), Numéro Vol 2836, 2021
Éditeur: CEUR

ParaQA: A Question Answering Dataset with Paraphrase Responses for Single-Turn Conversation

Auteurs: Endri Kacupaj, Barshana Banerjee, Kuldeep Singh, Jens Lehmann
Publié dans: ESWC, 2021, Page(s) 598-613
Éditeur: Springer
DOI: 10.1007/978-3-030-77385-4_36

Data Augmentation for Pipeline-Based Speech Translation

Auteurs: Diego Alves, Askars Salimbajevs, Mārcis Pinnis
Publié dans: Human Language Technologies – The Baltic Perspective - Proceedings of the Ninth International Conference Baltic HLT 2020, 2020, Page(s) 73-79, ISBN 9781643681160
Éditeur: IOS Press
DOI: 10.3233/faia200605

VQuAnDa: Verbalization QUestion ANswering DAtaset

Auteurs: Endri Kacupaj, Hamid Zafar, Jens Lehmann, Maria Maleshkova
Publié dans: The Semantic Web - 17th International Conference, ESWC 2020, Heraklion, Crete, Greece, May 31–June 4, 2020, Proceedings, Numéro 12123, 2020, Page(s) 531-547, ISBN 978-3-030-49460-5
Éditeur: Springer International Publishing
DOI: 10.1007/978-3-030-49461-2_31

Pretraining and Fine-Tuning Strategies for Sentiment Analysis of Latvian Tweets

Auteurs: Gaurish Thakkar, Mārcis Pinnis
Publié dans: Human Language Technologies – The Baltic Perspective - Proceedings of the Ninth International Conference Baltic HLT 2020, 2020, Page(s) 55 - 61, ISBN 9781643681160
Éditeur: IOS Press
DOI: 10.3233/faia200602

EveOut: an Event-centric News Dataset to Analyze an Outlet’s Event Selection

Auteurs: - Swati, Dunja Mladenić and Tomaž Erjavec
Publié dans: Informatica - Slovenia, Numéro Vol. 45, No. 7, 2021, Page(s) 25-30, ISSN 0350-5596
Éditeur: Slovensko Drustvo Informatika
DOI: 10.31449/inf.v45i7.3410

Combining sentiment analysis classifiers to explore multilingual news articles covering London 2012 and Rio 2016 Olympics

Auteurs: Caio Mello, Gullal S. Cheema, Gaurish Thakkar
Publié dans: International Journal of Digital Humanities, 2022, ISSN 2524-7840
Éditeur: Springer
DOI: 10.1007/s42803-022-00052-9

LaSER: Language-specific event recommendation

Auteurs: Sara Abdollahi, Simon Gottschalk, Elena Demidova
Publié dans: Journal of Web Semantics, Numéro 15708268, 2022, ISSN 1570-8268
Éditeur: Elsevier BV
DOI: 10.1016/j.websem.2022.100759

Assessing the Quality of Sources in Wikidata Across Languages: A Hybrid Approach

Auteurs: Gabriel Amaral, Alessandro Piscopo, Lucie-Aimée Kaffee, Odinaldo Rodrigues, Elena Simperl
Publié dans: Journal of Data and Information Quality, Numéro 19361955, 2021, Page(s) 1–35, ISSN 1936-1955
Éditeur: Association for Computing Machinary, Inc.
DOI: 10.1145/3484828

Political and Economic Patterns in COVID-19 News: From Lockdown to Vaccination

Auteurs: Abdul Sittar, Dunja Mladenić, Marko Grobelnik
Publié dans: IEEE Access, Numéro 21693536, 2022, Page(s) 40036-40050, ISSN 2169-3536
Éditeur: Institute of Electrical and Electronics Engineers Inc.
DOI: 10.1109/access.2022.3164692

Analysis of information cascading and propagation barriers across distinctive news events

Auteurs: Abdul Sittar; Dunja Mladenic; Marko Grobelnik
Publié dans: Journal of Intelligent Information Systems, Numéro 58, 2021, Page(s) 119–152, ISSN 1573-7675
Éditeur: Springer
DOI: 10.48550/arxiv.2212.07742

Recherche de données OpenAIRE...

Une erreur s’est produite lors de la recherche de données OpenAIRE

Aucun résultat disponible