CORDIS - EU research results

Computational Literary Studies Infrastructure

CORDIS provides links to public deliverables and publications of HORIZON projects.

Links to deliverables and publications from FP7 projects, as well as links to some specific result types such as dataset and software, are dynamically retrieved from OpenAIRE.

Deliverables

Inventory of existing data sources and formats supported/required

Inventory of existing data sources and formats supported/required by tools and services & Transformation matrix capturing available and needed transformation paths.

Report on programmable corpora

Report on and prototype of programmable corpora as part of a distributed ecosystem for CLS research.

Case studies in data preparation and sharing

This deliverable will document a number of use cases within the community, in coordination with the relevant work packages (WPs), to create case studies for digitisation and transformation processes.

Series of five short survey papers on key methodological concerns

Series of five short survey papers on key methodological concerns.

Extended transformation matrix including alternative formats

This deliverable will explore the formats used or usable beyond the default format TEI and put them in relation to each other.

Report on versioning requirements of APIs and corpora within CLS

Requirements analysis for versioning of programmable corpora (APIs & data) is performed and a concept is implemented in the form of a technical prototype.

Report and prototypes for annotation as enrichment

Report and prototypes to guide the implementation of text annotation as enrichment.

Initial plan for communication and dissemination, including Identity Guidelines

Initial plan for communication and dissemination of project results and RI access, including audience-instrument mapping and communications KPIs.

Review reports documenting the state of literary data

This review reveals which data is available across the genres and disciplines including a wide variety of formats, languages and designs in Europe.

Report of the tools for the basic NLP tasks in the CLS context

Report of the tools for the basic NLP tasks in the CLS context, in light of toolchain extensibility into a greater linguistic variety.

Report on the methodological baseline

Report on the methodological baseline for (computational) literary studies.

Four showcases accompanied by explanatory papers

Four showcases will be conducted and documented: (a) Using ELTeC novels for stylometric analysis in multiple languages (UT, PAS); (b) structural analysis of character networks in European drama (UP); (c) poetry scansion, i.e. the extraction of stress patterns of lines or verses (UNED); (d) spatio-temporal mapping in correspondences and/or diaries using Linked Open Data (OEAW).

API libraries for R and Python

This deliverable will provide libraries for R and Python, which will wrap all API functionalities and bring them directly into two of the most widespread development environments in CLS.

Report on the skills matrix and gap analysis

Report on the skills matrix for computational literary studies, covering several perspectives on competency development for the field.

Toolkit report for data sharing between researchers and institutions

This deliverable will produce a report that features: a) an overview of the factors that help or hinder sharing; b) case studies highlighting good practice examples of successful negotiations of the challenges inherent in these relationships; c) reusable template policy instruments.

Project DMP and ORDP

This document will outline all project-level policies regarding the responsible management and curation of data during and after the project, as per the DCC DMPonline format.

Publications

CLS INFRA D8.1 Report of the tools for the basic Natural Language Processing (NLP) tasks in the CLS context

Author(s): Cinková, Silvie; Birkholz, Julie M.; Börner, Ingo; Dejaeghere, Tess; Heiden, Serge; Janssen, Maarten; Křen, Michal; Pozo, Alvaro Perez
Published in: 2023
Publisher: CLS INFRA
DOI: 10.5281/zenodo.7951060

CLS INFRA D2.1 Communication and Dissemination Plan

Author(s): Ciara Murphy, Justin Tonra
Published in: 2021
Publisher: CLS INFRA
DOI: 10.5281/zenodo.5512567

Streamlining poetry research with Averell

Author(s): Álvaro Pérez Pozo, Javier de la Rosa, Aitor Díaz, Salvador Ros, Elena González-Blanco
Published in: 2023
Publisher: CLS INFRA

CLS INFRA D6.1 Inventory of existing data sources and formats

Author(s): Ďurčo, Matej; Charvat, Vera Maria; Börner, Ingo; Mrugalski, Michał; Odebrecht, Carolin
Published in: 2022
Publisher: CLS INFRA
DOI: 10.5281/zenodo.7520287

Beyond Academia: Uses of CLS Tools and Methods for Journalists, Policy Makers, GLAM and Medical Humanities

Author(s): Hoover, Sarah; Yakupova, Vera; Edmond, Jennifer
Published in: 2025
Publisher: CLS INFRA: Computational Literary Studies Infrastructure
DOI: 10.5281/zenodo.15212772

CLS INFRA D5.1. Review of the Data Landscape

Author(s): Mrugalski, Michał; Odebrecht, Carolin; Charvat, Vera; Börner, Ingo; Durco, Matej
Published in: 2022
Publisher: CLS INFRA
DOI: 10.5281/zenodo.6856059

CLS INFRA D4.1 Skills Gap Analysis

Author(s): Lisanne M. van Rossum; Artjoms Šeļa
Published in: 2022
Publisher: CLS INFRA
DOI: 10.5281/zenodo.6421513

CLS INFRA D7.2 API Libraries for R and Python for the Programmable Corpora Prototype “DraCor”: “rdracor” and “pydracor”

Author(s): Sluyter-Gäthje, Henny; Trilcke, Peer; Börner, Ingo
Published in: 2023
Publisher: CLS INFRA

CLS INFRA D3.4 Position Papers and Pilot Studies on Emerging Trends in CLS

Author(s): Havrylash, Julia; Fileva, Evgeniia; Bruchertseifer, Ruth; Schöch, Christof
Published in: 2025
Publisher: CLS INFRA: Computational Literary Studies Infrastructure
DOI: 10.5281/zenodo.15131773

CLS INFRA D8.2 Report and Prototypes for Annotation as Enrichment

Author(s): Birkholz, Julie; Cinková, Silvie; Decorde, Serge; Janssen, Maarten; Křen, Michal; Pozo, Alvaro Perez; Diego Fresno Fernandez, Victor; Ros, Salvador
Published in: 2024
Publisher: CLS INFRA

CLS INFRA D3.2: Series of Five Short Survey Papers on Methodological Issues

Author(s): Schöch, Christof; Dudar, Julia; Fileva, Evgeniia
Published in: 2023
Publisher: CLS INFRA
DOI: 10.5281/zenodo.7782364

Questionnaire of the survey: How do you Compose your Literary Corpus or Literary Collection?

Author(s): Calvo Tello, José; Rißler-Pipka, Nanette; Barth, Florian; Jung, Kerstin; Schöch, Christof
Published in: 2023
Publisher: CLS INFRA

Poesie als Fehler. Ein 'Tool Misuse'-Experiment zur Prozessierung von Lyrik

Author(s): Sluyter-Gäthje, Henny; Trilcke, Peer
Published in: DHd 2022: Kulturen des digitalen Gedächtnisses, 2022, Page(s) 190-193
Publisher: CLS INFRA
DOI: 10.5281/zenodo.6328200

CLS INFRA D3.5 User Needs Beyond Academic Research for Computational Literary Analysis

Author(s): Edmond, Jennifer; Yakupova, Vera; Schöch, Christof (contributor); Eder, Maciej (contributor)
Published in: 2024
Publisher: Zenodo

CLS INFRA D5.2 Case Studies in Data Preparation and Sharing

Author(s): Mrugalski, Michał; Blietz, Annika; Börner, Ingo; Bauer, Elisabeth; Charvat, Vera; Ďurčo, Matej; Laszakovits, Sabine; Resch, Stefan
Published in: 2023
Publisher: CLS INFRA
DOI: 10.5281/zenodo.8115923

CLS INFRA D3.1 Baseline Methodological User Needs Analysis

Author(s): Christof Schöch, Evgeniia Fileva, Julia Dudar
Published in: 2022
Publisher: CLS INFRA
DOI: 10.5281/zenodo.6389333

LyricSIM: Un nuevo dataset y benchmark para la detección de similitud en letras de canciones en español

Author(s): Benito-Santos, Alejandro; Ghajari, Adrián; Hernández, Pedro; Fresno, Victor; Ros, Salvador; González-Blanco, Elena
Published in: 2023
Publisher: Cornell University
DOI: 10.48550/arxiv.2306.01325

CLS INFRA D6.3 Standards beyond TEI / Extended Transformation Matrix / Alternative Formats

Author(s): Ďurčo, Matej; Charvát, Vera Maria; Resch, Stefan; Börner, Ingo; Plank, Lukas
Published in: 2023
Publisher: CLS INFRA
DOI: 10.5281/zenodo.10209607

CLS INFRA D3.3 Showcases for the application of CLS methods and tools

Author(s): Christof Schöch et al.
Published in: 2024
Publisher: CLS INFRA
DOI: 10.5281/zenodo.10912516

CLS INFRA D7.3 On Versioning Living and Programmable Corpora (Executable) Report and Prototypes for Reproducible Research

Author(s): Trilcke, Peer; Börner, Ingo
Published in: 2023
Publisher: CLS INFRA

CLS INFRA D7.1 On Programmable Corpora

Author(s): Börner, Ingo; Trilcke, Peer
Published in: 2023
Publisher: CLS INFRA
DOI: 10.5281/zenodo.7664964

CLS INFRA D1.1 Data Management Plan and Open Research Data Pilot (V1, M6)

Author(s): Tóth-Czifra, Erzsébet; Edmond, Jennifer; Eder, Maciej; Odebrecht, Caroline; Cinkova, Silvie; Börner, Ingo; Birkholz, Julie; Chambers, Sally; Schöch, Christoph; Durco, Matej; Fischer, Frank; Tonra, Justin
Published in: 2021
Publisher: CLS INFRA
DOI: 10.5281/zenodo.5566785

Plataforma de exploración de la Composición Semántica a partir de Modelos de Lenguaje pre-entrenados y embeddings estáticos

Author(s): Adrian Ghajari, Victor Fresno, Enrique Amigó
Published in: Annual Conference of the Spanish Association for Natural Language Processing, Issue 5, 2022, Page(s) 52-56
Publisher: ceur-ws.org

Computational Literary Studies Data Landscape Review

Author(s): Ingo Börner, Vera Maria Charvat, Matej Ďurčo, Michał Mrugalski, Carolin Odebrecht
Published in: DHd 2022: Kulturen des digitalen Gedächtnisses, 2022, Page(s) 272-273
Publisher: CLS INFRA

TEITOK API - Programmable DH Corpora

Author(s): Janssen, Maarten
Published in: Digital Humanities 2023: Book of Abstracts, 2023
Publisher: ADHO
DOI: 10.5281/zenodo.8107984

Dockerizing DraCor – A Container-based Approach to Reproducibility in Computational Literary Studies

Author(s): Boerner, Ingo; Trilcke, Peer; Milling, Carsten; Fischer, Frank; Sluyter-Gäthje, Henny
Published in: Digital Humanities 2023: Book of Abstracts, Issue 3, 2023, Page(s) 293-295
Publisher: CLS INFRA
DOI: 10.5281/zenodo.8107836

a H2020 Research Infrastructure Project that aids to connect researchers, data, and methods

Author(s): Birkholz, Julie M.; Börner, Ingo; Chambers, Sally; Cinková, Silvie; van Dalen-Oskam, Karina; Dejaeghere, Tess; Dudar, Julia; Eder, Maciej; Edmond, Jennifer; Garnett, Vicky; Kren, Michal; Mrugalski, Michal; Murphy, Ciara L.; Odebrecht, Carolin; Papaki, Eliza; Raciti, Marco; van Rossum, Lisanne; Schöch, Christof; Šela, Artjoms; Sharma, Srishti; Tonra, Justin; Tóth-Czifra, Erzsébet; Trilcke, Pe
Published in: DH Benelux 2022, 2022
Publisher: ADHO
DOI: 10.5281/zenodo.6573891

Distributed Corpus Building in Literary Studies: The DraCor Example

Author(s): Giovannini, Luca; Skorinkin, Daniil; Trilcke, Peer; Börner, Ingo; Fischer, Frank; Dudar, Julia; Milling, Carsten; Pořízka, Petr
Published in: Digital Humanities 2023: Book of Abstracts, 2023, Page(s) 513-515
Publisher: ADHO
DOI: 10.5281/zenodo.8107457

Finding Haiku – Enhancing Findability and Accessibility of Poetry Resources in Multi-genre Collections across Different Languages

Author(s): Mrugalski, Michał; Charvat, Vera Maria; Börner, Ingo; Durco, Matej; Laszkovits, Sabine; Resch, Stefan
Published in: Digital Humanities 2023: Book of Abstracts, 2023, Page(s) 125-127
Publisher: CLS INFRA
DOI: 10.5281/zenodo.8107747

Computational Literary Studies Infrastructure (CLS INFRA): Initial Findings and Conclusions for the Field

Author(s): Birkholz, Julie M.; Börner, Ingo; Byszuk, Joanna; Chambers, Sally; Charvat, Vera Maria; Cinková, Silvie; Dejaeghere, Tess; Dudar, Julia; Ďurčo, Matej; Eder, Maciej; Edmond, Jennifer; Fileva, Evgeniia; Fischer, Frank; Garnett, Vicky; Heiden, Serge; Křen, Michal; Kunda, Bartłomiej; Laszakovits, Sabine; Mrugalski, Michał; Papaki, Eliza; Raciti, Marco; Resch, Stefan; Ros, Salvador; Schöch, Chr
Published in: Digital Humanities 2023: Book, 2023, Page(s) 497-498
Publisher: Graz Universitat
DOI: 10.5281/zenodo.8107903

Towards a sustainable community effort: training NLP data for under-resourced language domains in CLS

Author(s): Cinková, Silvie; Janssen, Maarten
Published in: 2023
Publisher: CLS INFRA

CLARIN in Training and Education

Author(s): Koenraad De Smedt, Iulianna Van der Lek, Henk Van den Heuvel, Antonio Balvet, Maarten Janssen, Silvie Cinková, Amelia Sanz, Stavros Assimakopoulos, Louis Ten Bosch
Published in: Linköping Electronic Conference Proceedings, Selected papers from the CLARIN Annual Conference 2023, Issue 2024, 2024, ISBN 978-91-8075-740-9
Publisher: Linköping University Electronic Press
DOI: 10.3384/ecp210011

How Corpus Analysis Helps Operationalize Research Questions and Entices Literary Scholars to Learn Programming

Author(s): Cinková, Silvie; Cvrček, Václav; Janssen, Maarten; Křen, Michal
Published in: Digital Humanities 2023: Book of Abstracts, 2023, Page(s) 323-325
Publisher: CLS INFRA
DOI: 10.5281/zenodo.8107804

Einführung in DraCor - Programmable Corpora für die digitale Dramenanalyse

Author(s): Börner, Ingo; Fischer, Frank; Milling, Carsten; Sluyter-Gäthje, Henny
Published in: DHd 2022 Kulturen des digitalen Gedächtnisses, 2022
Publisher: Universität Potsdam
DOI: 10.5281/zenodo.6327951

Programmable Corpora: Introducing DraCor, an Infrastructure for the Research on European Drama

Author(s): Fischer, Frank; Börner, Ingo; Göbel, Mathias; Hechtl, Angelika; Kittel, Christopher; Milling, Carsten; Trilcke, Peer
Published in: Digital Humanities 2019: Book of Abstracts, 2019
Publisher: ADHO

Computational Literary Studies Infrastructure (CLS INFRA): a project to connect people, data, tools, and methods

Author(s): Julie Birkholz, Ingo Börner, Sally Chambers, Vera Charvat, Silvie Cinková, Tess Dejaeghere, Julia Dudar, Matej Ďurčo, Maciej Eder, Jennifer Edmond, Evgeniia Fileva, Frank Fischer, Serge Heiden, Michal Křen, Bartłomiej Kunda, Michał Mrugalski, Ciara Murphy, Carolin Odebrecht, Marco Raciti, Salvador Ros, Christof Schöch, Artjoms Šeļa, Toma Tasovac, Justin Tonra, Erzsébet Tóth-Czifra, P
Published in: Digital Humanities 2022: Conference Abstracts, Issue 2, 2022, Page(s) 624-627
Publisher: CLS INFRA

Onboard onto DraCor. Prototyping Workflows to Homogenize Drama Corpora for an Open Infrastructure

Author(s): Börner, Ingo; Fischer, Frank; Giovannini, Luca; Lu, Christopher; Milling, Carsten; Skorinkin, Daniil; Sluyter-Gäthje, Henny; Trilcke, Peer
Published in: DHd 2023 Open Humanities Open Culture, 2023
Publisher: ADHO
DOI: 10.5281/zenodo.7715332

What's the use? Exploring academic applications of (computational) literary studies

Author(s): Edmond, Jennifer; Yakupova, Vera
Published in: Digital Humanities 2023: Book of Abstracts, 2023, Page(s) 232-233
Publisher: ADHO

Characterizing the visualization design space of distant and close reading of poetic rhythm

Author(s): Alejandro Benito-Santos; Salvador Muñoz; Roberto Therón Sánchez; Francisco J. García Peñalvo
Published in: Frontiers in Big Data, Issue 6(1), 2023, Page(s) 1167708, ISSN 2624-909X
Publisher: The Frontiers
DOI: 10.3389/fdata.2023.1167708
