# What a difference 5 years make:
Achievements and challenges of Distant Reading for European Literary History
**Christof Schöch and Maciej Eder** *** Distant Reading Closing Conference, April 21-22, 2022
https://distantreading.github.io/closing/ ***
::

--

### Overview

1. [What was Distant Reading?](#/2)
2. [Scholarly Resources](#/3)
3. [Methods and Tools](#/4)
4. [Literary Theory and History](#/5)
5. [Dissemination and Communication](#/6)
6. [Training and Exchange](#/7)
7. [Conclusion](#/8)

::
- What a difference 5 years make, 54 little months now...

--

## (0) What was Distant Reading?

---

### We've come a long way

* Fotis Jannidis launched the idea in 2013/2014
* We applied several times until being awarded the grant in 2017
* Five years later: 200+ researchers from 31+ countries working together

---

### Our research coordination objectives

1. To coordinate the creation of a multilingual European Literary Text Collection (ELTeC).
1. To use the ELTeC to establish best practices and develop innovative methods of Distant Reading for the multiple European literary traditions.
1. To engage in an investigation into the theoretical, methodological and practical consequences of Distant Reading approaches for literary history and literary theory.

---

### Our capacity-building objectives

5. To foster the acquisition of state-of-the-art methods of Distant Reading, including competencies relating to data curation, standards, best practices and methods of Distant Reading analysis.
1. To encourage and support the submission of competitive grant proposals both at the national and European levels.
1. To help address the current gender imbalance.

--

## (1) Scholarly Resources

---

### Our key accomplishment:
European Literary Text Collection (ELTeC)
https://distantreading.github.io/ELTeC/

---

### Key points about ELTeC

* Current status
    * 10 complete corpora (1000 novels)
    * 8 corpora in progress (378 novels)
    * 9 extension corpora (567 novels)
* Some other facts
    * All corpora: Encoded in XML-TEI
    * Many: with annotations (POS and NER)
    * Strong composition criteria for comparability
    * Next (final) release: May 2022

---

### Significance of ELTeC

* A unique, massively multilingual, collaboratively-built scholarly resource
* A rallying point for a European, multilingual community
* A benchmark corpus for the development and evaluation of cross-lingual methods
* A key building block for a history of European literature that would be truly digital, multilingual and diverse

--

## (2) Methods and Tools

---

### Key concern: Annotating / analysing ELTeC

* Levels of annotation
    * Part-of-speech annotation (standard)
    * Named Entity Recognition annotation (desirable)
    * Character vs. narrator speech
    * Sentiment Analysis
* Challenges
    * Make structural TEI markup coexist with token-level annotations
    * Ensure cross-language compatibility of pipelines and annotations

--

## (3) Literary Theory and History

---

### Several contributions

* Titling practices in ELTeC (published paper)
* The term/concept of the novel across Europe (map)
* Distant reading and style, authorship, genre, period (submitted paper)
* Inner life of characters (paper in progress)
* Analyses of prefaces using Sentiment Analysis (paper in progress)

--

## (4) Dissemination and Communication

---

### Many tangible outputs

* Successful social media presence of the Action (1244 Twitter and 500 Facebook followers)
* Informative Action portal, with "Distant Reading Recommends" series and videos
* 32 publications from the Action (several in progress)
* 15 presentations, posters, webinars about the Action (probably several more)
* A substantial number of affiliated grants at national, bi-national and international levels

--

## (5) Training and Exchange

---

### Many occasions for learning and discussion
* Jointly organized Training Schools: Würzburg, Galway, Budapest, online, Belgrade
* STSMs and VMGs: 34 exchanges lasting over 600 days in total
* Major Action Meetings: Brussels, Prague, Antwerp, Lisbon, Málaga, online 2021, online 2022

--

## Conclusion

---

### Achievements

* Designing, encoding, annotating and using ELTeC is our key tangible accomplishment
* Having helped an international, diverse, welcoming community of researchers constitute itself is just as important
* The large number of satellite projects, presentations and publications also shows the success of the Action

---

### Special thanks!

* WG leads and co-leads
    * WG1: Carolin Odebrecht, Lou Burnard, Borja Navarro Colorado
    * WG2: Mike Kestemont, Joanna Byszuk, Fotis Jannidis, George Mikros, Yaakov Hacohen Kerner
    * WG3: Antonija Primorac, Rosario Arias, Pieter Francois
    * WG4: Justin Tonra, Katja Mihurko-Poniz
* Other management roles
    * Training School coordinator: Roxana Patras
    * STSM coordinator: Catherine Kanellopoulou
    * Science communication officer: Katja Mihurko-Poniz
    * Inclusivity ambassador: Karina van Dalen-Oskam

---

### Challenges

* Diversity (of people, history, and data) as a strength, a challenge, and a learning opportunity
* How can we support further work on and with ELTeC, after the Action?
* How will we balance Distant Reading and Computational Literary Studies in the future?

---

### Thank you all
for making the Action what it is today!
https://distant-reading.net