View on GitHub

compendium

The Distant Reading Compendium: A virtual edited volume

Annotation of the Serbian ELTeC Collection

Reference

Ranka Stanković, Cvetana Krstev, Branislava Šandrih Todorović, and Mihailo Škorić. “Annotation of the Serbian ELTeC Collection”, Infotheca – Journal of Digital Humanities, 21.2 (2021), 43–59. DOI: 10.18485/infotheca.2021.21.2.3

Abstract

This paper presents the so-called level-2 edition of SrpELTeC collection developed within the activities of Working Group 2 – Methods and Tools of the COST Action CA 16204 (Distant Reading for European Literary History), and its schema specification. The level-2 edition is a follow-up of the level-1 edition, which is used as input for morphosyntactic and NER annotation of novels. The Serbian level-2 pipeline outlines steps required for production of level-2, including methods and tools used in the process. Some statistics drawn from the Serbian ELTeC level-2 sub-collection brings an interesting insight into collection content.

Keywords

ELTeC, Serbian, Novel, Corpus, Annotation, POS, NER, Named Entities

Direct Access

BibTex

@article{stankovic_annotation_2021,
     title = {Annotation of the {Serbian} {ELTeC} {Collection}},
     volume = {21},
     issn = {1450-9687, 2217-9461},
     url = {http://infoteka.bg.ac.rs/ojs/index.php/Infoteka/article/view/2021.21.2.3_en},
     doi = {10.18485/infotheca.2021.21.2.3},
     number = {2},
     urldate = {2022-02-17},
     journal = {Infotheca},
     author = {Stanković, Ranka and Krstev, Cvetana and Šandrih Todorović, Branislava and Škorić, Mihailo},
     year = {2021},
     keywords = {type_publication},
     pages = {43--59},
 }