As well as the following summary statistics, this page provides links to human-readable versions of each text currently included in the European Literary Text Collection (ELTeC). Click on a language code in the table below to see a list of texts now available in that language. Then click on the identifier of a text to see a simple rendering of the text as produced by CETEIcean. The original source files are stored in a GitHub repository at COST-ELTeC, and may be downloaded freely from there.
The E5C column gives the conformance score calculated for each repository; the score is displayed in green if conformance is high. The other columns give counts for each of the four balance criteria (authorship, length, time slot, and reprint count), with numbers in red indicating that the criterion is not satisfied. Hovering over the last figure in each group of columns displays the E5C score calculated for that criterion.
This remains a work in progress! Comments and reports of any problems are much appreciated: send them to the WG1 Issue Tracker.
| Language | Last update | Texts | Words | Authorship: Male | Authorship: Female | Authorship: 1-title | Authorship: 3-title | Length: Short | Length: Medium | Length: Long | Time slot: 1840-59 | Time slot: 1860-79 | Time slot: 1880-99 | Time slot: 1900-20 | Time slot: range | Reprint count: Frequent | Reprint count: Rare | E5C |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| cze | 2020-11-16 | 16 | 366626 | 14 | 2 | 12 | 0 | 16 | 0 | 0 | 5 | 6 | 5 | 0 | 6 | 0 | 15 | 33.85 |
| deu | 2020-12-30 | 98 | 12086096 | 65 | 33 | 36 | 8 | 20 | 37 | 41 | 24 | 24 | 25 | 25 | 1 | 46 | 46 | 93.85 |
| eng | 2021-02-14 | 100 | 12227703 | 49 | 51 | 70 | 10 | 27 | 27 | 46 | 21 | 22 | 31 | 26 | 10 | 32 | 68 | 100.00 |
| fra | 2021-02-17 | 100 | 8712219 | 66 | 34 | 58 | 10 | 32 | 38 | 30 | 25 | 25 | 25 | 25 | 0 | 44 | 56 | 101.54 |
| hun | 2020-11-15 | 100 | 6948590 | 79 | 21 | 71 | 9 | 47 | 31 | 22 | 22 | 21 | 27 | 30 | 9 | 32 | 67 | 100.00 |
| ita | 2019-11-21 | 34 | 3328244 | 32 | 2 | 19 | 3 | 13 | 10 | 11 | 5 | 12 | 10 | 7 | 7 | 12 | 0 | 55.97 |
| lav | 2020-12-20 | 2 | 106045 | 2 | 0 | 2 | 0 | 0 | 2 | 0 | 0 | 0 | 1 | 1 | 1 | 0 | 1 | 21.54 |
| lit | 2020-08-20 | 25 | 636132 | 18 | 7 | 16 | 1 | 19 | 3 | 2 | 5 | 3 | 3 | 14 | 11 | 6 | 18 | 55.38 |
| nor | 2020-11-17 | 50 | 3195845 | 36 | 14 | 20 | 8 | 25 | 17 | 8 | 5 | 2 | 28 | 15 | 26 | 30 | 20 | 71.54 |
| pol | 2020-12-17 | 100 | 8500172 | 58 | 42 | 1 | 33 | 33 | 35 | 32 | 8 | 11 | 35 | 46 | 38 | 39 | 61 | 80.00 |
| por | 2021-02-25 | 100 | 6799385 | 83 | 17 | 73 | 9 | 40 | 41 | 19 | 13 | 37 | 19 | 31 | 24 | 26 | 60 | 94.62 |
| rom | 2020-11-15 | 80 | 4905678 | 65 | 11 | 43 | 7 | 35 | 29 | 16 | 4 | 14 | 23 | 39 | 35 | 24 | 56 | 83.08 |
| slv | 2020-11-15 | 100 | 5682120 | 89 | 11 | 26 | 5 | 53 | 39 | 8 | 2 | 13 | 36 | 49 | 47 | 48 | 52 | 78.46 |
| spa | 2020-11-15 | 81 | 6874582 | 65 | 16 | 42 | 5 | 30 | 27 | 24 | 16 | 15 | 25 | 25 | 10 | 42 | 39 | 90.77 |
| srp | 2021-02-19 | 85 | 3700668 | 77 | 8 | 32 | 11 | 53 | 31 | 1 | 2 | 12 | 35 | 36 | 34 | 30 | 46 | 72.94 |
| swe | 2020-11-15 | 58 | 4960085 | 29 | 28 | 18 | 8 | 16 | 24 | 18 | 15 | 3 | 20 | 20 | 17 | 17 | 41 | 76.92 |
| ukr | 2021-02-24 | 47 | 1640847 | 34 | 13 | 22 | 7 | 33 | 12 | 2 | 5 | 10 | 10 | 22 | 17 | 29 | 18 | 66.15 |
Summary produced: 2021-02-26
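
The page does not define the "range" figure in the time slot group. In every row of the table above, however, it equals the largest of the four time slot counts minus the smallest. The following minimal Python sketch checks that reading against three rows copied from the table. The function name `time_slot_range` and the `ROWS` dictionary are illustrative only, and the max-minus-min definition is an assumption inferred from the published figures, not a documented ELTeC formula.

```python
# Time slot counts (1840-59, 1860-79, 1880-99, 1900-20) copied from three
# rows of the table, paired with the value shown in the "range" column.
ROWS = {
    "deu": ([24, 24, 25, 25], 1),
    "eng": ([21, 22, 31, 26], 10),
    "fra": ([25, 25, 25, 25], 0),
}


def time_slot_range(counts):
    """Return the largest count minus the smallest (assumed definition)."""
    return max(counts) - min(counts)


for lang, (counts, published) in ROWS.items():
    computed = time_slot_range(counts)
    # Print the language code, the computed value, and whether it matches
    # the figure published in the table.
    print(lang, computed, computed == published)
```

On this reading, a range of 0 (as for fra) means the texts are spread evenly across the four periods, while a large value (as for slv, with counts of 2, 13, 36 and 49) means the collection is concentrated in some periods.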