A Critical Examination of the Intertextual Phrase Matching Module in the Thesaurus Linguae Graecae and Its Relevance for Biblical and Patristic Studies
DOI:
https://doi.org/10.7146/hn.v5i2.142731Keywords:
Thesaurus Linguae Graecae, TLG, N-grams, Intertextual Phrase Matching, Patristic Quotations, Gospel According to John, Gospel According to Matthew, PlutarchAbstract
The rich availability of ancient Greek texts in the Thesaurus Linguae Graecae (TLG) has opened up new types of research for Biblical and Patristic scholars. A very helpful feature on the TLG’s website is the option to trace quotations with the help of an n-grams module (Intertextual Phrase Matching). However, it is virtually unknown how well this module performs and what scholars might expect from the results it produces. The core of this article, therefore, is devoted to a critical examination of the algorithm and of its results. The gospel according to John has been compared with the Paedagogus of Clement of Alexandria, with the gospel according to Matthew, and with the complete works of Plutarch. As it turns out, the algorithm performs well in cases of longer quotations with no or very few interpolations. Short quotations, however, are missed while interpolated or adapted quotations are poorly handled. It is suggested that the algorithm might perform better if the team of the TLG were to revisit its decision to ignore stopwords and if the algorithm were to allow for foreign words in its n-grams. Finally, it is advised that more transparency in the algorithm’s mechanisms and a possibility for manually tuning its parameters might improve its applicability.
Downloads
Published
How to Cite
Issue
Section
License
Counting from volume 9 (2024), articles published in HIPHIL Novum are licensed under Attribution-ShareAlike 4.0 International (CC BY-SA 4.0). The editorial board may accept other Creative Commons licenses for individual articles, if required by funding bodies e.g. the European Research Council. With the publication of volume 9, authors retain copyright to their articles and give Hiphil Novum the right to the first publication. The authors retain copyright to earlier versions of the articles, such as the submitted and the accepted manuscript. Authors and readers may use, reuse, and build upon the published work, use it for text or data mining or for any other lawful purpose, as long as appropriate attribution is maintained.
Articles in volumes 1-8 are not licensed under Creative Commons. In these volumes, all rights are reserved to the authors of the articles respectively. This implies that readers can download, read, and link to the articles, but they cannot republish the articles. Authors may post the published version of their article to their personal website, institutional repository, or a repository required by their funding agency as a part of a green open access policy.