Authorship Verification of the Disputed Pauline Letters through Deep Learning

Forfattere

  • Evy Beijen Department of Mathematics and Department of Texts and Traditions, Vrije Universiteit Amsterdam
  • Rianne de Heide University of Twente

DOI:

https://doi.org/10.7146/hn.v10i1.147482

Nøgleord:

Pauline epistles, authorship attribution, deep learning, text classification

Resumé

In the Christian tradition, fourteen letters of the New Testament have been attributed to the Apostle Paul. However, for seven of these letters—1 and 2 Timothy, Titus, Ephesians, Colossians, 2 Thessalonians, and Hebrews —the attribution to Paul has been the subject of scholarly debate. This study aims to develop a bidirectional long short-term memory (BiLSTM) network to classify chunks of these disputed letters, each approximately 100 words long, as either authored by Paul or not. Two model variants—a plaintext variant and a lemmatized text variant—were trained on undisputed Pauline letters and ‘impostor letters’ which serve as negative examples of Paul’s writing. The plaintext variant achieved 84% accuracy and the lemmatized text variant 83% accuracy. Both variants classify the majority of text chunks from Colossians and 2 Thessalonians as Pauline and the majority of chunks from Hebrews and 1 Timothy as non-Pauline, although caution is warranted in drawing strong conclusions from these results. For the remaining disputed Pauline letters—Titus, 2 Timothy, and Ephesians—the majority classification varies between the model variants, further emphasizing the need for caution. Nevertheless, this study introduces a deep learning approach to the authorship verification problem of the disputed Pauline letters, potentially serving as a model for future research.

Referencer

Abadi, Martín, et al. 2015. TensorFlow: Large-Scale Machine Learning on Heterogeneous Systems. Acces-sed May 8, 2024. https://www.tensorflow.org/.

Chollet, François, et al. 2015. Keras. Accessed May 8, 2024. https://keras.io.

Ehrman, Bart D. 2020. The New Testament: A Historical Introduction to the Early Christian Writings. Ox-ford: Oxford University Press.

El Houda Dehimi, Nour, and Zakaria Tolba. 2024. “Attention Mechanisms in Deep Learning: Towards Explainable Artificial Intelligence.” In 2024 6th International Conference on Pattern Analysis and In-telligent Systems (PAIS), 1–7. https://doi.org/10.1109/PAIS62114.2024.10541203.

Gasparetto, Andrea, Matteo Marcuzzo, Alessandro Zangari, and Andrea Albarelli. 2022. “A Survey on Text Classification Algorithms: From Text to Predictions.” Information 13 (2): 83. https://doi.org/10.3390/info13020083.

Graves, Alex, and Jürgen Schmidhuber. 2005. “Framewise Phoneme Classification with Bidirectional LSTM and Other Neural Network Architectures.” Neural Networks 18 (5): 602–610. https://doi.org/10.1016/j.neunet.2005.06.042.

Hochreiter, Sepp, and Jürgen Schmidhuber. 1997. “Long Short-Term Memory.” Neural Computation 9 (8): 1735–80. https://doi.org/10.1162/neco.1997.9.8.1735.

Johnson, Kyle P., Patrick J. Burns, John Stewart, Todd Cook, Clément Besnier, and William J. B. Matting-ly. 2021. “The Classical Language Toolkit: An NLP Framework for Pre-Modern Languages.” In Pro-ceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th In-ternational Joint Conference on Natural Language Processing: System Demonstrations, 20–29. Asso-ciation for Computational Linguistics. https://doi.org/10.18653/v1/2021.acl-demo.3.

Kingma, Diederik P., and Jimmy Ba. 2017. “Adam: A Method for Stochastic Optimization.” arXiv:1412.6980.

Koppel, Moshe, and Yaron Winter. 2014. “Determining If Two Documents Are Written by the Same Au-thor.” Journal of the Association for Information Science and Technology 65 (1): 178–87.

https://doi.org/10.1002/asi.22954.

Lake, Kirsopp, ed. 1913. The Apostolic Fathers. Christian Classics Ethereal Library. Accessed March 2, 2024. https://www.ccel.org/l/lake/fathers/toc.htm.

Martinez-Cantin, Ruben. 2014. “BayesOpt: A Bayesian Optimization Library for Nonlinear Optimization, Experimental Design, and Bandits.” Journal of Machine Learning Research 15 (1): 3735–39.

Mealand, David L. 1989. “Positional Stylometry Reassessed: Testing a Seven Epistle Theory of Pauline Authorship.” New Testament Studies 35 (2): 266–86. https://doi.org/10.1017/S0028688500024656.

Mealand, David L. 1995. “The Extent of the Pauline Corpus: A Multivariate Approach.” Journal for the Study of the New Testament 18: 61–92. https://doi.org/10.1177/0142064X9601805904.

Metzger, Bruce M., and Bart D. Ehrman. 2005. The Text of the New Testament: Its Transmission, Corrup-tion, and Restoration. Oxford: Oxford University Press.

Morton, A. Q. 1965. The Authorship of the Pauline Epistles: A Scientific Solution. Saskatoon: University of Saskatchewan.

Morton, A.Q., and A.D. Winspear. 1971. It’s Greek to the Computer. Montreal: Harvest House.

Perry, Jordan. 2021. “Examining the Authenticity of Plato’s Epistle VII through Deep Learning.” Bache-lor's thesis, Harvard University.

Roy, Ashley, and Paul Robertson. 2023. “Applying Cosine Similarity to Paul’s Letters: Mathematically Modeling Formal and Stylistic Similarities.” In New Approaches to Textual and Image Analysis in Early Jewish and Christian Studies, edited by Claire Clivaz and Ken M. Penner, 88–117. Leiden: Brill.

Savoy, Jacques. 2019. “Authorship of Pauline Epistles Revisited.” Journal of the Association for Infor-mation Science and Technology 70 (10): 1089–97. https://doi.org/10.1002/asi.24176.

Sari, Yunita. 2018. “Neural and Non-neural Approaches to Authorship Attribution.” PhD diss., University of Sheffield.

Society of Biblical Literature (SBL). 2023. SBL Greek New Testament. GitHub. Version 1.2. Accessed Feb-ruary 24, 2024. https://github.com/LogosBible/SBLGNT.

Snoek, Jasper, Hugo Larochelle, and Ryan P. Adams. 2012. “Practical Bayesian Optimization of Machine Learning Algorithms.” In Advances in Neural Information Processing Systems 25, edited by F. Perei-ra, C. J. Burges, L. Bottou, and K. Q. Weinberger. Curran Associates, Inc.

Srivastava, Nitish, Geoffrey Hinton, Alex Krizhevsky, Ilya Sutskever, and Ruslan Salakhutdinov. 2014. “Dropout: A Simple Way to Prevent Neural Networks from Overfitting.” Journal of Machine Learn-ing Research 15 (56): 1929–58.

Van der Ventel, Brandon I.S., and Richard T. Newman. 2022. “Application of the Term Frequency-Inverse Document Frequency Weighting Scheme to the Pauline Corpus.” Andrews University Seminary Stud-ies 59 (2): 251–71.

Xu, Feiyu, Hans Uszkoreit, Yangzhou Du, Wei Fan, Dongyan Zhao, and Jun Zhu. 2019. “Explainable AI: A Brief Survey on History, Research Areas, Approaches and Challenges.” In Natural Language Pro-cessing and Chinese Computing (NLPCC 2019), edited by Jie Tang, Min-Yen Kan, Dongyan Zhao, Sujian Li, and Hongying Zan, 563–74. Cham: Springer. https://doi.org/10.1007/978-3-030-32236-6_51.

Yu, Yong, Xiaosheng Si, Changhua Hu, and Jianxun Zhang. 2019. “A Review of Recurrent Neural Net-works: LSTM Cells and Network Architectures.” In Neural Computation, 31 (7): 1235–70. https://doi.org/10.1162/neco_a_01199.

Downloads

Publiceret

2025-06-06

Citation/Eksport

Beijen, E., & de Heide, R. (2025). Authorship Verification of the Disputed Pauline Letters through Deep Learning. HIPHIL Novum, 10(1), 22–39. https://doi.org/10.7146/hn.v10i1.147482

Nummer

Sektion

Articles