Literary mystifications and the authorial use of numerals
Zenkov Andrei Viacheslavovich
Ural Federal University
Submitted: 21.09.2023
Abstract. This study pertains to stylometry. Writers who have achieved fame sometimes begin, for various reasons, to publish under a different name, attempt to write in a different manner, and occasionally achieve success again in the new guise. The aim of the study is to test whether an author can intentionally make significant changes to his or her literary style. Numerals present in the texts of a particular author are used as a style marker. Examples from English, French and Russian literature demonstrate that the use of numerals is a literary ‘fingerprint’ that manifests itself in all or most sufficiently long texts by that author. The results show that, despite an author’s attempts to write in a ‘new’ way, the usage of numerals is conservative and allows fictitious authorship to be recognised. This conclusion is drawn from an analysis of works by R. Gary and B. Akunin (G. Chkhartishvili), who are known for their literary hoaxes. The analysis of numeral usage is also applied to the question of the authorship of Harper Lee’s novel ‘To Kill a Mockingbird’. Conclusions about the similarity or difference of literary styles are based on hierarchical cluster analysis and are supported by the Pearson chi-squared test. The scientific originality of the paper lies in a new approach to the search for a literary ‘fingerprint’ and to text attribution.
Key words and phrases: stylometry, quantitative linguistics, text attribution, numerals in the text
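The comparison described in the abstract, counting numerals in each text and testing two texts for homogeneity with the Pearson chi-squared test, can be illustrated with a minimal Python sketch. It is not the author's implementation. The tiny lexicon `WORD_TO_DIGIT`, the choice of the leading digit as the counted feature, and the function names `leading_digit_counts` and `chi_squared_homogeneity` are all assumptions made for illustration. Compound numerals such as "twenty-one" and numerals in languages other than English are not handled.

```python
import re
from collections import Counter

# Hypothetical, deliberately tiny lexicon (an assumption, not the paper's list):
# English number words mapped to their leading digit.
WORD_TO_DIGIT = {
    "one": 1, "two": 2, "three": 3, "four": 4, "five": 5,
    "six": 6, "seven": 7, "eight": 8, "nine": 9,
}

# Tokens are either alphabetic words or runs of ASCII digits.
TOKEN = re.compile(r"[A-Za-z]+|[0-9]+")


def leading_digit_counts(text):
    """Return 9 counts: how often each digit 1..9 leads a numeral in text."""
    counts = Counter()
    for token in TOKEN.findall(text.lower()):
        if token.isdigit():
            stripped = token.lstrip("0")  # ignore leading zeros
            if stripped:
                counts[int(stripped[0])] += 1
        elif token in WORD_TO_DIGIT:
            counts[WORD_TO_DIGIT[token]] += 1
    return [counts[d] for d in range(1, 10)]


def chi_squared_homogeneity(a, b):
    """Pearson chi-squared statistic and degrees of freedom for two count vectors."""
    total_a, total_b = sum(a), sum(b)
    if total_a == 0 or total_b == 0:
        raise ValueError("each text must contain at least one numeral")
    grand = total_a + total_b
    stat, used = 0.0, 0
    for x, y in zip(a, b):
        col = x + y
        if col == 0:
            continue  # category absent from both texts
        used += 1
        exp_a = total_a * col / grand  # expected count if both texts share one distribution
        exp_b = total_b * col / grand
        stat += (x - exp_a) ** 2 / exp_a + (y - exp_b) ** 2 / exp_b
    return stat, max(used - 1, 0)


if __name__ == "__main__":
    first = leading_digit_counts("In 1923 he bought two horses and 3 dogs; one died.")
    second = leading_digit_counts("She was 45, had five sons and lived at number 57.")
    print(chi_squared_homogeneity(first, second))
```

When all nine digits occur, the test has 8 degrees of freedom, and a statistic above roughly 15.51 rejects homogeneity at the 5% level. Real texts need far more numerals than these toy sentences for the test to be meaningful. The pairwise statistics could also serve as a dissimilarity measure for hierarchical clustering, which this sketch does not attempt.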