Keywords Extraction on the Topic "Образование/Education"
Bashmakova Anastasiia Yurievna
University of Tyumen
Submitted: 26.10.2021
Abstract. The paper aims to identify the features of keywords in the thematic field "Образование/Education" in Russian and English. It describes the stages of automated parsing of news articles from the educational online portals "EDU-Inform" and "Education Today Magazine" to build a corpus. The study also pays particular attention to the linguistic analysis of the extracted keywords. The scientific originality of the research lies in its interdisciplinary approach to the study of keywords and in its use of programming tools for automated natural language text processing. As a result, the research presents a word-cloud visualization of the thematic field "Образование/Education" for Russian and English.
Key words and phrases: computational linguistics, keyword extraction, education, BERT, word cloud
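The final step named in the abstract, turning extracted keywords into word-cloud weights, can be illustrated with a minimal sketch. It is not the author's pipeline: it swaps in plain term-frequency counting for the BERT-based extraction listed among the keywords, and the function name `keyword_weights`, the short stopword list, and the sample sentences are all hypothetical, invented for illustration rather than taken from the parsed corpus.

```python
import re
from collections import Counter

# Tiny illustrative stopword list (an assumption; a real study would use a fuller one).
STOPWORDS = {"the", "a", "an", "of", "and", "in", "to", "for", "is", "are", "on", "with"}


def keyword_weights(texts, top_n=5):
    """Return the top_n most frequent non-stopword tokens with relative weights."""
    counts = Counter()
    for text in texts:
        # Lowercase the text and keep alphabetic tokens (Latin and Cyrillic).
        tokens = re.findall(r"[a-zа-яё]+", text.lower())
        counts.update(t for t in tokens if t not in STOPWORDS and len(t) > 2)
    if not counts:
        return []
    top = counts.most_common(top_n)
    max_count = top[0][1]
    # Weights fall in (0, 1]; the most frequent term gets 1.0.
    # A word-cloud renderer would typically map such weights to font size.
    return [(word, count / max_count) for word, count in top]


# Made-up sample sentences, not data from the study.
sample_corpus = [
    "Schools and teachers adapt to online education.",
    "Education policy affects teachers and students.",
    "Students value education in schools.",
]
weights = keyword_weights(sample_corpus, top_n=3)
```

Frequency counting is used here only because it runs without external dependencies; an embedding-based extractor would rank candidate terms by semantic similarity to the document instead of by raw counts.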