Research Trends in Record Management Using Unstructured Text Data Analysis

홍덕용; 허준석

doi:10.14404/JKSARM.2023.23.4.073

Abstract

This study uses text mining to analyze the frequency of keywords in Korean-language abstracts, which constitute unstructured text data in the domestic record management research field, and identifies domestic research trends through distance analysis between keywords. To this end, 1,157 articles were extracted from 7 journal types (28 types) retrieved by major category (interdisciplinary studies) and middle category (library and information science) from the institutional statistics (registered journals, candidate journals) of the Korean Citation Index (KCI), and 77,578 keywords from these articles were visualized. Word2vec was used to build keyword vectors, which were then analyzed with t-Distributed Stochastic Neighbor Embedding (t-SNE) and Scattertext. The analysis yielded two findings. First, keywords such as “record management” (889 times), “analysis” (888 times), “archive” (742 times), “record” (562 times), and “utilization” (449 times) were confirmed to be treated as significant topics by researchers. Second, Word2vec analysis generated vector representations of the keywords, and the similarity distances between them were examined and visualized using t-SNE and Scattertext. In the visualization, the record management research area divided into two groups. In the first group (past), keywords such as “archiving,” “national record management,” “standardization,” “official documents,” and “record management systems” occurred frequently. In the second group (current), keywords such as “community,” “data,” “record information service,” “online,” and “digital archives” were receiving substantial attention.

keywords: Research Trends in Record Management, Big Data, Text Mining, t-SNE, Scattertext
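
The two analysis steps named in the abstract, keyword frequency counting and similarity between keyword vectors, can be illustrated with a minimal sketch. This is not the authors' pipeline: the toy abstracts, the stopword list, and all function names below are hypothetical, and simple document-level co-occurrence vectors stand in for Word2vec embeddings so that the example runs with the Python standard library alone.

```python
import math
import re
from collections import Counter

# Hypothetical toy abstracts (assumption); not data from the study.
ABSTRACTS = {
    "past": [
        "record management systems and standardization of official documents",
        "national record management and archiving of official documents",
    ],
    "current": [
        "digital archives and online record information service for the community",
        "community data and digital archives for record information service",
    ],
}

# Minimal illustrative stopword list (assumption).
STOPWORDS = {"and", "of", "the", "for"}


def tokenize(text):
    """Lowercase the text and keep alphabetic tokens that are not stopwords."""
    return [t for t in re.findall(r"[a-z]+", text.lower()) if t not in STOPWORDS]


def keyword_frequencies(docs):
    """Count how often each keyword occurs across all documents."""
    counts = Counter()
    for doc in docs:
        counts.update(tokenize(doc))
    return counts


def cooccurrence_vectors(docs):
    """Map each keyword to counts of the keywords sharing a document with it.

    This is a simple stand-in for Word2vec vectors, not Word2vec itself.
    """
    vectors = {}
    for doc in docs:
        tokens = set(tokenize(doc))
        for token in tokens:
            vectors.setdefault(token, Counter()).update(tokens - {token})
    return vectors


def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[k] * b[k] for k in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


if __name__ == "__main__":
    all_docs = ABSTRACTS["past"] + ABSTRACTS["current"]
    print(keyword_frequencies(all_docs).most_common(5))
    vectors = cooccurrence_vectors(all_docs)
    print(cosine(vectors["official"], vectors["documents"]))
    print(cosine(vectors["official"], vectors["digital"]))
```

In this toy corpus, "official" sits closer to "documents" than to "digital", which mirrors the kind of past/current grouping the abstract reports. A real replication would presumably swap in trained Word2vec vectors and project them with t-SNE before plotting.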

Vol.23 No.4

Journal of Korean Society of Archives and Records Management