Considerations for Applying Korean Natural Language Processing Technology in Records Management

김학래

doi:10.14404/JKSARM.2022.22.4.129

P-ISSN1598-1487
E-ISSN2671-7247

Home

OA Policy

ISSN : 1598-1487

Article Contents

Prev Next

e-Submission

Vol.22 No.4

Citation Share

Considerations for Applying Korean Natural Language Processing Technology in Records Management

Journal of Korean Society of Archives and Records Management / Journal of Korean Society of Archives and Records Management, (P)1598-1487; (E)2671-7247

2022, v.22 no.4, pp.129-149

https://doi.org/10.14404/JKSARM.2022.22.4.129

(2022). Considerations for Applying Korean Natural Language Processing Technology in Records Management. Journal of Korean Society of Archives and Records Management, 22(4), 129-149, https://doi.org/10.14404/JKSARM.2022.22.4.129

copy

Abstract

Records have temporal characteristics, including the past and present; linguistic characteristics not limited to a specific language; and various types categorized in a complex way. Processing records such as text, video, and audio in the life cycle of records’ creation, preservation, and utilization entails exhaustive effort and cost. Primary natural language processing (NLP) technologies, such as machine translation, document summarization, named-entity recognition, and image recognition, can be widely applied to electronic records and analog digitization. In particular, Korean deep learning–based NLP technologies effectively recognize various record types and generate record management metadata. This paper provides an overview of Korean NLP technologies and discusses considerations for applying NLP technology in records management. The process of using NLP technologies, such as machine translation and optical character recognition for digital conversion of records, is introduced as an example implemented in the Python environment. In contrast, a plan to improve environmental factors and record digitization guidelines for applying NLP technology in the records management field is proposed for utilizing NLP technology.

keywords: 기록관리, 자연어 처리, 인공지능, 머신러닝, 딥러닝, Records management, Natural language processing, Artificial intelligence, Machine learning, Deep learning

바로가기메뉴

Article Contents

Vol.22 No.4

Considerations for Applying Korean Natural Language Processing Technology in Records Management

Abstract

Journal of Korean Society of Archives and Records Management