바로가기메뉴

본문 바로가기 주메뉴 바로가기

logo

A Case Study on Metadata Extraction for Records Management Using ChatGPT

Journal of Korean Society of Archives and Records Management / Journal of Korean Society of Archives and Records Management, (P)1598-1487; (E)2671-7247
2024, v.24 no.2, pp.89-112
https://doi.org/10.14404/JKSARM.2024.24.2.089
Minji Kim
Sunghee Kang
Hae-young Rieh
  • Downloaded
  • Viewed

Abstract

Metadata is a crucial component of record management, playing a vital role in properly managing and understanding the record. In cases where automatic metadata assignment is not feasible, manual input by records professionals becomes necessary. This study aims to alleviate the challenges associated with manual entry by proposing a method that harnesses ChatGPT technology for extracting records management metadata elements. To employ ChatGPT technology, a Python program utilizing the LangChain library was developed. This program was designed to analyze PDF documents and extract metadata from records through questions, both with a locally installed instance of ChatGPT and the ChatGPT online service. Multiple PDF documents were subjected to this process to test the effectiveness of metadata extraction. The results revealed that while using LangChain with ChatGPT-3.5 turbo provided a secure environment, it exhibited some limitations in accurately retrieving metadata elements. Conversely, the ChatGPT-4 online service yielded relatively accurate results despite being unable to handle sensitive documents for security reasons. This exploration underscores the potential of utilizing ChatGPT technology to extract metadata in records management. With advancements in ChatGPT-related technologies, safer and more accurate results are expected to be achieved. Leveraging these advantages can significantly enhance the efficiency and productivity of tasks associated with managing records and metadata in archives.

keywords
메타데이터, 기록관리 메타데이터, 챗GPT, 랭체인, 기록정보서비스, Metadata, Records Management Metadata, ChatGPT, Langchain, Archival Information Service

Journal of Korean Society of Archives and Records Management