바로가기메뉴

본문 바로가기 주메뉴 바로가기

logo

  • P-ISSN1013-0799
  • E-ISSN2586-2073
  • KCI

A Study on Recognition of Citation Metadata using Bidirectional GRU-CRF Model based on Pre-trained Language Model

Journal of the Korean Society for Information Management / Journal of the Korean Society for Information Management, (P)1013-0799; (E)2586-2073
2021, v.38 no.1, pp.221-242
https://doi.org/10.3743/KOSIM.2021.38.1.221


Abstract

This study applied reference metadata recognition using bidirectional GRU-CRF model based on pre-trained language model. The experimental group consists of 161,315 references extracted by 53,562 academic documents in PDF format collected from 40 journals published in 2018 based on rules. In order to construct an experiment set. This study was conducted to automatically extract the references from academic literature in PDF format. Through this study, the language model with the highest performance was identified, and additional experiments were conducted on the model to compare the recognition performance according to the size of the training set. Finally, the performance of each metadata was confirmed.

keywords
참고문헌 메타데이터 인식, 텍스트 마이닝, 심층학습, 언어모델, reference metadata recognition, text mining, deep learning, language model
Submission Date
2021-02-25
Revised Date
2021-03-06
Accepted Date
2021-03-17

Journal of the Korean Society for Information Management