A Study on Recognition of Citation Metadata using Bidirectional GRU-CRF Model based on Pre-trained Language Model
Journal of the Korean Society for Information Management / Journal of the Korean Society for Information Management, (P)1013-0799; (E)2586-2073
2021, v.38 no.1, pp.221-242
https://doi.org/10.3743/KOSIM.2021.38.1.221
&
(2021). A Study on Recognition of Citation Metadata using Bidirectional GRU-CRF Model based on Pre-trained Language Model. Journal of the Korean Society for Information Management, 38(1), 221-242, https://doi.org/10.3743/KOSIM.2021.38.1.221
Abstract
This study applied reference metadata recognition using bidirectional GRU-CRF model based on pre-trained language model. The experimental group consists of 161,315 references extracted by 53,562 academic documents in PDF format collected from 40 journals published in 2018 based on rules. In order to construct an experiment set. This study was conducted to automatically extract the references from academic literature in PDF format. Through this study, the language model with the highest performance was identified, and additional experiments were conducted on the model to compare the recognition performance according to the size of the training set. Finally, the performance of each metadata was confirmed.
- keywords
-
참고문헌 메타데이터 인식,
텍스트 마이닝,
심층학습,
언어모델,
reference metadata recognition,
text mining,
deep learning,
language model
- Submission Date
- 2021-02-25
- Revised Date
- 2021-03-06
- Accepted Date
- 2021-03-17