E-ISSN : 3058-311X
This paper describes how the RAWDATA of Korean Modern and Contemporary Magazine Materials was acquired from the National Institute of Korean History (NIKH) and examines its data schema. To acquire the RAWDATA, a request for public data provision was submitted through the Public Data Portal, and a request for data provision was submitted through Docu24. The RAWDATA was acquired as of March 27, 2024. It largely follows the NIKH standard XML schema (history.dtd). <Level1> holds magazine information, <Level2> holds volume information, and <Level3> holds individual article information. The body of each article is divided into <paragraph> units. Contextual elements include index (object name), emph (emphasis), pTitle (title), name (author name), illustration (figure), and tableGroup (table). These elements appear only in records that include body text, because not all magazines currently provide body text. The RAWDATA can be used to analyze modern literary language and the social networks of modern literature. It is also expected to serve other purposes, such as foundation data for morpheme analysis tools for modern literature and for translation into modern Korean. We hope that the RAWDATA of Korean Modern and Contemporary Magazine Materials will become even richer through the collective intelligence of literary scholars.
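The three-level hierarchy described above can be illustrated with a minimal Python sketch. The element names Level1, Level2, Level3, paragraph, index, emph, and name come from the abstract; the exact nesting, the `title` child elements, the sample text, and the helper `extract_articles` are hypothetical and are not taken from history.dtd or from the actual RAWDATA.

```python
import xml.etree.ElementTree as ET

# Hypothetical sample record. Only the element names Level1/Level2/Level3,
# paragraph, index, emph, and name are taken from the abstract; the nesting,
# the <title> children, and all text content are illustrative assumptions.
SAMPLE = """
<Level1>
  <title>Sample Magazine</title>
  <Level2>
    <title>Vol. 1</title>
    <Level3>
      <title>Sample Article</title>
      <name>Author A</name>
      <paragraph>First paragraph mentions <index>Seoul</index> here.</paragraph>
      <paragraph>Second paragraph with <emph>emphasis</emph>.</paragraph>
    </Level3>
    <Level3>
      <title>Article Without Body</title>
    </Level3>
  </Level2>
</Level1>
"""


def extract_articles(xml_text):
    """Flatten magazine > volume > article into one dict per article."""
    root = ET.fromstring(xml_text)  # root is assumed to be a Level1 element
    articles = []
    for volume in root.iter("Level2"):
        for article in volume.iter("Level3"):
            # itertext() joins the text around inline elements such as
            # <index> and <emph>, giving the plain paragraph text.
            paragraphs = [
                "".join(p.itertext()).strip()
                for p in article.iter("paragraph")
            ]
            articles.append({
                "magazine": root.findtext("title"),
                "volume": volume.findtext("title"),
                "article": article.findtext("title"),
                "author": article.findtext("name"),  # None if absent
                "paragraphs": paragraphs,            # empty if no body text
                "index_terms": [i.text for i in article.iter("index")],
            })
    return articles


if __name__ == "__main__":
    for record in extract_articles(SAMPLE):
        print(record["article"], len(record["paragraphs"]), record["index_terms"])
```

In this sketch, articles without body text yield an empty paragraph list and no index terms, which mirrors the abstract's note that contextual elements exist only where body text is provided.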