Method for Enhancing the Quality of Large-Scale Lexical Resources  Using Semantic Inference Rules:  Focusing on the National Library of Korea Subject Headings

Jeong (Do-Heon; 정도헌

doi:10.16981/kliss.55.4.202412.245

ACOMS+ 및 학술지 리포지터리 설명회

한국과학기술정보연구원(KISTI) 서울분원 대회의실(별관 3층)
2024년 07월 03일(수) 13:30

사전등록 바로가기

오늘 하루 그만보기

P-ISSN2466-2542
KCI

홈으로

OA 정책

ISSN : 2466-2542

논문 상세

이전 다음

논문 투고

Vol.55 No.4

Citation Share

시맨틱 추론 규칙을 이용한 대규모 언어 자원의 품질 고도화 방안: 국립중앙도서관 주제명표목표를 중심으로

Method for Enhancing the Quality of Large-Scale Lexical Resources Using Semantic Inference Rules: Focusing on the National Library of Korea Subject Headings

한국도서관·정보학회지 / Journal of Korean Library and Information Science Society, (P)2466-2542;

2024, v.55 no.4, pp.245-263

https://doi.org/10.16981/kliss.55.4.202412.245

정도헌((Do-Heon Jeong) (덕성여자대학교)

정도헌. (2024). 시맨틱 추론 규칙을 이용한 대규모 언어 자원의 품질 고도화 방안: 국립중앙도서관 주제명표목표를 중심으로. 한국도서관·정보학회지, 55(4), 245-263, https://doi.org/10.16981/kliss.55.4.202412.245

복사

초록

본 연구는 대규모 언어 자원에 관한 연구 동향을 바탕으로, 자동화된 시맨틱 추론 기법을 이용한 거대 언어 자원의 효율적 품질 제고 방안을 제시하고 응용 가능성을 제안하고자 한다. 이를 위해, 우선 언어 자원 내 용어 간의 다양한 관계를 분석하여 도출한 정오 사례를 바탕으로 공통의 시맨틱 추론 규칙을 정의하였다. 정의된 추론 규칙을 바탕으로 거대한 언어 자원의 네트워크를 고속 탐색하고 오류를 검출하는 스프레딩 알고리즘 기반의 시맨틱 추론 엔진을 개발하였다. 국립중앙도서관의 주제명표목표에 대한 실험을 통해, 본 연구에서 제안한 자동화 기법과 웹 기반 관리 시스템을 활용하여 대용량 데이터의 품질 고도화 작업을 효율적으로 수행할 수 있음을 확인하였다. 본 연구는 대규모 언어 자원의 품질 고도화를 위한 시맨틱 추론 기법을 새롭게 제안한 점, 복잡한 용어 네트워크에서 발생하는 논리적 오류 사례를 분석하고 체계화하는 최초의 시도였다는 점에서 의의가 있으며, 일련의 과정을 통해 인공 지능 시대의 인간과 기계의 협업 방식을 논의하였다는 데 의의가 있다.

keywords: 언어 자원, 대용량 데이터, 시맨틱 추론, 인공 지능, 주제명표목표, 인간 기계 협업

Abstract

The purpose of this study is to propose an efficient quality enhancement method for large-scale lexical resources using automated semantic inference techniques, based on current research trends in large lexical resources, and to suggest practical applications. To achieve this, common semantic inference rules were first defined by analyzing various relational cases among terms within lexical resources and identifying correct and erroneous patterns. Using these defined inference rules, a semantic inference engine based on a spreading algorithm was developed, enabling rapid network traversal and error detection across very large lexical resources. Through experiments on the Subject Headings of the National Library of Korea, it was confirmed that the automated methods and web-based management system proposed in this study enable effective quality enhancement of large-scale data. The study is significant in that it proposes a novel semantic inference approach for enhancing the quality of large-scale lexical resources, as well as the first attempt to analyze and organize logical error cases arising within complex term networks. Furthermore, it is meaningful in discussing methods of human-machine collaboration in the era of artificial intelligence.

keywords: Lexical Resources, Large-Scale Data, Semantic Inference, Artificial Intelligence, Subject Headings, Human Machine Collaboration

투고일Submission Date: 2024-11-23

수정일Revised Date

게재확정일Accepted Date: 2024-12-08

바로가기메뉴

논문 상세

Vol.55 No.4

시맨틱 추론 규칙을 이용한 대규모 언어 자원의 품질 고도화 방안: 국립중앙도서관 주제명표목표를 중심으로

Method for Enhancing the Quality of Large-Scale Lexical Resources Using Semantic Inference Rules: Focusing on the National Library of Korea Subject Headings

초록

Abstract

한국도서관·정보학회지