This study investigates the characteristics of human-made and machine-made indexes, and the differences between them in term identification, in a full-text environment. A back-of-book index and two indexes produced by two term identifiers (LinkIt and Termer), used as pseudo-indexing systems over the full text of a monograph, are examined. The investigation confirms that the traditional contrast between manual and automatic indexing holds in a full-text environment: a manual index serves browsing and human use, whereas an automatic index serves searching and machine use. The border between them, however, is becoming vague. Considerations for using the term identifiers for browsing and for searching are discussed, and further research on their use is suggested.

나민경(연세대학교 대학도서관발전연구소) ; 이지연(연세대학교 문헌정보학과) 2024, Vol.41, No.2, pp.19-46 https://doi.org/10.3743/KOSIM.2024.41.2.019

This study combined a literature review with exploratory research to identify the characteristics of video information, whose use has grown in recent years, and on that basis to determine the basic competencies needed to make use of video information. The literature review examined, from several perspectives, the characteristics that distinguish video from other types of information. We then conducted one-on-one, in-depth, semi-structured interviews about video use experiences with 16 participants ranging in age from their teens to their 50s. The interview content was organized by category into a codebook and subjected to content analysis. The characteristics of video information identified from the literature review and the interview analysis were divided into properties of video information and characteristics of video information use. Based on these characteristics, the study proposes the basic competencies required for video information literacy.


As the opening and provision of open government data have expanded, data that public libraries produce in the course of their work, such as bibliographic data and loan histories, are being released as library open government data. This paper diagnoses the quality of library open government data and, based on the results, proposes measures to improve it. First, it surveys research on open government data in library and information science. Next, it diagnoses the completeness and accuracy of library open government data obtained through the open API of 도서관 정보나루, an open platform for library open government data. Finally, it derives improvement measures from the diagnosis. The completeness diagnosis found many blanks in bibliographic elements essential for identifying and searching for a book. The accuracy diagnosis found inaccurate bibliographic elements that did not conform to the expected value type, value range, or constraints. Based on these results, the study recommends improving the data collection procedure of 도서관 정보나루, establishing a schema for each dataset, providing guidance on data collection and data processing, and publishing the raw data.
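The completeness and accuracy diagnosis described above can be illustrated with a minimal sketch. The field names (`title`, `author`, `isbn13`, `publication_year`), the 13-digit ISBN rule, and the year range are assumptions made for illustration only; they are not the schema or the rules used in the study.

```python
import re

# Hypothetical field names; a real dataset's schema may differ.
REQUIRED_FIELDS = ("title", "author", "isbn13", "publication_year")


def is_blank(value):
    """True when a value is missing or only whitespace."""
    return value is None or str(value).strip() == ""


def completeness(records, fields=REQUIRED_FIELDS):
    """Share of records with a non-blank value, per field."""
    if not records:
        return {}
    return {
        field: sum(not is_blank(r.get(field)) for r in records) / len(records)
        for field in fields
    }


def accuracy_errors(record):
    """List rule violations (format, type, range) for one record."""
    errors = []
    isbn = record.get("isbn13")
    # Assumed format rule: exactly 13 ASCII digits.
    if not is_blank(isbn) and not re.fullmatch(r"[0-9]{13}", str(isbn).strip()):
        errors.append("isbn13: format")
    year = record.get("publication_year")
    if not is_blank(year):
        text = str(year).strip()
        if not re.fullmatch(r"[0-9]+", text):
            errors.append("publication_year: type")
        elif not 1000 <= int(text) <= 2100:  # assumed plausible range
            errors.append("publication_year: range")
    return errors
```

Blank values are counted only by the completeness check, so a missing ISBN is not reported a second time as an accuracy error.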



Since information scientists began trying to quantify significant research trends in scientific publications, ‘-metrics’ research such as ‘bibliometrics’, ‘scientometrics’, ‘informetrics’, ‘webometrics’, and ‘citation analysis’ has been recognized as a crucial area of information science. To illustrate the dynamic research activity in these areas, this study investigated the major contributors to ‘-metrics’ research over the last decade at three levels: nations, institutions, and documents. The ‘-metrics’ literature for this study was obtained from the Science Citation Index for the years 2001-2011. The analysis used the Pathfinder network, the PNNC algorithm, PageRank, and several indicators based on the h-index. In terms of international collaboration, the USA and England were identified as the major countries. At the institutional level, Katholieke Universiteit Leuven and the University of Amsterdam in Europe, and Indiana University and the Office of Naval Research in the USA, led collaborative research in informetrics. At the document level, Hirsch’s h-index paper and Ingwersen’s web impact factor paper were identified as the most influential works by two methods: PageRank and the single paper h-index.
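For readers unfamiliar with the h-index that underlies several of the indicators named above, the following is a minimal sketch of the standard definition: the largest h such that h items each have at least h citations. The function name and the sample counts are illustrative, and the study's own h-based variants are not reproduced here.

```python
def h_index(citations):
    """Largest h such that h items have at least h citations each."""
    ranked = sorted(citations, reverse=True)
    h = 0
    for rank, count in enumerate(ranked, start=1):
        if count >= rank:
            h = rank
        else:
            break  # counts only decrease from here, so h cannot grow
    return h


# Illustrative citation counts for five papers
print(h_index([10, 8, 5, 4, 3]))
```

The same function can be applied at any level of aggregation (a nation, an institution, or a set of citing documents) by passing the corresponding list of citation counts.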



The success of social networking sites (SNSs) may depend on many factors, and continuance use of SNSs is one of them. In the Web environment, where users can leave a service with a single mouse click, retaining existing members costs considerable time and effort. Without continuance use, an SNS-based service cannot create any value. This study focused on identifying the factors that influence users’ continuance intention in SNSs. Based on a review of the relevant literature, six influencing factors were initially identified: reputation, relational capital, knowledge quality, compatibility, personalization, and satisfaction. A Web-based questionnaire survey was conducted, and a total of 325 usable responses were collected. A reliability test and two rounds of exploratory factor analysis resulted in five factors. The relationship between these factors and continuance intention was tested using multiple regression analyses. The analyses revealed that satisfaction was the most significant factor. Knowledge quality and relational capital also had significant effects, while reputation and personalization did not have a significant effect on continuance intention. Instead, reputation and personalization significantly influenced satisfaction.

송성전(독립연구자) ; 심지영(연세대학교 대학도서관발전연구소) 2022, Vol.39, No.3, pp.311-336 https://doi.org/10.3743/KOSIM.2022.39.3.311

To identify the preference factors that influence book users’ book recommendations in the library information service environment, this study conducted a content analysis of review data from Goodreads, a social cataloging service built on the participation of book users around the world. To capture user preferences in finer detail, sub-datasets by rating group, by book, and by user were constructed during sample selection, and stratified sampling based on topic modeling of the review text was performed so that diverse topics were evenly represented. As a result, a total of 90 concepts related to preference factors were identified in 7 categories (‘Content’, ‘Character’, ‘Writing’, ‘Reading’, ‘Author’, ‘Story’, ‘Form’). The analysis also identified the general preference factors that appear according to rating, as well as the patterns of preference factors that appear for books and users with clear likes and dislikes. By clarifying the specific aspects of user preference factors, the results are expected to contribute to more sophisticated recommendations in future recommender systems.


This study examines the effectiveness of the ‘diligent search’ stipulated in Article 35-4 of the Copyright Act of Korea and Article 16-3 of its Enforcement Decree. It analyzes in detail the ‘diligent search’ process used to determine that a work is an orphan work (a work whose right holder is unknown), presents its problems, and proposes improvements. A ‘diligent search’ is a process for identifying the holder of the author’s economic rights and his or her place of residence, but the process was found to need improvement because it includes cases in which identifying those details is impossible and cases in which unnecessary searches are required. Based on this, the study proposes amending the Act so that the statutory requirement for a ‘diligent search’ is stated more clearly as ‘the holder of the author’s economic rights and his or her place of residence’, and abolishing the ineffective provisions of Article 16-3, subparagraphs 5 through 8, of the Enforcement Decree.

김기영(연세대학교) 2015, Vol.32, No.3, pp.183-197 https://doi.org/10.3743/KOSIM.2015.32.3.183


The purpose of this conceptual paper is to identify the library services market and its characteristics, in contrast to the common commodity market, so that research and development on marketing and management in library services can be more fruitful. A library services market is identified on the basis of a hypothetical market developed in the paper; it is then compared with the common commodity market in terms of three theoretical characteristics of the library services market: indirect exchange, limited competition, and time-lagging exchange. Based on these characteristics, two possible research directions are suggested: development of goals for library management and consideration of applications in library marketing.

이재윤(경기대학교) ; 최상희(대구가톨릭대학교) 2011, Vol.28, No.2, pp.11-36 https://doi.org/10.3743/KOSIM.2011.28.2.011


Since the 1990s, informetrics has grown in popularity among information scientists. Today it is a general discipline that comprises all kinds of metrics, including bibliometrics and scientometrics. To illustrate the dynamic progress of this field, this study aims to identify the structure and infrastructure of the informetrics literature using statistical and profiling methods. The informetrics literature was obtained from the Web of Knowledge for the years 2001-2010. The selected articles contain at least one of these keywords: ‘informetrics’, ‘bibliometrics’, ‘scientometrics’, ‘webometrics’, and ‘citation analysis.’ Noteworthy publication patterns of major countries were identified by a statistical method. The intellectual structure analysis shows the major research areas, authors, and journals.

허고은(연세대학교) ; 송민(연세대학교) 2019, Vol.36, No.2, pp.175-199 https://doi.org/10.3743/KOSIM.2019.36.2.175

Uncertainty refers to a state in which knowledge of a proposition is incomplete because of a lack of consensus on information or a lack of existing knowledge. The volume of academic literature studying the uncertainty of scientific knowledge is growing exponentially over time, and with it new knowledge is discovered and research advances. Although the passage of time can thus be an important factor in discovering patterns of uncertainty in knowledge, existing studies have characterized uncertainty in particular disciplines only on the basis of the simple occurrence frequency of uncertainty words, without taking the passage of time into account. Therefore, this study applies the uncertainty words it constructed to uncertainty research in the biomedical domain to identify changes and patterns of uncertainty over time. To analyze the patterns of biomedical knowledge over time, we examined the patterns of representative entity pairs, predicate (verb) types, and representative entities, and tested significance using linear regression analysis. In the entity pair analysis, 7 of 17 entity pairs showed a significantly decreasing pattern. All 10 representative predicate types decreased significantly over time. In the analysis of the relative importance of representative entities by year, we analyzed the increase and decrease in uncertainty for entities showing significantly rising and falling patterns.
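The trend test mentioned above (linear regression over time with a significance check) can be sketched as follows. This is a minimal illustration assuming ordinary least squares with one yearly value per series; the function name and the sample data are hypothetical, and the study's actual variables and software are not stated in the abstract.

```python
import math


def trend_t_statistic(years, values):
    """Fit values = intercept + slope * year by least squares.

    Returns (slope, t_statistic) for the null hypothesis slope == 0.
    """
    n = len(years)
    if n < 3:
        raise ValueError("need at least three points")
    mean_x = sum(years) / n
    mean_y = sum(values) / n
    sxx = sum((x - mean_x) ** 2 for x in years)
    if sxx == 0:
        raise ValueError("years must not all be equal")
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(years, values))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    # Residual sum of squares around the fitted line
    sse = sum((y - (intercept + slope * x)) ** 2 for x, y in zip(years, values))
    std_err = math.sqrt(sse / (n - 2) / sxx)
    if std_err == 0:
        # Perfect fit: the statistic is unbounded (or zero for a flat line)
        t_stat = math.copysign(math.inf, slope) if slope else 0.0
        return slope, t_stat
    return slope, slope / std_err


# Hypothetical yearly frequencies of one uncertainty pattern
slope, t_stat = trend_t_statistic([1, 2, 3, 4, 5], [5.1, 3.9, 3.0, 2.1, 0.9])
```

A negative slope whose absolute t statistic exceeds the critical value of Student's t distribution with n - 2 degrees of freedom indicates a significant decrease; for five points (3 degrees of freedom) the two-sided 5% critical value is about 3.182.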
