바로가기메뉴

본문 바로가기 주메뉴 바로가기

logo

  • P-ISSN1013-0799
  • E-ISSN2586-2073
  • KCI

A Study on the Pivoted Inverse Document Frequency Weighting Method

Journal of the Korean Society for Information Management / Journal of the Korean Society for Information Management, (P)1013-0799; (E)2586-2073
2003, v.20 no.4, pp.233-248
https://doi.org/10.3743/KOSIM.2003.20.4.233

Abstract

The Inverse Document Frequency (IDF) weighting method is based on the hypothesis that in the document collection the lower the frequency of a term is, the more important the term is as a subject word. This well-known hypothesis is, however, somewhat questionable because some low frequency terms turn out to be insufficient subject words. This study suggests the pivoted IDF weighting method for better retrieval effectiveness, on the assumption that medium frequency terms are more important than low frequency terms. We thoroughly evaluated this method on three test collections and it showed performance improvements especially at high ranks.

keywords
역문헌빈도, 정보검색, 용어가중치, Information Retrieval, Inverse Document Frequency, Term Weights

Journal of the Korean Society for Information Management