This study ran term selection experiments to identify the best method for choosing additional terms related to an initial query term. Association term sets were generated using the support, confidence, and lift measures of the Apriori algorithm, and using five similarity measures: GSS, the Jaccard coefficient, the cosine coefficient, Sokal & Sneath 5, and mutual information. The term selection methods were evaluated on two criteria: the precision of the association terms, and the overlap ratio between the association terms and the indexing terms of relevant documents. The Apriori algorithm and GSS achieved the highest performance.
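As a rough illustration of how some of these measures can be computed, the sketch below scores a candidate term against a query term from document-level co-occurrence counts. It is not the study's implementation: the function name `association_scores`, the toy document collection, and the choice to treat each document as a set of index terms are all assumptions made for the example. It uses the standard textbook definitions of support, confidence, lift, the Jaccard coefficient, and the cosine coefficient; GSS, Sokal & Sneath 5, and mutual information are left out.

```python
from math import sqrt


def association_scores(docs, x, y):
    """Score candidate term y against query term x (hypothetical helper).

    docs is a list of sets, each holding the index terms of one document.
    """
    n = len(docs)
    n_x = sum(1 for d in docs if x in d)               # documents containing x
    n_y = sum(1 for d in docs if y in d)               # documents containing y
    n_xy = sum(1 for d in docs if x in d and y in d)   # documents containing both

    support = n_xy / n if n else 0.0                   # P(x, y)
    confidence = n_xy / n_x if n_x else 0.0            # P(y | x)
    lift = confidence / (n_y / n) if n_y else 0.0      # P(y | x) / P(y)
    union = n_x + n_y - n_xy
    jaccard = n_xy / union if union else 0.0
    cosine = n_xy / sqrt(n_x * n_y) if n_x and n_y else 0.0

    return {
        "support": support,
        "confidence": confidence,
        "lift": lift,
        "jaccard": jaccard,
        "cosine": cosine,
    }


# Toy collection (assumed data, for illustration only).
docs = [
    {"retrieval", "query", "index"},
    {"retrieval", "query"},
    {"retrieval", "index"},
    {"mining", "apriori"},
]

scores = association_scores(docs, "retrieval", "query")
for name, value in scores.items():
    print(f"{name}: {value:.3f}")
```

Ranking every candidate term by one of these scores and keeping the top few would give an association term set for the query term. Measures normalized in different ways tend to favor terms of different frequency levels, so their rankings need not agree.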
