This experimental study suggests an element-based XML document retrieval method that reveals highly relevant elements. The models investigated here for comparison are divergence and smoothing method, and hierarchical language model. In conclusion, the hierarchical language model proved to be most effective in element-based XML document retrieval with regard to the improved exhaustivity and harmed specificity.
Abolhassani, M. (2004). Applying the Divergence From Randomness Approach for Content-Only Search in XML Documents. European Conference on Information Retrieval Research, 26, -.
Amati, G. (2002). Probabilistic models of information retrieval based on measuring the divergence from randomness. ACM Transactions on Information Systems, 20(4), -.
Chiaramella, Y. (1996). A Model for multimedia information retrieval. University of Glasgow.
Hiemstra, D. (1999). Twenty-One at TREC-7: Ad-hoc and cross-language track. Text REtrieval Conference, 7, 227-238.
Jelinek, F.. (1980). Interpolated estimation of Markov source parameters from sparse data (-). Pattern Recognition in Practice.
McCallum, A. (1999). Text classification by bootstrapping with keywords, em and shrinkage (52-58). ACL 99 Workshop for Unsupervised Learning in Natural Language Processing.
Miller, D. R. H. (1999). A hidden Markov model information retrieval system. ACM SIGIR Conference, 22, 214-221.
Moffat, A. (1994). Retrieval of partial documents In D. Harman, editor, Proceedings of the Second Text REtrieval Conference (TREC-2).
Ogilvie, P. (2003). Using Language Models for Flat Text Queries in XML Retrieval. In Proceedings of the Second Annual Workshop of the INitiative for the Evaluation of XML Retrieval (INEX)..
Ponte, J. M.. (1998). A language modeling approach to information retrieval. In Proceedings of the 21st ACM Conference on Research and Development in Information Retrieval, , -.
Salton, G. (1993). Approach to passage retrieval in full text information systems. Annual International Conference on Research and Development in Information Retrieval, 16, -.
Sigurbjörnsson, B.,. (2003). An Element- based Approach to XML Retrieval (-). the third Workshop of the INitiative for the Evaluation of XML Retrieval.
Singhul, A. (1996). Pivoted document length normalization. In Proceedings of the 19th Annual International ACM-SIGIR Conference on Research and Development Information Retrieval, 19, 21-29.
Wilkinson, R. (1994). Effective retrieval of structured docu- ments. Proceedings of SIGIR Conference, , 311-317.
Zhai, C. (2001). A study of smoothing methods for language models applied to ad hoc information retrieval. Proceedings of the ACM SIGIR Conference, 24, 334-342.