File Download

There are no files associated with this item.

  • Find it @ UNIST can give you direct access to the published full text of this article. (UNISTARs only)
Related Researcher

나승훈

Na, Seung-Hoon
Natural Language Processing Lab
Read More

Views & Downloads

Detailed Information

Cited time in webofscience Cited time in scopus
Metadata Downloads

Verbosity normalized pseudo-relevance feedback in information retrieval

Author(s)
Na, Seung-HoonKim, Kangil
Issued Date
2018-03
DOI
10.1016/j.ipm.2017.09.006
URI
https://scholarworks.unist.ac.kr/handle/201301/86814
Citation
INFORMATION PROCESSING & MANAGEMENT, v.54, no.2, pp.219 - 239
Abstract
Document length normalization is one of the fundamental components in a retrieval model because term frequencies can readily be increased in long documents. The key hypotheses in literature regarding document length normalization are the verbosity and scope hypotheses, which imply that document length normalization should consider the distinguishing effects of verbosity and scope on term frequencies. In this article, we extend these hypotheses in a pseudo-relevance feedback setting by assuming the verbosity hypothesis on the feedback query model, which states that the verbosity of an expanded query should not be high. Furthermore, we postulate the following two effects of document verbosity on a feedback query model that easily and typically holds in modem pseudo-relevance feedback methods: 1) the verbosity-preserving effect the query verbosity of a feedback query model is determined by feedback document verbosities; 2) the verbosity-sensitive effect highly verbose documents more significantly and unfairly affect the resulting query model than normal documents do. By considering these effects, we propose verbosity normalized pseudo-relevance feedback, which is straightforwardly obtained by replacing original term frequencies with their verbosity-normalized term frequencies in the pseudo-relevance feedback method. The results of the experiments performed on three standard TREC collections show that the proposed verbosity normalized pseudo-relevance feedback consistently provides statistically significant improvements over conventional methods, under the settings of the relevance model and latent concept expansion.
Publisher
ELSEVIER SCI LTD
ISSN
0306-4573
Keyword (Author)
Pseudo-relevance feedbackVerbosity normalizationScope normalizationTerm frequency

qrcode

Items in Repository are protected by copyright, with all rights reserved, unless otherwise indicated.