File Download

There are no files associated with this item.

  • Find it @ UNIST can give you direct access to the published full text of this article. (UNISTARs only)
Related Researcher

나승훈

Na, Seung-Hoon
Natural Language Processing Lab
Read More

Views & Downloads

Detailed Information

Cited time in webofscience Cited time in scopus
Metadata Downloads

Full metadata record

DC Field Value Language
dc.citation.endPage 239 -
dc.citation.number 2 -
dc.citation.startPage 219 -
dc.citation.title INFORMATION PROCESSING & MANAGEMENT -
dc.citation.volume 54 -
dc.contributor.author Na, Seung-Hoon -
dc.contributor.author Kim, Kangil -
dc.date.accessioned 2025-04-25T15:12:46Z -
dc.date.available 2025-04-25T15:12:46Z -
dc.date.created 2025-04-08 -
dc.date.issued 2018-03 -
dc.description.abstract Document length normalization is one of the fundamental components in a retrieval model because term frequencies can readily be increased in long documents. The key hypotheses in literature regarding document length normalization are the verbosity and scope hypotheses, which imply that document length normalization should consider the distinguishing effects of verbosity and scope on term frequencies. In this article, we extend these hypotheses in a pseudo-relevance feedback setting by assuming the verbosity hypothesis on the feedback query model, which states that the verbosity of an expanded query should not be high. Furthermore, we postulate the following two effects of document verbosity on a feedback query model that easily and typically holds in modem pseudo-relevance feedback methods: 1) the verbosity-preserving effect the query verbosity of a feedback query model is determined by feedback document verbosities; 2) the verbosity-sensitive effect highly verbose documents more significantly and unfairly affect the resulting query model than normal documents do. By considering these effects, we propose verbosity normalized pseudo-relevance feedback, which is straightforwardly obtained by replacing original term frequencies with their verbosity-normalized term frequencies in the pseudo-relevance feedback method. The results of the experiments performed on three standard TREC collections show that the proposed verbosity normalized pseudo-relevance feedback consistently provides statistically significant improvements over conventional methods, under the settings of the relevance model and latent concept expansion. -
dc.identifier.bibliographicCitation INFORMATION PROCESSING & MANAGEMENT, v.54, no.2, pp.219 - 239 -
dc.identifier.doi 10.1016/j.ipm.2017.09.006 -
dc.identifier.issn 0306-4573 -
dc.identifier.scopusid 2-s2.0-85034863034 -
dc.identifier.uri https://scholarworks.unist.ac.kr/handle/201301/86814 -
dc.identifier.wosid 000426022300007 -
dc.language 영어 -
dc.publisher ELSEVIER SCI LTD -
dc.title Verbosity normalized pseudo-relevance feedback in information retrieval -
dc.type Article -
dc.description.isOpenAccess FALSE -
dc.relation.journalWebOfScienceCategory Computer Science, Information Systems; Information Science & Library Science -
dc.relation.journalResearchArea Computer Science; Information Science & Library Science -
dc.type.docType Article -
dc.description.journalRegisteredClass scie -
dc.description.journalRegisteredClass ssci -
dc.description.journalRegisteredClass scopus -
dc.subject.keywordAuthor Pseudo-relevance feedback -
dc.subject.keywordAuthor Verbosity normalization -
dc.subject.keywordAuthor Scope normalization -
dc.subject.keywordAuthor Term frequency -

qrcode

Items in Repository are protected by copyright, with all rights reserved, unless otherwise indicated.