File Download

There are no files associated with this item.

  • Find it @ UNIST can give you direct access to the published full text of this article. (UNISTARs only)
Related Researcher

나승훈

Na, Seung-Hoon
Natural Language Processing Lab
Read More

Views & Downloads

Detailed Information

Cited time in webofscience Cited time in scopus
Metadata Downloads

Full metadata record

DC Field Value Language
dc.citation.endPage 145 -
dc.citation.number 1 -
dc.citation.startPage 121 -
dc.citation.title INFORMATION PROCESSING & MANAGEMENT -
dc.citation.volume 43 -
dc.contributor.author Na, Seung-Hoon -
dc.contributor.author Kang, In-Su -
dc.contributor.author Lee, Jong-Hyeok -
dc.date.accessioned 2025-04-25T15:14:05Z -
dc.date.available 2025-04-25T15:14:05Z -
dc.date.created 2025-04-08 -
dc.date.issued 2007-01 -
dc.description.abstract In the KL divergence framework, the extended language modeling approach has a critical problem of estimating a query model, which is the probabilistic model that encodes the user's information need. For query expansion in initial retrieval, the translation model had been proposed to involve term co-occurrence statistics. However, the translation model was difficult to apply, because the term co-occurrence statistics must be constructed in the offline time. Especially in a large collection, constructing such a large matrix of term co-occurrences statistics prohibitively increases time and space complexity. In addition, reliable retrieval performance cannot be guaranteed because the translation model may comprise noisy non-topical terms in documents. To resolve these problems, this paper investigates an effective method to construct co-occurrence statistics and eliminate noisy terms by employing a parsimonious translation model. The parsimonious translation model is a compact version of a translation model that can reduce the number of terms containing non-zero probabilities by eliminating non-topical terms in documents. Through experimentation on seven different test collections, we show that the query model estimated from the parsimonious translation model significantly outperforms not only the baseline language modeling, but also the non-parsimonious models. (c) 2006 Elsevier Ltd. All rights reserved. -
dc.identifier.bibliographicCitation INFORMATION PROCESSING & MANAGEMENT, v.43, no.1, pp.121 - 145 -
dc.identifier.doi 10.1016/j.ipm.2006.04.005 -
dc.identifier.issn 0306-4573 -
dc.identifier.scopusid 2-s2.0-33748316409 -
dc.identifier.uri https://scholarworks.unist.ac.kr/handle/201301/86844 -
dc.identifier.wosid 000241539500009 -
dc.language 영어 -
dc.publisher ELSEVIER SCI LTD -
dc.title Parsimonious translation models for information retrieval -
dc.type Article -
dc.description.isOpenAccess FALSE -
dc.relation.journalWebOfScienceCategory Computer Science, Information Systems; Information Science & Library Science -
dc.relation.journalResearchArea Computer Science; Information Science & Library Science -
dc.type.docType Article -
dc.description.journalRegisteredClass scie -
dc.description.journalRegisteredClass scopus -
dc.subject.keywordAuthor information retrieval -
dc.subject.keywordAuthor language model -
dc.subject.keywordAuthor parsimonious translation model -
dc.subject.keywordAuthor query expansion -

qrcode

Items in Repository are protected by copyright, with all rights reserved, unless otherwise indicated.