File Download

There are no files associated with this item.

  • Find it @ UNIST can give you direct access to the published full text of this article. (UNISTARs only)
Related Researcher

나승훈

Na, Seung-Hoon
Natural Language Processing Lab
Read More

Views & Downloads

Detailed Information

Cited time in webofscience Cited time in scopus
Metadata Downloads

Adaptive document clustering based on query-based similarity

Author(s)
Na, Seung-HoonKang, I.-S.Lee, J.-H.
Issued Date
2007-07
DOI
10.1016/j.ipm.2006.08.008
URI
https://scholarworks.unist.ac.kr/handle/201301/86842
Citation
INFORMATION PROCESSING AND MANAGEMENT, v.43, no.4, pp.887 - 901
Abstract
In information retrieval, cluster-based retrieval is a well-known attempt in resolving the problem of term mismatch. Clustering requires similarity information between the documents, which is difficult to calculate at a feasible time. The adaptive document clustering scheme has been investigated by researchers to resolve this problem. However, its theoretical viewpoint has not been fully discovered. In this regard, we provide a conceptual viewpoint of the adaptive document clustering based on query-based similarities, by regarding the user's query as a concept. As a result, adaptive document clustering scheme can be viewed as an approximation of this similarity. Based on this idea, we derive three new query-based similarity measures in language modeling framework, and evaluate them in the context of cluster-based retrieval, comparing with K-means clustering and full document expansion. Evaluation result shows that retrievals based on query-based similarities significantly improve the baseline, while being comparable to other methods. This implies that the newly developed query-based similarities become feasible criterions for adaptive document clustering. © 2006 Elsevier Ltd. All rights reserved.
Publisher
Pergamon Press Ltd.
ISSN
0306-4573
Keyword (Author)
Adaptive document clusteringCluster-based retrievalLanguage modeling approachQuery-based similarity

qrcode

Items in Repository are protected by copyright, with all rights reserved, unless otherwise indicated.