File Download

  • Find it @ UNIST can give you direct access to the published full text of this article. (UNISTARs only)
Related Researcher

남덕우

Nam, Dougu
Bioinformatics Lab.
Read More

Views & Downloads

Detailed Information

Cited time in webofscience Cited time in scopus
Metadata Downloads

Full metadata record

DC Field Value Language
dc.citation.endPage 12 -
dc.citation.number 22 -
dc.citation.startPage 1 -
dc.citation.title BMC BIOINFORMATICS -
dc.citation.volume 10 -
dc.contributor.author Kim, Eun-Youn -
dc.contributor.author Kim, Seon-Young -
dc.contributor.author Ashlock, Daniel -
dc.contributor.author Nam, Dougu -
dc.date.accessioned 2023-12-22T07:42:20Z -
dc.date.available 2023-12-22T07:42:20Z -
dc.date.created 2014-10-13 -
dc.date.issued 2009-08 -
dc.description.abstract Background: Uncovering subtypes of disease from microarray samples has important clinical implications such as survival time and sensitivity of individual patients to specific therapies. Unsupervised clustering methods have been used to classify this type of data. However, most existing methods focus on clusters with compact shapes and do not reflect the geometric complexity of the high dimensional microarray clusters, which limits their performance. Results: We present a cluster-number-based ensemble clustering algorithm, called MULTI-K, for microarray sample classification, which demonstrates remarkable accuracy. The method amalgamates multiple k-means runs by varying the number of clusters and identifies clusters that manifest the most robust co-memberships of elements. In addition to the original algorithm, we newly devised the entropy-plot to control the separation of singletons or small clusters. MULTI-K, unlike the simple k-means or other widely used methods, was able to capture clusters with complex and high-dimensional structures accurately. MULTI-K outperformed other methods including a recently developed ensemble clustering algorithm in tests with five simulated and eight real gene-expression data sets. Conclusion: The geometric complexity of clusters should be taken into account for accurate classification of microarray data, and ensemble clustering applied to the number of clusters tackles the problem very well. The C++ code and the data sets tested are available from the authors. -
dc.identifier.bibliographicCitation BMC BIOINFORMATICS, v.10, no.22, pp.1 - 12 -
dc.identifier.doi 10.1186/1471-2105-10-260 -
dc.identifier.issn 1471-2105 -
dc.identifier.scopusid 2-s2.0-70349730070 -
dc.identifier.uri https://scholarworks.unist.ac.kr/handle/201301/7149 -
dc.identifier.url http://www.scopus.com/inward/record.url?partnerID=HzOxMe3b&scp=70349730070 -
dc.identifier.wosid 000270274000002 -
dc.language 영어 -
dc.publisher BIOMED CENTRAL LTD -
dc.title MULTI-K: accurate classification of microarray subtypes using ensemble k-means clustering -
dc.type Article -
dc.description.isOpenAccess TRUE -
dc.description.journalRegisteredClass scie -
dc.description.journalRegisteredClass scopus -

qrcode

Items in Repository are protected by copyright, with all rights reserved, unless otherwise indicated.