File Download

There are no files associated with this item.

  • Find it @ UNIST can give you direct access to the published full text of this article. (UNISTARs only)
Related Researcher

박성훈

Park, Sunghoon
Biochemical Engineering Lab.
Read More

Views & Downloads

Detailed Information

Cited time in webofscience Cited time in scopus
Metadata Downloads

A gene clustering method with masking cross-matching fragments using modified suffix tree clustering method

Author(s)
Han, Sang IlLee, Sung GunHou, Bo KyengPark, SunghoonKim, Young HanHwang, Kyu Suk
Issued Date
2005-05
DOI
10.1007/BF02719409
URI
https://scholarworks.unist.ac.kr/handle/201301/25382
Fulltext
https://link.springer.com/article/10.1007%2FBF02719409
Citation
KOREAN JOURNAL OF CHEMICAL ENGINEERING, v.22, no.3, pp.345 - 352
Abstract
Multiple sequence alignment is a method for comparing two or more DNA or protein sequences. Most multiple sequence alignment methods rely on pairwise alignment and Smith-Waterman algorithm [Needleman and Wunsch, 1970; Smith and Waterman, 198 1] to generate an alignment hierarchy. Therefore, as the number of sequences increases, the runtime increases exponentially. To resolve this problem, this paper presents a multiple sequence alignment method using a parallel processing suffix tree algorithm to search for common subsequences at one time without pairwise alignment. The cross-matched subsequences among the searched common subsequences may be generated and those cause inexact-matching. So the procedure of masking cross-matching pairs was suggested in this study. The proposed method, improved STC (Suffix Tree Clustering), is summarized as follows: (1) construction of suffix tree; (2) search and overlap of common subsequences; (3) grouping of subsequence pairs; (4) masking of cross-matching pairs; and (5) clustering of gene sequences. The new method was successfully evaluated with 23 genes in Mus musculus and 22 genes in three species, clustering nine and eight clusters, respectively.
Publisher
KOREAN INSTITUTE CHEMICAL ENGINEERS
ISSN
0256-1115
Keyword (Author)
multiple sequence alignmentclusteringsequencegene
Keyword
MULTIPLE SEQUENCE ALIGNMENTCLUSTALSIMILARITIESCONSTRUCTIONALGORITHM

qrcode

Items in Repository are protected by copyright, with all rights reserved, unless otherwise indicated.