BROWSE

Related Researcher

Author

Song, Minseok
Business Process Intelligence(BPI) Lab
Research Interests
  • Process mining

ITEM VIEW & DOWNLOAD

A comparative study of dimensionality reduction techniques to enhance trace clustering performances

Cited 0 times inthomson ciCited 4 times inthomson ci
Title
A comparative study of dimensionality reduction techniques to enhance trace clustering performances
Author
Song, MinseokYang, H.Siadat, S. H.Pechenizkiy, M.
Keywords
Big datum; Comparative studies; Computation time; Dimensionality reduction; Dimensionality reduction techniques; Dutch hospitals; Event logs; Experimental studies; Feature space; Feature transformations; High dimensionality; In-process; PCA; Principal components analysis; Process mining; Process model; Random projections; State of the art; Trace clustering; Treatment process; Useful patterns
Issue Date
201307
Publisher
PERGAMON-ELSEVIER SCIENCE LTD
Citation
EXPERT SYSTEMS WITH APPLICATIONS, v.40, no.9, pp.3722 -
Abstract
Process mining techniques have been used to analyze event logs from information systems in order to derive useful patterns. However, in the big data era, real-life event logs are huge, unstructured, and complex so that traditional process mining techniques have difficulties in the analysis of big logs. To reduce the complexity during the analysis, trace clustering can be used to group similar traces together and to mine more structured and simpler process models for each of the clusters locally. However, a high dimensionality of the feature space in which all the traces are presented poses different problems to trace clustering. In this paper, we study the effect of applying dimensionality reduction (preprocessing) techniques on the performance of trace clustering. In our experimental study we use three popular feature transformation techniques; singular value decomposition (SVD), random projection (RP), and principal components analysis (PCA), and the state-of-the art trace clustering in process mining. The experimental results on the dataset constructed from a real event log recorded from patient treatment processes in a Dutch hospital show that dimensionality reduction can improve trace clustering performance with respect to the computation time and average fitness of the mined local process models.
URI
Go to Link
DOI
http://dx.doi.org/10.1016/j.eswa.2012.12.078
ISSN
0957-4174
Appears in Collections:
SBA_Journal Papers

find_unist can give you direct access to the published full text of this article. (UNISTARs only)

Show full item record

qr_code

  • mendeley

    citeulike

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.

MENU