There are no files associated with this item.
Cited time in
Full metadata record
| DC Field | Value | Language |
|---|---|---|
| dc.citation.endPage | 36 | - |
| dc.citation.number | 2 | - |
| dc.citation.startPage | 1 | - |
| dc.citation.title | JOURNAL OF DATA AND INFORMATION QUALITY | - |
| dc.citation.volume | 17 | - |
| dc.contributor.author | Comuzzi, Marco | - |
| dc.contributor.author | Ko, Jonghyeon | - |
| dc.contributor.author | Maggi, Fabrizio Maria | - |
| dc.date.accessioned | 2025-07-24T10:00:00Z | - |
| dc.date.available | 2025-07-24T10:00:00Z | - |
| dc.date.created | 2025-07-24 | - |
| dc.date.issued | 2025-06 | - |
| dc.description.abstract | Real-life business process event logs may suffer from significant data quality problems negatively influencing process mining analysis. Over time, a range of approaches has been developed to detect and repair these quality problems. Validation of these approaches tends to be challenging due to the lack of a ground truth. Moreover, the identification and definition of event log quality problems have been tackled mainly through a pattern-based approach, with systematic and extensible methods currently lacking. In this article, we present FLAWD, a formal language for describing event log data quality issues that enables solutions addressing the shortcomings of process mining data quality research identified above. FLAWD can be used to formally describe and possibly reason over event log data quality errors, as well as to guide the development of tools for controlled and sophisticated “polluting” of event logs through which benchmark datasets may be systematically created. We present the abstract syntax grammar of FLAWD and an open-source software tool based on it that allows for the insertion of all so-called event log imperfection patterns in a stochastic manner. We show how FLAWD has been used in our research to generate benchmark datasets and how it can be used to formally describe and replicate a range of errors found in real-life event logs. | - |
| dc.identifier.bibliographicCitation | JOURNAL OF DATA AND INFORMATION QUALITY, v.17, no.2, pp.1 - 36 | - |
| dc.identifier.doi | 10.1145/3743144 | - |
| dc.identifier.issn | 1936-1955 | - |
| dc.identifier.scopusid | 2-s2.0-105009653864 | - |
| dc.identifier.uri | https://scholarworks.unist.ac.kr/handle/201301/87522 | - |
| dc.identifier.wosid | 001532034900001 | - |
| dc.language | 영어 | - |
| dc.publisher | ASSOC COMPUTING MACHINERY | - |
| dc.title | A Language to Model and Simulate Data Quality Issues in Process Mining | - |
| dc.type | Article | - |
| dc.description.isOpenAccess | TRUE | - |
| dc.type.docType | Article | - |
| dc.description.journalRegisteredClass | scopus | - |
Items in Repository are protected by copyright, with all rights reserved, unless otherwise indicated.
Tel : 052-217-1403 / Email : scholarworks@unist.ac.kr
Copyright (c) 2023 by UNIST LIBRARY. All rights reserved.
ScholarWorks@UNIST was established as an OAK Project for the National Library of Korea.