File Download

There are no files associated with this item.

  • Find it @ UNIST can give you direct access to the published full text of this article. (UNISTARs only)
Related Researcher

MarcoComuzzi

Comuzzi, Marco
Intelligent Enterprise Lab.
Read More

Views & Downloads

Detailed Information

Cited time in webofscience Cited time in scopus
Metadata Downloads

Declarative process mining in big data scenarios using an application-agnostic framework

Author(s)
Mavroudopoulos, IBalaktsis, CVarvoutas, KKougka, GGounaris, AComuzzi, Marco
Issued Date
2025-04
DOI
10.1007/s44311-025-00013-9
URI
https://scholarworks.unist.ac.kr/handle/201301/87523
Citation
PROCESS SCIENCE , v.2, pp.6
Abstract
Although they are usually designed for top-notch scalability and flexibility, application-agnostic big data pattern analysis frameworks are seldom exploited in process mining. As the size of event logs and the velocity at which events can be generated grow, however, the need for big data-aware process mining solutions emerges. This work targets the extraction of declarative process constraints. Its key novelty lies in employing an application-agnostic pattern analysis framework, called SIESTA, rather than devising an ad-hoc solution tailored to declarative constraint discovery. The key contribution of our work is threefold: (i) we show how we can build on top of SIESTA to extract the full set of Declare constraints in large event logs in a more efficient and scalable manner than the ad-hoc competitors; (ii) we extend our SIESTA-based approach to operate in an incremental manner, which may be required both when the event logs are very large and when they are continuously updated by new batches of events; and (iii) we demonstrate how our SIESTA-based framework can be extended to mine temporal violations of Declare constraints to support variant analysis. The experimental results show that our solution can ingest and process thousands of events per second even using a commodity machine, it can handle datasets of tens of millions of events, and it is much faster than the competitors in repetitive constraint extraction tasks for larger datasets than the ones that can be handled by the competitors.
Publisher
Springer Nature
ISSN
2948-2178

qrcode

Items in Repository are protected by copyright, with all rights reserved, unless otherwise indicated.