Tuning Machine Learning to Address Process Mining Requirements

Paolo Ceravolo; Sylvio Barbon Junior; Ernesto Damiani; Wil Van Der Aalst

doi:10.1109/access.2024.3361650

IEEE Access (Jan 2024)

Tuning Machine Learning to Address Process Mining Requirements

Paolo Ceravolo,
Sylvio Barbon Junior,
Ernesto Damiani,
Wil Van Der Aalst

Affiliations

Paolo Ceravolo: ORCiD; Department of Computer Science, University of Milan, Milan, Italy
Sylvio Barbon Junior: ORCiD; Department of Engineering and Architecture, University of Trieste, Trieste, Italy
Ernesto Damiani: ORCiD; Department of Electrical Engineering and Computer Science, Khalifa University, Abu Dhabi, United Arab Emirates
Wil Van Der Aalst: ORCiD; Chair of Process and Data Science, RWTH Aachen University, Aachen, Germany

DOI: https://doi.org/10.1109/access.2024.3361650
Journal volume & issue: Vol. 12
pp. 24583 – 24595

Abstract

Read online

Machine learning models are routinely integrated into process mining pipelines to carry out tasks like data transformation, noise reduction, anomaly detection, classification, and prediction. Often, the design of such models is based on some ad-hoc assumptions about the corresponding data distributions, which are not necessarily in accordance with the non-parametric distributions typically observed with process data. Moreover, mainstream machine-learning approaches tend to ignore the challenges posed by concurrency in operational processes. Data encoding is a key element to smooth the mismatch between these assumptions but its potential is poorly exploited. In this paper, we argue that a deeper understanding of the challenges associated with training machine learning models on process data is essential for establishing a robust integration of process mining and machine learning. Our analysis aims to lay the groundwork for a methodology that aligns machine learning with process mining requirements. We encourage further research in this direction to advance the field and effectively address these critical issues.

Published in IEEE Access

ISSN: 2169-3536 (Online)
Publisher: IEEE
Country of publisher: United States
LCC subjects: Technology: Electrical engineering. Electronics. Nuclear engineering
Website: https://ieeexplore.ieee.org/xpl/RecentIssue.jsp?punumber=6287639

About the journal

Abstract

Keywords