TY - GEN
T1 - Turning Logs Into Lumber
T2 - Preprocessing Tasks in Process Mining
AU - Liu, Ying
AU - Stein Dani, Vinicius
AU - Beerepoot, Iris
AU - Lu, Xixi
N1 - Publisher Copyright:
© The Author(s), under exclusive license to Springer Nature Switzerland AG 2024.
PY - 2024/4/13
Y1 - 2024/4/13
N2 - Event logs are invaluable for conducting process mining projects, offering insights into process improvement and data-driven decision-making. However, data quality issues affect the correctness and trustworthiness of these insights, making preprocessing tasks a necessity. Despite the recognized importance, the execution of preprocessing tasks remains ad-hoc, lacking support. This paper presents a systematic literature review that establishes a comprehensive repository of preprocessing tasks and their usage in case studies. We identify six high-level and 20 low-level preprocessing tasks in case studies. Log filtering, transformation, and abstraction are commonly used, while log enriching, integration, and reduction are less frequent. These results can be considered a first step in contributing to more structured, transparent event log preprocessing, enhancing process mining reliability.
AB - Event logs are invaluable for conducting process mining projects, offering insights into process improvement and data-driven decision-making. However, data quality issues affect the correctness and trustworthiness of these insights, making preprocessing tasks a necessity. Despite the recognized importance, the execution of preprocessing tasks remains ad-hoc, lacking support. This paper presents a systematic literature review that establishes a comprehensive repository of preprocessing tasks and their usage in case studies. We identify six high-level and 20 low-level preprocessing tasks in case studies. Log filtering, transformation, and abstraction are commonly used, while log enriching, integration, and reduction are less frequent. These results can be considered a first step in contributing to more structured, transparent event log preprocessing, enhancing process mining reliability.
KW - Event log
KW - Log preprocessing
KW - Process mining
UR - http://www.scopus.com/inward/record.url?scp=85192209908&partnerID=8YFLogxK
U2 - 10.1007/978-3-031-56107-8_8
DO - 10.1007/978-3-031-56107-8_8
M3 - Conference contribution
SN - 9783031561061
T3 - Lecture Notes in Business Information Processing
SP - 98
EP - 109
BT - Process Mining Workshops - ICPM 2023 International Workshops, 2023, Revised Selected Papers
A2 - De Smedt, Johannes
A2 - Soffer, Pnina
PB - Springer
ER -