A Probabilistic Approach to Event-Case Correlation for Process Mining

Dina Bayomie, Claudio Di Ciccio, Marcello La Rosa, Jan Mendling

Research output: Chapter in Book/Report/Conference proceedingConference contributionAcademicpeer-review

Abstract

Process mining aims to understand the actual behavior and performance of business processes from event logs recorded by IT systems. A key requirement is that every event in the log must be associated with a unique case identifier (e.g., the order ID in an order-to-cash process). In reality, however, this case ID may not always be present, especially when logs are acquired from different systems or when such systems have not been explicitly designed to offer process-tracking capabilities. Existing techniques for correlating events have worked with assumptions to make the problem tractable: some assume the generative processes to be acyclic while others require heuristic information or user input. In this paper, we lift these assumptions by presenting a novel technique called EC-SA based on probabilistic optimization. Given as input a sequence of timestamped events (the log without case IDs) and a process model describing the underlying business process, our approach returns an event log in which every event is mapped to a case identifier. The approach minimises the misalignment between the generated log and the input process model, and the variance between activity durations across cases. The experiments conducted on a variety of real-life datasets show the advantages of our approach over the state of the art.
Original languageEnglish
Title of host publicationConceptual Modeling - 39th International Conference, ER 2019, Vienna, Austria, November 3-6, 2020, Proceedings
EditorsGillian Dobbie, Ulrich Frank, Gerti Kappel, Stephen W. Liddle, Heinrich C. Mayr
PublisherSpringer
Pages136-152
Number of pages17
ISBN (Print)978-3-030-62522-1
DOIs
Publication statusPublished - Nov 2019

Publication series

NameLecture Notes in Computer Science
PublisherSpringer

Keywords

  • Event correlation
  • Simulated annealing
  • Process mining

Fingerprint

Dive into the research topics of 'A Probabilistic Approach to Event-Case Correlation for Process Mining'. Together they form a unique fingerprint.

Cite this