Abstract
Process mining is a discipline that enables organizations to discover and analyze their work processes. A prerequisite for conducting a process mining initiative is the so-called event log, which is not always readily available. In such cases, extracting an event log involves various time-consuming tasks, such as creating tailor-made structured query language (SQL) scripts to extract an event log from a relational database. With this work, we investigate the use of large language models (LLMs) to support event log extraction, particularly by leveraging LLMs ability to produce SQL scripts. In this paper, we report on how effectively an LLM can assist with event log extraction for process mining. Despite the intrinsic non-deterministic nature of LLMs, our results show the potential of future LLM-assisted event log extraction tools, especially when domain and data knowledge are available. The implementation of such tools can increase access to event log extraction to a broader range of users within an organization by reducing the reliance on specialized technical skills for producing relational database query scripts and minimizing manual effort.
| Original language | English |
|---|---|
| Title of host publication | 30th International Conference on Cooperative Information Systems (CoopIS 2024) |
| Publisher | Springer |
| Pages | 56-72 |
| Number of pages | 17 |
| ISBN (Electronic) | 978-3-031-81375-7 |
| ISBN (Print) | 978-3-031-81374-0 |
| DOIs | |
| Publication status | Published - 14 Feb 2025 |
Publication series
| Name | Lecture Notes in Computer Science |
|---|---|
| Volume | 15506 |
| ISSN (Print) | 0302-9743 |
| ISSN (Electronic) | 1611-3349 |
Bibliographical note
Publisher Copyright:© The Author(s), under exclusive license to Springer Nature Switzerland AG 2025.