Skip to main navigation Skip to search Skip to main content

From Prompt to Process: Event Log Extraction From Relational Databases Using Large Language Models

  • Uitvoeringsinstituut Werknemersverzekeringen (UWV)
  • Kühne Logistics University

Research output: Contribution to journalArticleAcademicpeer-review

Abstract

Process mining enables organizations to discover, monitor, and analyze their work processes based on data. A fundamental requirement for initiating a process mining project is the availability of an event log, which is not always readily available. In such cases, extracting an event log typically involves various time-consuming tasks, such as writing custom structured query language (SQL) scripts to extract relevant data into an event log format from a relational database. In this work, we explore the potential of large language models (LLMs) to support event log extraction for process mining by leveraging LLMs’ ability to produce SQL scripts. We evaluate the effectiveness of LLMs in assisting this process and analyze their performance across a range of scenarios. Despite the inherent non-determinism of LLM outputs, our findings highlight the potential of future LLM-assisted tools in automating event log extraction, particularly when provided with the appropriate domain and data knowledge context. The implementation of such tools could democratize access to process mining by reducing the need for specialized technical expertise for producing relational database query scripts and minimizing manual effort.

Original languageEnglish
Article number2650001
JournalInternational Journal of Cooperative Information Systems
DOIs
Publication statusE-pub ahead of print - 25 Mar 2026

Bibliographical note

Publisher Copyright:
© 2026 World Scientific Publishing Company.

Keywords

  • assistance
  • event log extraction
  • large language model
  • LLM
  • Process mining
  • prompt engineering
  • relational database

Fingerprint

Dive into the research topics of 'From Prompt to Process: Event Log Extraction From Relational Databases Using Large Language Models'. Together they form a unique fingerprint.

Cite this