Reinforcement learning for optimizing responses in care processes

  • Olusanmi A. Hundogan
  • , Bart J. Verhoef
  • , Patrick Theeven
  • , Hajo A. Reijers
  • , Xixi Lu*
  • *Corresponding author for this work

Research output: Contribution to journalArticleAcademicpeer-review

Abstract

Prescriptive process monitoring aims to derive recommendations for optimizing complex processes. While previous studies have successfully used reinforcement learning techniques to derive actionable policies in business processes, care processes present unique challenges due to their dynamic and multifaceted nature. For example, at any stage of a care process, a multitude of actions is possible. In this study, we follow the Reinforcement Learning (RL) approach and present a general approach that uses event data to build and train Markov decision processes. We proposed three algorithms including one that takes the elapsed time into account when transforming an event log into a semi-Markov decision process. We evaluated the RL approach using an aggression incident data set. Specifically, the goal is to optimize staff member actions when clients are displaying different types of aggressive behavior. The Q-learning and SARSA are used to find optimal policies. Our results showed that the derived policies align closely with current practices while offering alternative options in specific situations. By employing RL in the context of care processes, we contribute to the ongoing efforts to enhance decision-making and efficiency in dynamic and complex environments.

Original languageEnglish
Article number102412
Pages (from-to)1-21
Number of pages21
JournalData and Knowledge Engineering
Volume157
DOIs
Publication statusPublished - May 2025

Bibliographical note

Publisher Copyright:
© 2025

Funding

This research was supported by the NWO TACTICS project ( 628.011.004 ) and Lunet in the Netherlands. We would like to thank the experts from Lunet for their assistance. We also thank Dr. Shihan Wang and Dr. Ronald Poppe for the invaluable discussions that helped shape this paper.

FundersFunder number
Nederlandse Organisatie voor Wetenschappelijk Onderzoek628.011.004

    Keywords

    • Markov decision process
    • Prescriptive process monitoring
    • Process mining
    • Process optimization
    • Reinforcement learning
    • Semi-Markov decision process

    Fingerprint

    Dive into the research topics of 'Reinforcement learning for optimizing responses in care processes'. Together they form a unique fingerprint.

    Cite this