Machine learning vs. rule-based methods for document classification of electronic health records within mental health care: A systematic literature review

Emil Rijcken*, Kalliopi Zervanou, Pablo Mosteiro, Floortje Scheepers, Marco Spruit, Uzay Kaymak

*Corresponding author for this work

Research output: Contribution to journalArticleAcademicpeer-review

Abstract

Document classification is a widely used task for analyzing mental healthcare texts. This systematic literature review focuses on the document classification of electronic health records in mental healthcare. Over the last decade, there has been a shift from rule-based to machine-learning methods. Despite this shift, no systematic comparison of these two approaches exists for mental healthcare applications. This review examines the evolution, applications, and performance of these methods over time. We find that for most of the last decade, rule-based methods have outperformed machine-learning approaches. However, with the development of more advanced machine-learning techniques, performance has improved. In particular, Transformer-based models enable machine learning approaches to outperform rule-based methods for the first time.
Original languageEnglish
Article number100129
JournalNatural Language Processing
Volume10
Early online date25 Jan 2025
DOIs
Publication statusPublished - Mar 2025

Keywords

  • Document classification
  • Natural language processing
  • Electronic health records
  • Mental healthcare
  • Machine learning
  • Rule-based methods

Fingerprint

Dive into the research topics of 'Machine learning vs. rule-based methods for document classification of electronic health records within mental health care: A systematic literature review'. Together they form a unique fingerprint.

Cite this