TY - JOUR
T1 - Machine learning vs. rule-based methods for document classification of electronic health records within mental health care
T2 - A systematic literature review
AU - Rijcken, Emil
AU - Zervanou, Kalliopi
AU - Mosteiro, Pablo
AU - Scheepers, Floortje
AU - Spruit, Marco
AU - Kaymak, Uzay
PY - 2025/3
Y1 - 2025/3
N2 - Document classification is a widely used task for analyzing mental healthcare texts. This systematic literature review focuses on the document classification of electronic health records in mental healthcare. Over the last decade, there has been a shift from rule-based to machine-learning methods. Despite this shift, no systematic comparison of these two approaches exists for mental healthcare applications. This review examines the evolution, applications, and performance of these methods over time. We find that for most of the last decade, rule-based methods have outperformed machine-learning approaches. However, with the development of more advanced machine-learning techniques, performance has improved. In particular, Transformer-based models enable machine-learning approaches to outperform rule-based methods for the first time.
AB - Document classification is a widely used task for analyzing mental healthcare texts. This systematic literature review focuses on the document classification of electronic health records in mental healthcare. Over the last decade, there has been a shift from rule-based to machine-learning methods. Despite this shift, no systematic comparison of these two approaches exists for mental healthcare applications. This review examines the evolution, applications, and performance of these methods over time. We find that for most of the last decade, rule-based methods have outperformed machine-learning approaches. However, with the development of more advanced machine-learning techniques, performance has improved. In particular, Transformer-based models enable machine-learning approaches to outperform rule-based methods for the first time.
KW - Document classification
KW - Natural language processing
KW - Electronic health records
KW - Mental healthcare
KW - Machine learning
KW - Rule-based methods
U2 - 10.1016/j.nlp.2025.100129
DO - 10.1016/j.nlp.2025.100129
M3 - Article
SN - 2949-7191
VL - 10
JO - Natural Language Processing
JF - Natural Language Processing
M1 - 100129
ER -