Text Mining in Healthcare: Bringing Structure to Electronic Health Records

Research output: ThesisDoctoral thesis 1 (Research UU / Graduation UU)

Abstract

Electronic health records (EHRs) are rich in data with the potential to leverage applications that provide safer care, reduce medical errors, reduce healthcare expenditure, and enable providers to improve their productivity and efficiency. A major portion of this data is inside free text in the form of physicians’ notes, discharge summaries, and radiology reports among other types of clinical narratives. Despite many attempts to encode text in the form of structured data, free text continues to be used in EHRs. Therefore, text mining techniques can be applied to create a more structured representation of a text, making its content more accessible for data science, machine learning and statistics. To this end, this thesis is concentrated on providing solutions for some of the challenges in the analysis of the free text element frequently included in the clinical domain. Text mining techniques have been applied to numerous health applications involving clinical decision support systems, patient identification, disease classification, and the prediction models. Many of these clinical applications depend on some form of text classification. Text classification is a task that consists in automatically assigning a document to a predefined set of labels. The focus of the work described in this thesis is the analysis and classification of the clinical free text found in heath records used in the University Medical Center (UMC) Utrecht.
Original languageEnglish
QualificationDoctor of Philosophy
Awarding Institution
  • Utrecht University
Supervisors/Advisors
  • van der Heijden, Peter, Primary supervisor
  • Asselbergs, Folkert, Supervisor, External person
  • Oberski, Daniel, Co-supervisor
Award date15 Jan 2021
Publisher
Print ISBNs978-94-6416-390-2
DOIs
Publication statusPublished - 15 Jan 2021

Keywords

  • Text Mining
  • Natural Language Processing
  • Data Science
  • Deep Learning
  • Cardiovascular Disease
  • Text Classification
  • ICD Classification
  • Prediction Models
  • Machine Learning
  • Topic Modeling

Fingerprint

Dive into the research topics of 'Text Mining in Healthcare: Bringing Structure to Electronic Health Records'. Together they form a unique fingerprint.

Cite this