Automatic Extraction of Adverse Drug Reactions from Summary of Product Characteristics

Zhengru Shen, Marco Spruit

Research output: Contribution to journalArticleAcademicpeer-review

Abstract

The summary of product characteristics from the European Medicines Agency is a reference document on medicines in the EU. It contains textual information for clinical experts on how to safely use medicines, including adverse drug reactions. Using natural language processing (NLP) techniques to automatically extract adverse drug reactions from such unstructured textual information helps clinical experts to effectively and efficiently use them in daily practices. Such techniques have been developed for Structured Product Labels from the Food and Drug Administration (FDA), but there is no research focusing on extracting from the Summary of Product Characteristics. In this work, we built a natural language processing pipeline that automatically scrapes the summary of product characteristics online and then extracts adverse drug reactions from them. Besides, we have made the method and its output publicly available so that it can be reused and further evaluated in clinical practices. In total, we extracted 32,797 common adverse drug reactions for 647 common medicines scraped from the Electronic Medicines Compendium. A manual review of 37 commonly used medicines has indicated a good performance, with a recall and precision of 0.99 and 0.934, respectively.
Original languageEnglish
Article number2663
Pages (from-to)1-11
Number of pages11
JournalApplied Sciences
Volume11
Issue number6
DOIs
Publication statusPublished - 2 Mar 2021

Keywords

  • Adverse drug reactions
  • Information extraction
  • Natural language processing
  • Summary of product characteristics

Fingerprint

Dive into the research topics of 'Automatic Extraction of Adverse Drug Reactions from Summary of Product Characteristics'. Together they form a unique fingerprint.

Cite this