Abstract
This paper proposes an ontology learning framework that combines text mining, information extraction and retrieval. The proposed model takes advantage of existing structured knowledge by reusing terms and concepts from other ontologies. We further apply the methodology to create a detailed ontology for the emerging precision medicine (PM) domain by collecting a corpus of relevant articles and mapping its frequent terms to existing concepts. The resulting ontology consists of 543 annotated classes. The ontology was also tested for effectiveness by applying two evaluation frameworks to validate its design and quality. The results demonstrate that the ontology learning system is able to capture and represent the semantics of the PM domain with high precision and significance. Moreover, the computer-assisted construction process reduced dependency on expert knowledge. The developed PreMedOnto ontology could be further used to enhance the potentials of other NLP applications in the PM domain.
Original language | English |
---|---|
Title of host publication | NLDB 2019: International Conference on Applications of Natural Language to Information Systems |
Editors | E. Métais, et al. |
Place of Publication | Cham |
Publisher | Springer |
Pages | 329–336 |
Number of pages | 8 |
Volume | 11608 |
DOIs | |
Publication status | Published - 2019 |
Keywords
- Precision medicine
- Data mining
- Ontology reuse