Skip to main navigation Skip to search Skip to main content

Unsupervised ontology acquisition from plain texts: The OntoGain system

  • Euthymios Drymonas*
  • , Kalliopi Zervanou
  • , Euripides G.M. Petrakis
  • *Corresponding author for this work
  • Technical University of Crete

Research output: Chapter in Book/Report/Conference proceedingConference contributionAcademicpeer-review

Abstract

We propose OntoGain, a system for unsupervised ontology acquisition from unstructured text which relies on multi-word term extraction. For the acquisition of taxonomic relations, we exploit inherent multi-word terms' lexical information in a comparative implementation of agglomerative hierarchical clustering and formal concept analysis methods. For the detection of non-taxonomic relations, we comparatively investigate in OntoGain an association rules based algorithm and a probabilistic algorithm. The OntoGain system allows for transformation of the derived ontology into standard OWL statements. OntoGain results are compared to both hand-crafted ontologies, as well as to a state-of-the art system, in two different domains: the medical and computer science domains.

Original languageEnglish
Title of host publicationNatural Language Processing and Information Systems - 15th International Conference on Applications of Natural Language to Information Systems, NLDB 2010, Proceedings
Pages277-287
Number of pages11
DOIs
Publication statusPublished - 2010
Event15th International Conference on Applications of Natural Language to Information Systems, NLDB 2010 - Cardiff, United Kingdom
Duration: 23 Jun 201025 Jun 2010

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume6177 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference15th International Conference on Applications of Natural Language to Information Systems, NLDB 2010
Country/TerritoryUnited Kingdom
CityCardiff
Period23/06/1025/06/10

Keywords

  • association rules
  • formal concept analysis
  • multi-word terms
  • ontology acquisition
  • OWL
  • term clustering
  • term extraction
  • term similarity

Fingerprint

Dive into the research topics of 'Unsupervised ontology acquisition from plain texts: The OntoGain system'. Together they form a unique fingerprint.

Cite this