TY - GEN
T1 - Unsupervised ontology acquisition from plain texts
T2 - 15th International Conference on Applications of Natural Language to Information Systems, NLDB 2010
AU - Drymonas, Euthymios
AU - Zervanou, Kalliopi
AU - Petrakis, Euripides G.M.
PY - 2010
Y1 - 2010
N2 - We propose OntoGain, a system for unsupervised ontology acquisition from unstructured text which relies on multi-word term extraction. For the acquisition of taxonomic relations, we exploit inherent multi-word terms' lexical information in a comparative implementation of agglomerative hierarchical clustering and formal concept analysis methods. For the detection of non-taxonomic relations, we comparatively investigate in OntoGain an association rules based algorithm and a probabilistic algorithm. The OntoGain system allows for transformation of the derived ontology into standard OWL statements. OntoGain results are compared to both hand-crafted ontologies, as well as to a state-of-the art system, in two different domains: the medical and computer science domains.
AB - We propose OntoGain, a system for unsupervised ontology acquisition from unstructured text which relies on multi-word term extraction. For the acquisition of taxonomic relations, we exploit inherent multi-word terms' lexical information in a comparative implementation of agglomerative hierarchical clustering and formal concept analysis methods. For the detection of non-taxonomic relations, we comparatively investigate in OntoGain an association rules based algorithm and a probabilistic algorithm. The OntoGain system allows for transformation of the derived ontology into standard OWL statements. OntoGain results are compared to both hand-crafted ontologies, as well as to a state-of-the art system, in two different domains: the medical and computer science domains.
KW - association rules
KW - formal concept analysis
KW - multi-word terms
KW - ontology acquisition
KW - OWL
KW - term clustering
KW - term extraction
KW - term similarity
UR - https://www.scopus.com/pages/publications/77955457950
U2 - 10.1007/978-3-642-13881-2_29
DO - 10.1007/978-3-642-13881-2_29
M3 - Conference contribution
AN - SCOPUS:77955457950
SN - 3642138802
SN - 9783642138805
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 277
EP - 287
BT - Natural Language Processing and Information Systems - 15th International Conference on Applications of Natural Language to Information Systems, NLDB 2010, Proceedings
Y2 - 23 June 2010 through 25 June 2010
ER -