Entity-based keyword search in web documents

Enrico Sartori, Yannis Velegrakis*, Francesco Guerra

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceeding; Conference contribution; Academic; peer-review

Abstract

In document search, documents are typically seen as a flat list of keywords. To deal with syntactic interoperability, i.e., the use of different keywords to refer to the same real-world entity, entity linkage has been used to replace keywords in the text with a unique identifier of the entity to which they refer. Yet the flat list of entities fails to capture the actual relationships that exist among the entities, information that is significant for more effective document search. In this work we propose to go one step beyond entity linkage in text and model documents as a set of structures that describe relationships among the entities mentioned in the text. We show that this kind of representation significantly improves the effectiveness of document search. We describe the implementation details of this idea and present an extensive set of experimental results that prove our point.

Original language: English
Title of host publication: Transactions on Computational Collective Intelligence XXI - Special Issue on Keyword Search and Big Data
Editors: Ngoc Thanh Nguyen, Paulo Rupino da Cunha, Ryszard Kowalczyk
Publisher: Springer
Pages: 21-49
Number of pages: 29
ISBN (Print): 9783662495209
Publication status: Published - 1 Jan 2016
Event: 8th International Conference on Computational Collective Intelligence, ICCCI 2016 - Halkidiki, Greece
Duration: 28 Sept 2016 to 30 Sept 2016

Publication series

Name: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume: 9630
ISSN (Print): 0302-9743
ISSN (Electronic): 1611-3349

Conference

Conference: 8th International Conference on Computational Collective Intelligence, ICCCI 2016
Country/Territory: Greece
City: Halkidiki
Period: 28/09/16 to 30/09/16
