TY - GEN
T1 - Entity-based keyword search in web documents
AU - Sartori, Enrico
AU - Velegrakis, Yannis
AU - Guerra, Francesco
PY - 2016/1/1
Y1 - 2016/1/1
N2 - In document search, documents are typically seen as a flat list of keywords. To deal with syntactic interoperability, i.e., the use of different keywords to refer to the same real-world entity, entity linkage has been used to replace keywords in the text with a unique identifier of the entity to which they refer. Yet the flat list of entities fails to capture the actual relationships that exist among the entities, information that is significant for more effective document search. In this work we propose to go one step beyond entity linkage in text and model documents as a set of structures that describe relationships among the entities mentioned in the text. We show that this kind of representation significantly improves the effectiveness of document search. We describe the details of the implementation of this idea and present an extensive set of experimental results that support our claim.
AB - In document search, documents are typically seen as a flat list of keywords. To deal with syntactic interoperability, i.e., the use of different keywords to refer to the same real-world entity, entity linkage has been used to replace keywords in the text with a unique identifier of the entity to which they refer. Yet the flat list of entities fails to capture the actual relationships that exist among the entities, information that is significant for more effective document search. In this work we propose to go one step beyond entity linkage in text and model documents as a set of structures that describe relationships among the entities mentioned in the text. We show that this kind of representation significantly improves the effectiveness of document search. We describe the details of the implementation of this idea and present an extensive set of experimental results that support our claim.
UR - http://www.scopus.com/inward/record.url?scp=84969242713&partnerID=8YFLogxK
U2 - 10.1007/978-3-662-49521-6_2
DO - 10.1007/978-3-662-49521-6_2
M3 - Conference contribution
AN - SCOPUS:84969242713
SN - 9783662495209
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 21
EP - 49
BT - Transactions on Computational Collective Intelligence XXI - Special Issue on Keyword Search and Big Data
A2 - Nguyen, Ngoc Thanh
A2 - da Cunha, Paulo Rupino
A2 - Kowalczyk, Ryszard
PB - Springer
T2 - 8th International Conference on Computational Collective Intelligence, ICCCI 2016
Y2 - 28 September 2016 through 30 September 2016
ER -