Abstract
The Web has been flooded with highly heterogeneous data sources that freely offer their data to the public. Careful design and compliance to standards is a way to cope with the heterogeneity. However, any agreement and compliance is practically hard to achieve across different communities. In this work we describe a framework that enables the exploitation of content across different scientific disciplines. Our approach combines several novel techniques at the syntactic, structural and semantic level. In particular, we advocate that integration should take place at the much higher level, factoring out any syntactic discrepancies, and facilitating the exchange of information. We show how a novel technique for data annotation using intentional attributes can cope with data associations in high data volumes, we present a way to overcome the multilingualism barrier, and describe a new kind of database that considers data evolution as first class citizen with the additional ability to annotate free text.
Original language | English |
---|---|
Title of host publication | Proceedings - 21st International Workshop on Database and Expert Systems Applications, DEXA 2010 |
Pages | 305-309 |
Number of pages | 5 |
DOIs | |
Publication status | Published - 24 Nov 2010 |
Event | 21st International Workshop on Database and Expert Systems Applications, DEXA 2010 - Bilbao, Spain Duration: 30 Aug 2010 → 3 Sept 2010 |
Conference
Conference | 21st International Workshop on Database and Expert Systems Applications, DEXA 2010 |
---|---|
Country/Territory | Spain |
City | Bilbao |
Period | 30/08/10 → 3/09/10 |