Top-κ Item identification on dynamic and distributed datasets

Research output: Chapter in Book/Report/Conference proceedingConference contributionAcademicpeer-review

Abstract

The problem of identifying the most frequent items across multiple datasets has received considerable attention over the last few years. When storage is a scarce resource, the topic is already a challenge; yet, its complexity may be further exacerbated not only by the many independent data sources, but also by the dynamism of the data, i.e., the fact that new items may appear and old ones disappear at any time. In this work, we provide a novel approach to the problem by using an existing gossip-based algorithm for identifying the k most frequent items over a distributed collection of datasets, in ways that deal with the dynamic nature of the data. The algorithm has been thoroughly analyzed through trace-based simulations and compared to state-of-the-art decentralized solutions, showing better precision at reduced communication overhead.

Original languageEnglish
Title of host publicationEuro-Par 2014
Subtitle of host publicationParallel Processing - 20th International Conference, Proceedings
EditorsFernando Silva, Inês Dutra, Vítor Santos Costa
PublisherSpringer
Pages270-281
Number of pages12
ISBN (Electronic)9783319098722
ISBN (Print)9783319098722
DOIs
Publication statusPublished - 1 Jan 2014
Event20th International Conference on Parallel Processing, Euro-Par 2014 - Porto, Portugal
Duration: 25 Aug 201429 Aug 2014

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume8632 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference20th International Conference on Parallel Processing, Euro-Par 2014
Country/TerritoryPortugal
CityPorto
Period25/08/1429/08/14

Fingerprint

Dive into the research topics of 'Top-κ Item identification on dynamic and distributed datasets'. Together they form a unique fingerprint.

Cite this