EQUALITY: Quality-aware intensive analytics on the edge

Anna Valentini Michailidou, Anastasios Gounaris, Moysis Symeonides, Demetris Trihinas

    Research output: Contribution to journalArticlepeer-review

    Abstract

    Our work is motivated by the fact that there is an increasing need to perform complex analytics jobs over streaming data as close to the edge devices as possible and, in parallel, it is important that data quality is considered as an optimization objective along with performance metrics. In this work, we develop a solution that trades latency for an increased fraction of incoming data, for which data quality-related measurements and operations are performed, in jobs running over geo-distributed heterogeneous and constrained resources. Our solution is hybrid: on the one hand, we perform search heuristics over locally optimal partial solutions to yield an enhanced global solution regarding task allocations; on the other hand, we employ a spring relaxation algorithm to avoid unnecessarily increased degree of partitioned parallelism. Through thorough experiments, we show that we can improve upon state-of-the-art solutions in terms of our objective function that combines latency and extent of quality checks by up to 2.56X. Moreover, we implement our solution within Apache Storm, and we perform experiments in an emulated setting. The results show that we can reduce the latency in 86.9% of the cases examined, while latency is up to 8 times lower compared to the built-in Storm scheduler, with the average latency reduction being 52.5%.

    Original languageEnglish
    Article number101953
    JournalInformation Systems
    Volume105
    DOIs
    Publication statusPublished - Mar 2022

    Keywords

    • Data quality
    • Fog computing
    • Optimization
    • Sensors

    Fingerprint

    Dive into the research topics of 'EQUALITY: Quality-aware intensive analytics on the edge'. Together they form a unique fingerprint.

    Cite this