Transparent data deduplication in the cloud

Frederik Armknecht, Jens Matthias Bohli, Ghassan O. Karame, Franck Youssef

    Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

    Abstract

    Cloud storage providers such as Dropbox and Google drive heavily rely on data deduplication to save storage costs by only storing one copy of each uploaded file. Although recent studies report that whole file deduplication can achieve up to 50% storage reduction, users do not directly benefit from these savings-as there is no transparent relation between effective storage costs and the prices offered to the users. In this paper, we propose a novel storage solution, ClearBox, which allows a storage service provider to transparently attest to its customers the deduplication patterns of the (encrypted) data that it is storing. By doing so, ClearBox enables cloud users to verify the effective storage space that their data is occupying in the cloud, and consequently to check whether they qualify for benefits such as price reductions, etc. ClearBox is secure against malicious users and a rational storage provider, and ensures that files can only be accessed by their legitimate flowners. We evaluate a prototype implementation of ClearBox using both Amazon S3 and Dropbox as back-end cloud storage. Our findings show that our solution works with the APIs provided by existing service providers without any modifications and achieves comparable performance to existing solutions.

    Original languageEnglish
    Title of host publicationCCS 2015 - Proceedings of the 22nd ACM SIGSAC Conference on Computer and Communications Security
    PublisherAssociation for Computing Machinery
    Pages886-900
    Number of pages15
    Volume2015-October
    ISBN (Electronic)9781450338325
    DOIs
    Publication statusPublished - 12 Oct 2015
    Event22nd ACM SIGSAC Conference on Computer and Communications Security, CCS 2015 - Denver, United States
    Duration: 12 Oct 201516 Oct 2015

    Other

    Other22nd ACM SIGSAC Conference on Computer and Communications Security, CCS 2015
    Country/TerritoryUnited States
    CityDenver
    Period12/10/1516/10/15

    Keywords

    • Cloud security
    • Secure data deduplication
    • Transparent attestation of deduplication

    Fingerprint

    Dive into the research topics of 'Transparent data deduplication in the cloud'. Together they form a unique fingerprint.

    Cite this