Copula-Based Multi-Dimensional Crowdsourced Data Synthesis and Release with Local Privacy

Research output: Contribution to journalConference articlepeer-review

8 Scopus citations

Abstract

Various paradigms, based on differential privacy, have been proposed to release a privacy-preserving dataset with statistical approximation. Nonetheless, most existing schemes are limited when facing highly correlated attributes, and cannot prevent privacy threats from untrusted servers. In this paper, we propose a novel Copula- based scheme to efficiently synthesize and release multi-dimensional crowdsourced data with local differential privacy. In our scheme, each participant's (or user's) data is locally transformed into bit strings based on a randomized response technique, which guarantees a participant's privacy on the participant (user) side. Then, Copula theory is leveraged to synthesize multi-dimensional crowdsourced data based on univariate marginal distribution and attribute dependence. Univariate marginal distribution is estimated by the Lasso-based regression algorithm from the aggregated privacy- preserving bit strings. Dependencies among attributes are modeled as multivariate Gaussian Copula, of which parameter is estimated by Pearson correlation coefficients. We conduct experiments to validate the effectiveness of our scheme. Our experimental results demonstrate that our scheme is effective for the release of multi-dimensional data with local differential privacy guaranteed to distributed participants.

Original languageEnglish
Pages (from-to)1-6
Number of pages6
JournalProceedings - IEEE Global Communications Conference, GLOBECOM
Volume2018-January
DOIs
StatePublished - 2017
Event2017 IEEE Global Communications Conference, GLOBECOM 2017 - Singapore, Singapore
Duration: 4 Dec 20178 Dec 2017

Fingerprint

Dive into the research topics of 'Copula-Based Multi-Dimensional Crowdsourced Data Synthesis and Release with Local Privacy'. Together they form a unique fingerprint.

Cite this