Skip to main navigation Skip to search Skip to main content

Local Differential Privacy for data collection and analysis

  • Beijing University of Posts and Telecommunications
  • Nanyang Technological University
  • Xi'an Jiaotong University

Research output: Contribution to journalArticlepeer-review

36 Scopus citations

Abstract

Local Differential Privacy (LDP) can provide each user with strong privacy guarantees under untrusted data curators while ensuring accurate statistics derived from privatized data. Due to its powerfulness, LDP has been widely adopted to protect privacy in various tasks (e.g., heavy hitters discovery, probability estimation) and systems (e.g., Google Chrome, Apple iOS). In particular, (∊,δ)-LDP has been studied in related statistical tasks like private learning and hypothesis testing, but is mainly achieved by using Gaussian mechanism, leading to the limited data utility. In this paper, we investigate several novel mechanisms that achieve (∊,δ)-LDP with higher data utility in collecting and analyzing users’ data. Specifically, we first design two (∊,δ)-LDP algorithms for mean estimations on multi-dimensional numeric data, which can ensure higher accuracy than the optimal Gaussian mechanism. Then, we investigate different local protocols for frequency estimations on categorical attributes under (∊,δ)-LDP. Based on the proposed mechanisms, we further study on (∊,δ)-LDP-compliant stochastic gradient descent algorithms for machine learning models. Besides, the theoretical analysis of the error bound and the variance of the proposed algorithms are also presented in the paper. We have conducted extensive experiments on both real-world and synthetic datasets and demonstrated the high data utility of our proposed algorithms in the perspectives of simple data statistics tasks and complex machine learning tasks. The experimental results have shown that our proposed algorithms can effectively improve the data utility in different tasks while alleviating the privacy concerns of each individual.

Original languageEnglish
Pages (from-to)114-133
Number of pages20
JournalNeurocomputing
Volume426
DOIs
StatePublished - 22 Feb 2021

Keywords

  • (∊,δ)-local differential privacy
  • Data collection and analysis
  • Data utility
  • Stochastic gradient descent

Fingerprint

Dive into the research topics of 'Local Differential Privacy for data collection and analysis'. Together they form a unique fingerprint.

Cite this