EPS2: Privacy Preserving Set-Valued Data Analysis in the Shuffle Model

Leixia Wang, Qingqing Ye, Haibo Hu, Xiaofeng Meng

Research output: Journal article publicationJournal articleAcademic researchpeer-review

1 Citation (Scopus)

Abstract

Collecting and analyzing users' set-valued data with privacy-preserving is a common scenario in real life. However, the existing solutions in LDP are not efficient enough, where users perturbing their data locally introduces a large amount of noise. The shuffle model, which adds a shuffler in LDP to shuffle all perturbed values, can amplify privacy, then improve utility. Inspired by this, we study the frequency estimation and top- frequent item estimation of set-valued data in the shuffle model. To solve the challenges of different item quantities of users and further improve the utility, we combine sampling and shuffling together, and propose the framework, i.e., EPS. Based on this framework, we propose three protocols for frequency estimation in different application scenarios, then assemble them into multi-phase protocols for the top- frequent item estimation. Theoretically, we identify all three protocols gain dual privacy amplification from sampling and shuffling. And by setting the size of users' set to 1, we can extend this amplified bound to the single-valued frequency estimation scenario, producing a tighter privacy bound than existing works. Finally, we perform experiments on both synthetic and real-world datasets to demonstrate the effectiveness of our protocols. IEEE

Original languageEnglish
Article number10352643
Pages (from-to)1-14
Number of pages14
JournalIEEE Transactions on Knowledge and Data Engineering
DOIs
Publication statusPublished - Dec 2023

Keywords

  • Analytical models
  • Differential privacy
  • Encoding
  • Estimation
  • Frequency estimation
  • frequency estimation
  • Perturbation methods
  • Privacy
  • Protocols
  • sampling
  • shuffling

ASJC Scopus subject areas

  • Information Systems
  • Computer Science Applications
  • Computational Theory and Mathematics

Fingerprint

Dive into the research topics of 'EPS2: Privacy Preserving Set-Valued Data Analysis in the Shuffle Model'. Together they form a unique fingerprint.

Cite this