Abstract
Local differential privacy (LDP) is a promising privacy model for data collection that protects sensitive information of individuals. However, applying LDP to top-k estimation in set-valued data (e.g., identifying most frequent k items) may yield poor results for small and sparse datasets due to high sensitivity and heavy perturbation. To address this, we propose an adaptive approach that frames the problem as a multi-armed bandit (MAB) problem, in which the decision-maker selects actions based on information collected from previous rounds to maximize the total reward over time. Inspired by this, we present two adaptive sampling schemes based on MAB: ARBS for identifying top-k items and ARBSF for both top-k item discovery and frequency estimation on these items. Furthermore, to address the potential long delay of multi-round collection, we propose an optimization technique to reduce the time complexity. Both theoretical and empirical results show that our adaptive sampling schemes significantly outperform existing alternatives.
| Original language | English |
|---|---|
| Article number | 10700950 |
| Pages (from-to) | 1763-1780 |
| Journal | IEEE Transactions on Dependable and Secure Computing |
| DOIs | |
| Publication status | Published - Sept 2024 |
Keywords
- Local Differential Privacy
- Multi-armed Bandit
- Top-k Estimation
ASJC Scopus subject areas
- General Computer Science
- Electrical and Electronic Engineering