PriPL-Tree: Accurate Range Query for Arbitrary Distribution under Local Differential Privacy

Leixia Wang, Qingqing Ye, Haibo Hu, Xiaofeng Meng

Research output: Journal article publicationConference articleAcademic researchpeer-review

Abstract

Answering range queries in the context of Local Differential Privacy (LDP) is a widely studied problem in Online Analytical Processing (OLAP). Existing LDP solutions all assume a uniform data distribution within each domain partition, which may not align with real-world scenarios where data distribution is varied, resulting in inaccurate estimates. To address this problem, we introduce PriPL-Tree, a novel data structure that combines hierarchical tree structures with piecewise linear (PL) functions to answer range queries for arbitrary distributions. PriPL-Tree precisely models the underlying data distribution with a few line segments, leading to more accurate results for range queries. Furthermore, we extend it to multi-dimensional cases with novel data-aware adaptive grids. These grids leverage the insights from marginal distributions obtained through PriPL-Trees to partition the grids adaptively, adapting the density of underlying distributions. Our extensive experiments on both real and synthetic datasets demonstrate the effective ness and superiority of PriPL-Tree over state-of-the-art solutions in answering range queries across arbitrary data distributions.

Original languageEnglish
Pages (from-to)3031-3044
Number of pages14
JournalProceedings of the VLDB Endowment
Volume17
Issue number11
DOIs
Publication statusPublished - Aug 2024
Event50th International Conference on Very Large Data Bases, VLDB 2024 - Guangzhou, China
Duration: 25 Aug 202429 Aug 2024

ASJC Scopus subject areas

  • Computer Science (miscellaneous)
  • General Computer Science

Fingerprint

Dive into the research topics of 'PriPL-Tree: Accurate Range Query for Arbitrary Distribution under Local Differential Privacy'. Together they form a unique fingerprint.

Cite this