TY - GEN
T1 - Multi-Class Item Mining Under Local Differential Privacy
AU - Mao, Yulian
AU - Ye, Qingqing
AU - Du, Rong
AU - Wang, Qi
AU - Huang, Kai
AU - Hu, Haibo
N1 - Publisher Copyright:
© 2025 IEEE.
PY - 2025/8
Y1 - 2025/8
N2 - Item mining, a fundamental task for collecting statistical data from users, has raised increasing privacy concerns. To address these concerns, local differential privacy (LDP) was proposed as a privacy-preserving technique. Existing LDP item mining mechanisms primarily concentrate on global statistics, i.e., those from the entire dataset. Nevertheless, they fall short of usertailored tasks such as personalized recommendations, whereas classwise statistics can improve task accuracy with fine-grained information. Meanwhile, the introduction of class labels brings new challenges. Label perturbation may result in invalid items for aggregation. To this end, we propose frameworks for multi-class item mining, along with two mechanisms: validity perturbation to reduce the impact of invalid data, and correlated perturbation to preserve the relationship between labels and items. We also apply these optimized methods to two multi-class item mining queries: frequency estimation and top-k item mining. Through theoretical analysis and extensive experiments, we verify the effectiveness and superiority of these methods.
AB - Item mining, a fundamental task for collecting statistical data from users, has raised increasing privacy concerns. To address these concerns, local differential privacy (LDP) was proposed as a privacy-preserving technique. Existing LDP item mining mechanisms primarily concentrate on global statistics, i.e., those from the entire dataset. Nevertheless, they fall short of usertailored tasks such as personalized recommendations, whereas classwise statistics can improve task accuracy with fine-grained information. Meanwhile, the introduction of class labels brings new challenges. Label perturbation may result in invalid items for aggregation. To this end, we propose frameworks for multi-class item mining, along with two mechanisms: validity perturbation to reduce the impact of invalid data, and correlated perturbation to preserve the relationship between labels and items. We also apply these optimized methods to two multi-class item mining queries: frequency estimation and top-k item mining. Through theoretical analysis and extensive experiments, we verify the effectiveness and superiority of these methods.
KW - Frequency estimation
KW - Local differential privacy
KW - Multi-class item mining
KW - Top-k item mining
UR - https://www.scopus.com/pages/publications/105015432629
U2 - 10.1109/ICDE65448.2025.00265
DO - 10.1109/ICDE65448.2025.00265
M3 - Conference article published in proceeding or book
AN - SCOPUS:105015432629
T3 - Proceedings - International Conference on Data Engineering
SP - 3549
EP - 3561
BT - Proceedings - 2025 IEEE 41st International Conference on Data Engineering, ICDE 2025
PB - IEEE Computer Society
T2 - 41st IEEE International Conference on Data Engineering, ICDE 2025
Y2 - 19 May 2025 through 23 May 2025
ER -