A novel privacy-preserving probability transductive classifiers from group probabilities based on regression model

Yizhang Jiang, Zhaohong Deng, Kup Sze Choi, Pengjiang Qian, Wenjun Hu, Shitong Wang

Research output: Journal article publication › Journal article › Academic research › peer-review

1 Citation (Scopus)

Abstract

Group probability classifier learning is an emerging and promising learning technique, especially in privacy-preserving data mining. It trains a classifier from a group probability dataset, in which the class label of each sample is unknown but the probability of each class within each given data group is available. Existing work relies mainly on the inverse calibration (IC) strategy to estimate labels for the data in the group probability dataset and then uses a classical classification algorithm, such as the support vector machine (SVM), to train the desired classifier. A critical challenge for the existing IC-based methods is that an ideal IC function for label estimation is difficult to design, and the methods are sensitive to the IC function adopted. To overcome this shortcoming, a novel probability transductive classifier that does not involve IC in the learning procedure is proposed, in which the probability values are used directly as the outputs of the training data for model training. In particular, because the outputs of the training data are continuous real values, existing classical regression models can readily be adopted to model the group probability classification problem. For a future test sample, the output of the learned group probability classification model gives the probability that the sample belongs to the positive class. With a given threshold, the final class label of the test sample can then be obtained for the classification task. Experimental results on synthetic datasets and real UCI datasets show that the proposed method is more effective than the existing methods.
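The idea described in the abstract, regressing directly on group probabilities and then thresholding, can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not say which regression model is used, so plain ridge regression stands in for it here. The function names, the regularization strength, the 0.5 threshold, and the synthetic two-group data are all assumptions made for illustration.

```python
import numpy as np


def fit_group_probability_regressor(X, group_ids, group_probs, ridge=1e-2):
    """Fit ridge regression where each sample's target is the
    positive-class probability of the group it belongs to.

    Assumption: ridge regression is a stand-in for the unspecified
    regression model; group_probs[g] is the positive-class probability
    of group g.
    """
    X = np.asarray(X, dtype=float)
    # Every sample inherits the probability of its group as a real-valued target.
    y = np.asarray(group_probs, dtype=float)[np.asarray(group_ids)]
    # Append a constant column so the model has a bias term
    # (the bias is regularized too, for simplicity).
    Xb = np.hstack([X, np.ones((X.shape[0], 1))])
    # Closed-form ridge solution: solve (Xb^T Xb + ridge * I) w = Xb^T y.
    A = Xb.T @ Xb + ridge * np.eye(Xb.shape[1])
    return np.linalg.solve(A, Xb.T @ y)


def predict_probability(w, X):
    """Model output, clipped to [0, 1] so it can be read as a probability."""
    X = np.asarray(X, dtype=float)
    Xb = np.hstack([X, np.ones((X.shape[0], 1))])
    return np.clip(Xb @ w, 0.0, 1.0)


def predict_label(w, X, threshold=0.5):
    """Threshold the estimated probability to get a label in {+1, -1}."""
    return np.where(predict_probability(w, X) >= threshold, 1, -1)


# Hypothetical synthetic data: two Gaussian classes, two groups.
rng = np.random.default_rng(0)


def make_samples(n_pos, n_neg):
    X = np.vstack([rng.normal(2.0, 0.5, (n_pos, 2)),
                   rng.normal(-2.0, 0.5, (n_neg, 2))])
    y = np.concatenate([np.ones(n_pos), -np.ones(n_neg)])
    return X, y


# Group 0 is 80% positive, group 1 is 20% positive.
# The true training labels are discarded; only group proportions are used.
X0, _ = make_samples(80, 20)
X1, _ = make_samples(20, 80)
X_train = np.vstack([X0, X1])
group_ids = np.repeat([0, 1], 100)
group_probs = [0.8, 0.2]

w = fit_group_probability_regressor(X_train, group_ids, group_probs)

# Evaluate on fresh samples whose true labels are known.
X_test, y_test = make_samples(50, 50)
probs = predict_probability(w, X_test)
accuracy = float(np.mean(predict_label(w, X_test) == y_test))
```

In this sketch no individual training label is ever seen by the learner; the only supervision is the per-group positive proportion, which is the setting the abstract describes. The threshold is a free parameter, and 0.5 is simply a convenient default for this toy data.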
Original language: English
Pages (from-to): 917-925
Number of pages: 9
Journal: Journal of Intelligent and Fuzzy Systems
Volume: 29
Issue number: 2
Publication status: Published - 1 Jan 2015

Keywords

  • classification
  • group probability
  • Privacy preserving
  • probability transductive
  • regression model

ASJC Scopus subject areas

  • Statistics and Probability
  • Engineering(all)
  • Artificial Intelligence
