TY - JOUR
T1 - Ensemble-learning based neural networks for novelty detection in multi-class systems
AU - Chan, Tung Sun
AU - Wang, Z.X.
AU - Patnaik, Sovit
AU - Tiwari, M.K.
AU - Wang, X.P.
AU - Ruan, J.H
PY - 2020/8
Y1 - 2020/8
N2 - In most real-world systems or processes, determining the complete set of classes during the training phase is generally impossible. There is a high chance that novelties or abnormal data can appear in future phases which might severely affect the performance of the machine learning system. Novelty detection is of great importance in many critical systems and domains, such as business intelligence, process monitoring, information security, clinical decision support etc. Most of the available methods for novelty detection use a one-class classification (OCC) criterion, i.e. treating multiple known classes as a single ”Normal” class, whose aim is to distinguish data samples between “Normal” and “Not Normal” classes. In this paper, the problem of novelty detection in multi-class systems is addressed through ensemble based learning of neural networks (EBNN), capable of both detecting novelties and classifying the known normal samples in future datasets. Moreover, the model is analogous to the semi-supervised learning system as it is trained using only the available normal classes. Evaluation of the proposed model (EBNN) on UCI machine learning datasets showed that the model not only outperforms other models in detecting novelties but also has a better multi-class classification accuracy for known normal classes. The proposed model implements a novel activation function in its framework and differs from the commonly available novelty detection models in three aspects. First, the model is much simpler to implement and does not need any initial assumptions about the model. Second, the model does not require any novel or abnormal data during training phase (semi-supervised learning). Third, it can be used as a two in one system to detect novelties and at the same time to classify data based on known classes.
AB - In most real-world systems or processes, determining the complete set of classes during the training phase is generally impossible. There is a high chance that novelties or abnormal data can appear in future phases which might severely affect the performance of the machine learning system. Novelty detection is of great importance in many critical systems and domains, such as business intelligence, process monitoring, information security, clinical decision support etc. Most of the available methods for novelty detection use a one-class classification (OCC) criterion, i.e. treating multiple known classes as a single ”Normal” class, whose aim is to distinguish data samples between “Normal” and “Not Normal” classes. In this paper, the problem of novelty detection in multi-class systems is addressed through ensemble based learning of neural networks (EBNN), capable of both detecting novelties and classifying the known normal samples in future datasets. Moreover, the model is analogous to the semi-supervised learning system as it is trained using only the available normal classes. Evaluation of the proposed model (EBNN) on UCI machine learning datasets showed that the model not only outperforms other models in detecting novelties but also has a better multi-class classification accuracy for known normal classes. The proposed model implements a novel activation function in its framework and differs from the commonly available novelty detection models in three aspects. First, the model is much simpler to implement and does not need any initial assumptions about the model. Second, the model does not require any novel or abnormal data during training phase (semi-supervised learning). Third, it can be used as a two in one system to detect novelties and at the same time to classify data based on known classes.
KW - Novelty detection
KW - Neural networks
KW - Ensemble-learning
KW - Posterior class probability
KW - Confidence intervals
UR - https://www.sciencedirect.com/science/article/pii/S1568494620303367
U2 - 10.1016/j.asoc.2020.106396
DO - 10.1016/j.asoc.2020.106396
M3 - Journal article
SN - 1568-4946
VL - 93
JO - Applied Soft Computing
JF - Applied Soft Computing
M1 - 106396
ER -