Abstract
Conventional k-nearest neighbor (KNN) classification approaches have several limitations when dealing with problems caused by special datasets, such as the sparse problem, the imbalance problem, and the noise problem. In this paper, we first present a brief survey of recent progress in KNN classification approaches. Then, the hybrid KNN (HBKNN) classification approach, which takes into account both the local and global information of the query sample, is designed to address the problems raised by these special datasets. Next, a random subspace ensemble framework based on the HBKNN classifier (RS-HBKNN) is proposed to perform classification on datasets with noisy attributes in high-dimensional space. Finally, nonparametric tests are adopted to compare the proposed method with other classification approaches over multiple datasets. Experiments on real-world datasets from the Knowledge Extraction based on Evolutionary Learning dataset repository demonstrate that RS-HBKNN works well on real datasets and outperforms most of the state-of-the-art classification approaches.
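To illustrate the random subspace idea mentioned in the abstract, the following is a minimal sketch in which each ensemble member runs a nearest neighbor vote on a randomly chosen subset of attributes and the members' predictions are combined by majority vote. It is not the paper's method: the abstract does not specify how HBKNN combines local and global information, so a plain majority-vote KNN stands in as the base classifier. The function names `knn_predict` and `random_subspace_knn`, the parameter defaults, and the toy data are all assumptions made for illustration.

```python
import random
from collections import Counter


def knn_predict(train_X, train_y, query, k, features):
    """Plain majority-vote KNN restricted to the given feature indices.

    This is a stand-in base classifier, NOT the HBKNN classifier from the paper.
    """
    dists = []
    for x, label in zip(train_X, train_y):
        # Squared Euclidean distance computed only over the selected attributes.
        d = sum((x[f] - query[f]) ** 2 for f in features)
        dists.append((d, label))
    dists.sort(key=lambda pair: pair[0])
    votes = Counter(label for _, label in dists[:k])
    return votes.most_common(1)[0][0]


def random_subspace_knn(train_X, train_y, query, k=3, n_members=15,
                        subspace_size=2, seed=0):
    """Majority vote over KNN members, each trained on a random attribute subset.

    All parameter names and defaults are hypothetical choices for this sketch.
    """
    rng = random.Random(seed)
    n_features = len(train_X[0])
    member_votes = Counter()
    for _ in range(n_members):
        # Each member sees a different random subspace of the attributes.
        features = rng.sample(range(n_features), subspace_size)
        member_votes[knn_predict(train_X, train_y, query, k, features)] += 1
    return member_votes.most_common(1)[0][0]


if __name__ == "__main__":
    # Toy data (made up): two classes, four attributes.
    X = [[0.0, 0.1, 0.2, 0.0], [0.2, 0.0, 0.1, 0.3], [0.1, 0.3, 0.0, 0.1],
         [10.0, 9.8, 10.1, 9.9], [9.9, 10.2, 10.0, 10.1], [10.1, 10.0, 9.7, 10.0]]
    y = [0, 0, 0, 1, 1, 1]
    print(random_subspace_knn(X, y, [0.1, 0.1, 0.1, 0.1]))
```

Because each member only looks at a few attributes, a noisy attribute can mislead some members without necessarily dominating the ensemble vote, which is the intuition behind using random subspaces on high-dimensional data with noisy attributes.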
Original language | English |
---|---|
Article number | 7137658 |
Pages (from-to) | 1263-1275 |
Number of pages | 13 |
Journal | IEEE Transactions on Cybernetics |
Volume | 46 |
Issue number | 6 |
Publication status | Published - 1 Jun 2016 |
Keywords
- Classification
- ensemble learning
- machine learning
- nearest neighbor classifier
- supervised learning
ASJC Scopus subject areas
- Software
- Control and Systems Engineering
- Information Systems
- Human-Computer Interaction
- Computer Science Applications
- Electrical and Electronic Engineering