A cluster-based intelligence ensemble learning method for classification problems

Shaoze Cui, Yanzhang Wang, Yunqiang Yin, T. C.E. Cheng, Dujuan Wang, Mingyu Zhai

Research output: Journal article publicationJournal articleAcademic researchpeer-review

1 Citation (Scopus)

Abstract

Classification is a vital task in machine learning. By learning patterns of samples of known categories, the model can develop the ability to distinguish the categories of samples of unknown categories. Noticing the advantages of the clustering method in cluster structure analysis, we combine the clustering and classification methods to develop the novel cluster-based intelligence ensemble learning (CIEL) method. We use the clustering method to analyze the inherent distribution of the data and divide all the samples into clusters according to the characteristics of the dataset. Then, for each specific cluster, we use different classification algorithms to establish the corresponding classification model. Finally, we integrate the prediction results of each base classifier to form the final prediction result. In view of the problem of parameter sensitivity, we use a swarm intelligence algorithm to optimize the key parameters involved in the clustering, classification, and ensemble stages in order to boost the classification performance. To assess the effectiveness of CIEL, we perform tenfold cross-validation experiments on the 24 benchmark datasets provided by UCI and KEEL. Designed to improve the performance of the classifiers, CIEL outperforms other popular machine learning methods such as naive Bayes, k-nearest neighbors, random forest, and support vector machine.

Original languageEnglish
Pages (from-to)386-409
Number of pages24
JournalInformation Sciences
Volume560
DOIs
Publication statusPublished - Jun 2021

Keywords

  • Classification algorithm
  • Clustering algorithm
  • Combination strategy
  • Ensemble learning
  • Swarm intelligence algorithm

ASJC Scopus subject areas

  • Software
  • Control and Systems Engineering
  • Theoretical Computer Science
  • Computer Science Applications
  • Information Systems and Management
  • Artificial Intelligence

Cite this