Abstract
As one of the most popular clustering techniques for high dimensional data, soft subspace clustering (SSC) algorithms have been receiving a great deal of attention in recent years. Unfortunately, most existing works do not cluster high dimensional sparse data and noisy data in an effective manner. In this study, a novel soft subspace clustering algorithm called PI-SSC is proposed. By introducing a partition index (PI) into the objective function, a novel soft subspace clustering algorithm that combines the concepts of hard and fuzzy clustering is proposed. Furthermore, the robust property of PI-SSC is analyzed from the viewpoint of ε-insensitive distance. A convergence theorem for PI-SSC is also established by applying Zangwill's convergence theorem. The results of the experiment demonstrate the effectiveness of the proposed algorithm in high dimensional sparse text data and noisy texture data.
Original language | English |
---|---|
Pages (from-to) | 133-154 |
Number of pages | 22 |
Journal | Information Sciences |
Volume | 246 |
DOIs | |
Publication status | Published - 10 Oct 2013 |
Keywords
- Convergence
- Fuzzy clustering
- High dimensional data
- Soft subspace clustering
ASJC Scopus subject areas
- Software
- Control and Systems Engineering
- Theoretical Computer Science
- Computer Science Applications
- Information Systems and Management
- Artificial Intelligence