Unsupervised discovery of fuzzy patterns in gene expression data

Gene P.K. Wu, Chun Chung Chan, Andrew K.C. Wong, Bin Wu

Research output: Chapter in book / Conference proceedingConference article published in proceeding or bookAcademic researchpeer-review

1 Citation (Scopus)

Abstract

Discovering patterns from gene expression levels is regarded as a classification problem when tissue classes of the samples are given and solved as a discrete-data problem by discretizing the expression levels of each gene into intervals maximizing the interdependence between that gene and the class labels. However, when class information is unavailable, discovering gene expression patterns becomes difficult. This paper attempts to tackle this important problem. For a gene pool with large number of genes, we first cluster the genes into smaller groups. In each group, we use the representative gene, one with highest interdependence with others in the group, to drive the discretization of the gene expression levels of other genes. Treating intervals as discrete events, association patterns can be discovered. If the gene groups obtained are crisp clusters, significant patterns overlapping different clusters cannot be found. This paper presents a new method of "fuzzifying" the crisp attribute clusters for that purpose. To evaluate the effectiveness of our approach, we first apply the above described procedure on a synthetic dataset and then a gene expression dataset with known class labels. The class labels are not being used in both analyses but used later as the ground truth in a classificatory problem for assessing the algorithm's effectiveness in fuzzy gene clustering and discretization. The results show the efficacy of the proposed method.
Original languageEnglish
Title of host publicationProceedings - 2010 IEEE International Conference on Bioinformatics and Biomedicine, BIBM 2010
Pages269-273
Number of pages5
DOIs
Publication statusPublished - 1 Dec 2010
Event2010 IEEE International Conference on Bioinformatics and Biomedicine, BIBM 2010 - Hong Kong, Hong Kong
Duration: 18 Dec 201021 Dec 2010

Conference

Conference2010 IEEE International Conference on Bioinformatics and Biomedicine, BIBM 2010
CountryHong Kong
CityHong Kong
Period18/12/1021/12/10

ASJC Scopus subject areas

  • Biomedical Engineering
  • Health Informatics

Cite this