Clustering gene expression data with hybrid GA approach

Patrick C.H. Ma, Chun Chung Chan

Research output: Chapter in book / Conference proceedingConference article published in proceeding or bookAcademic researchpeer-review

Abstract

The combined interpretation of gene expression data and gene sequences offers a valuable approach to investigate the intricate relationships involving gene transcriptional regulation. The highly interactive gene expression data produced by microarray hybridization experiments allow us to find coexpressed genes. By analyzing the upstream regions of the identified coexpressed genes, we can discover the regulatory patterns characterized by transcription factor binding sites, which govern the process of transcriptional regulation. In the following, we present a generic clustering algorithm that uses a Hybrid GA approach to discover clusters in gene expression data. The advantage of this method is that large search space can be effectively explored by utilizing the evolutionary algorithm techniques. Moreover, it is able to discover underlying patterns in noisy gene expression data for meaningful data groupings, and statistically significant patterns hidden in each cluster can also be extracted at the same time. Since, the proposed method can handle both continuous- and discrete-valued data, it can be used with other microarray data and biomedical data. The experimental results obtained from real expression data reveal meaningful groupings and uncover many known transcription factor binding sites.
Original languageEnglish
Title of host publicationProceedings of the IASTED International Conference on Artificial Intelligence and Soft Computing
Pages223-228
Number of pages6
Volume7
Publication statusPublished - 1 Dec 2003
EventProceedings of the Seventh IASTED International Conference on Artificial Intelligence and Soft Computing - Banff, Canada
Duration: 14 Jul 200316 Jul 2003

Conference

ConferenceProceedings of the Seventh IASTED International Conference on Artificial Intelligence and Soft Computing
Country/TerritoryCanada
CityBanff
Period14/07/0316/07/03

Keywords

  • Cluster analysis
  • Data mining
  • Gene expression data analysis
  • Genetic algorithms

ASJC Scopus subject areas

  • Development
  • Artificial Intelligence

Cite this