Multi-metric and multi-substructure biclustering analysis for gene expression data

S. Y. Kung, Man Wai Mak, Ilias Tagkopoulos

Research output: Chapter in book / Conference proceeding; Conference article published in proceeding or book; Academic research; peer-review

11 Citations (Scopus)


A good number of biclustering algorithms have been proposed for grouping gene expression data. Many of them adopt matrix norms to define the similarity score of a bicluster. We show that almost all matrix metrics can be converted into vector norms while preserving rank equivalence. Vector norms provide a much more efficient vehicle for biclustering analysis and computation. The advantages are twofold: ease of analysis and savings in computation. Most existing biclustering algorithms also implicitly assume a univariate (i.e., single-metric) evaluation for identifying biclusters. Such an approach, however, overlooks the fundamental principle that genes, even those belonging to the same gene group, (1) may be subdivided into different substructures, and (2) may be co-expressed via a diversity of coherence models (a gene may participate in multiple pathways that may or may not be co-active under all conditions). The former leads to the adoption of a multi-substructure analysis, the latter to a multivariate analysis. This paper shows that the proposed multivariate and multi-substructure analysis is very effective in identifying and classifying biologically relevant groups of genes and conditions. For example, it has successfully yielded highly discriminant and accurate classification based on known ribosomal gene groups.
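The abstract does not spell out how a matrix metric is rewritten as a vector norm. As a rough illustration only, and not the authors' formulation, the sketch below uses one well-known identity: the squared Frobenius norm of a column-centered gene-by-condition block equals the sum of squared Euclidean distances between its gene vectors divided by the number of genes. The function names, the toy expression values, and the choice of this particular score are all assumptions made for the example.

```python
import math
from itertools import combinations


def centered_frobenius_sq(rows):
    """Matrix view: squared Frobenius norm of the block after subtracting
    each column (condition) mean taken across the rows (genes)."""
    n = len(rows)
    col_means = [math.fsum(col) / n for col in zip(*rows)]
    return math.fsum(
        (v - m) ** 2 for row in rows for v, m in zip(row, col_means)
    )


def pairwise_vector_score(rows):
    """Vector view: sum of squared Euclidean distances over all gene pairs,
    divided by the number of genes."""
    n = len(rows)
    total = math.fsum(
        math.fsum((a - b) ** 2 for a, b in zip(x, y))
        for x, y in combinations(rows, 2)
    )
    return total / n


# Hypothetical candidate gene groups (rows = genes, columns = conditions).
candidates = {
    "tight": [[1.0, 2.0, 3.0], [1.1, 2.1, 2.9], [0.9, 1.9, 3.1]],
    "medium": [[1.0, 2.0, 3.0], [1.5, 2.5, 2.5], [0.5, 1.5, 3.5]],
    "loose": [[1.0, 2.0, 3.0], [3.0, 1.0, 0.5], [0.2, 4.0, 2.0]],
}

# Rank candidates from most to least coherent under each score.
matrix_ranking = sorted(
    candidates, key=lambda k: centered_frobenius_sq(candidates[k])
)
vector_ranking = sorted(
    candidates, key=lambda k: pairwise_vector_score(candidates[k])
)

print(matrix_ranking)
print(vector_ranking)
```

Under this assumed score the two functions agree up to floating-point rounding, so both rankings come out in the same order. This is the sense in which a vector-norm score can stand in for a matrix-norm score without changing which candidate groups are preferred.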
Original language: English
Title of host publication: Proceedings - 2005 IEEE Computational Systems Bioinformatics Conference, CSB 2005
Number of pages: 12
Publication status: Published - 1 Dec 2005
Event: 2005 IEEE Computational Systems Bioinformatics Conference, CSB 2005 - Stanford, CA, United States
Duration: 8 Aug 2005 to 11 Aug 2005


Conference: 2005 IEEE Computational Systems Bioinformatics Conference, CSB 2005
Country/Territory: United States
City: Stanford, CA

ASJC Scopus subject areas

  • Engineering(all)
  • Medicine(all)

