Missing value imputation for gene expression data: Computational techniques to recover missing data from available information

AlanWee Chung Liew, Ngai Fong Law, Hong Yan

Research output: Journal article publicationJournal articleAcademic researchpeer-review

135 Citations (Scopus)


Microarray gene expression data generally suffers from missing value problem due to a variety of experimental reasons. Since the missing data points can adversely affect downstream analysis, many algorithms have been proposed to impute missing values. In this survey, we provide a comprehensive review of existing missing value imputation algorithms, focusing on their underlying algorithmic techniques and how they utilize local or global information from within the data, or their use of domain knowledge during imputation. In addition, we describe how the imputation results can be validated and the different ways to assess the performance of different imputation algorithms, as well as a discussion on some possible future research directions. It is hoped that this review will give the readers a good understanding of the current development in this field and inspire them to come up with the next generation of imputation algorithms. Published by Oxford University Press.
Original languageEnglish
Article numberbbq080
Pages (from-to)498-513
Number of pages16
JournalBriefings in Bioinformatics
Issue number5
Publication statusPublished - 1 Sep 2011


  • Gene expression analysis
  • Gene expression data
  • Information recovery
  • Missing value imputation

ASJC Scopus subject areas

  • Molecular Biology
  • Information Systems

Cite this