Abstract
Microarray gene expression data generally suffers from missing value problem due to a variety of experimental reasons. Since the missing data points can adversely affect downstream analysis, many algorithms have been proposed to impute missing values. In this survey, we provide a comprehensive review of existing missing value imputation algorithms, focusing on their underlying algorithmic techniques and how they utilize local or global information from within the data, or their use of domain knowledge during imputation. In addition, we describe how the imputation results can be validated and the different ways to assess the performance of different imputation algorithms, as well as a discussion on some possible future research directions. It is hoped that this review will give the readers a good understanding of the current development in this field and inspire them to come up with the next generation of imputation algorithms. Published by Oxford University Press.
Original language | English |
---|---|
Article number | bbq080 |
Pages (from-to) | 498-513 |
Number of pages | 16 |
Journal | Briefings in Bioinformatics |
Volume | 12 |
Issue number | 5 |
DOIs | |
Publication status | Published - 1 Sept 2011 |
Keywords
- Gene expression analysis
- Gene expression data
- Information recovery
- Missing value imputation
ASJC Scopus subject areas
- Molecular Biology
- Information Systems