Abstract
Many cellular processes exhibit periodic behaviors. Hence, one of the important tasks in gene expression data analysis is to detect subset of genes that exhibit cyclicity or periodicity in their gene expression time series profiles. Unfortunately, gene expression time series profiles are usually of very short length, with very few periods, irregularly sampled and are highly contaminated with noise. This makes the detection of periodic profiles a very challenging problem. Recently, a hypothesis testing method based on the Fisher g-statistic with correction for multiple testing has been proposed to detect periodic gene expression profiles. However, it was observed that the test is not reliable if the signal length is too short. In this paper, we performed extensive simulation study to investigate the statistical power of the test as a function of noise distribution, signal length, SNR, and the false discovery rate (FDR). We have found that the number of periodic profiles can be severely underestimated for short length signal. The findings indicate that caution needs to be exercised when interpreting the test result for very short length signals.
Original language | English |
---|---|
Pages (from-to) | 549-556 |
Number of pages | 8 |
Journal | Pattern Recognition |
Volume | 42 |
Issue number | 4 |
DOIs | |
Publication status | Published - 1 Apr 2009 |
Keywords
- Fisher exact test
- g-Statistic
- Gene expression profiles
- Periodicity detection
- Short signal
ASJC Scopus subject areas
- Software
- Signal Processing
- Computer Vision and Pattern Recognition
- Artificial Intelligence