Abstract
In order to overcome the limitation on small sizes of gene datasets, many meta-classification methods which ensemble classifiers with different datasets have been developed. However, due to discrepancies of the characteristics within heterogeneous or cross-platform datasets, the number of common and significant genes is usually small. Instead of matching common genes between heterogeneous datasets, we propose a novel solution, alternative feature mapping approach (AFM), to utilize related and discriminative gene expressions while not necessarily having exact matches. Genes in the training dataset are clustered and mapped to the test dataset as gene groups. Through analyzing the correlation within gene groups between training and test datasets, related significant genes can be applied for classification. We conducted experiments consisting of 8 heterogeneous datasets with different cancer types and platforms to test the effectiveness of AFM. Our experiments show that classification performance is greatly improved using suitable significant genes selected by AFM.
Original language | English |
---|---|
Title of host publication | Proceedings of the 2009 9th IEEE International Conference on Bioinformatics and BioEngineering, BIBE 2009 |
Pages | 138-145 |
Number of pages | 8 |
DOIs | |
Publication status | Published - 18 Nov 2009 |
Event | 2009 9th IEEE International Conference on Bioinformatics and BioEngineering, BIBE 2009 - Taichung, Taiwan Duration: 22 Jun 2009 → 24 Jun 2009 |
Conference
Conference | 2009 9th IEEE International Conference on Bioinformatics and BioEngineering, BIBE 2009 |
---|---|
Country/Territory | Taiwan |
City | Taichung |
Period | 22/06/09 → 24/06/09 |
Keywords
- Classification
- Feature
- Heterogeneous
- Mapping
- Microarray
ASJC Scopus subject areas
- Information Systems
- Biomedical Engineering
- Health Informatics