Abstract
The functions of proteins are closely related to their subcellular locations. In the post-proteomics era, the amount of gene and protein data grows exponentially, which necessitates the prediction of subcellular localization by computational means. This paper proposes mitigating the computation burden of alignment-based approaches to subcellular localization prediction by using the information provided by the N-terminal sorting signals. To this end, a cascaded fusion of cleavage site prediction and profile alignment is proposed. Specifically, the informative segments of protein sequences are identified by a cleavage site predictor. Then, only the informative segments are applied to a homology-based classifier for predicting the subcellular locations. Experimental results on a newly constructed dataset show that the method can make use of the best property of both approaches and can attain an accuracy higher than using the full-length sequences. Moreover, the method can reduce the computation time by 20 folds. We advocate that the method will be important for biologists to conduct large-scale protein annotation or for bioinformaticians to perform preliminary investigations on new algorithms that involve pairwise alignments.
Original language | English |
---|---|
Title of host publication | 2010 IEEE Symposium on Computational Intelligence in Bioinformatics and Computational Biology, CIBCB 2010 |
Pages | 147-154 |
Number of pages | 8 |
DOIs | |
Publication status | Published - 20 Aug 2010 |
Event | 2010 IEEE Symposium on Computational Intelligence in Bioinformatics and Computational Biology, CIBCB 2010 - Montreal, QC, Canada Duration: 2 May 2010 → 5 May 2010 |
Conference
Conference | 2010 IEEE Symposium on Computational Intelligence in Bioinformatics and Computational Biology, CIBCB 2010 |
---|---|
Country/Territory | Canada |
City | Montreal, QC |
Period | 2/05/10 → 5/05/10 |
Keywords
- Cleavage sites prediction
- Profiles alignment
- Protein sequences
- Subcellular localization
- Support vector machines
ASJC Scopus subject areas
- Artificial Intelligence
- Computational Theory and Mathematics
- Biomedical Engineering