Abstract
The functions of proteins are closely related to their subcellular locations. In the post-proteomics era, the amount of gene and protein data grows exponentially, which necessitates the prediction of subcellular localization by computational means. This paper proposes mitigating the computation burden of alignment-based approaches to subcellular localization prediction by using the information provided by the N-terminal sorting signals. To this end, a cascaded fusion of cleavage site prediction and profile alignment is proposed. Specifically, the informative segments of protein sequences are identified by a cleavage site predictor. Then, only the informative segments are applied to a homology-based classifier for predicting the subcellular locations. Experimental results on a newly constructed dataset show that the method can make use of the best property of both approaches and can attain an accuracy higher than using the full-length sequences. Moreover, the method can reduce the computation time by 20 folds. We advocate that the method will be important for biologists to conduct large-scale protein annotation or for bioinformaticians to perform preliminary investigations on new algorithms that involve pairwise alignments.
| Original language | English |
|---|---|
| Title of host publication | 2010 IEEE Symposium on Computational Intelligence in Bioinformatics and Computational Biology, CIBCB 2010 |
| Pages | 147-154 |
| Number of pages | 8 |
| DOIs | |
| Publication status | Published - 20 Aug 2010 |
| Event | 2010 IEEE Symposium on Computational Intelligence in Bioinformatics and Computational Biology, CIBCB 2010 - Montreal, QC, Canada Duration: 2 May 2010 → 5 May 2010 |
Conference
| Conference | 2010 IEEE Symposium on Computational Intelligence in Bioinformatics and Computational Biology, CIBCB 2010 |
|---|---|
| Country/Territory | Canada |
| City | Montreal, QC |
| Period | 2/05/10 → 5/05/10 |
Keywords
- Cleavage sites prediction
- Profiles alignment
- Protein sequences
- Subcellular localization
- Support vector machines
ASJC Scopus subject areas
- Artificial Intelligence
- Computational Theory and Mathematics
- Biomedical Engineering