Abstract
In this paper, an effective promoter detection algorithm, which is called PromoterExplorer, is proposed. In our approach, various features, i.e. local distribution of pentamers, positional CpG island features and digitized DNA sequence, are combined to build a high-dimensional input vector. A cascade AdaBoost based learning procedure is adopted to select the most "informative" or "discriminating" features to build a sequence of weak classifiers. A number of weak classifiers construct a strong classifier, which can achieve a better performance. In order to reduce the false positive, a cascade structure is used for detection. PromoterExplorer is tested based on large-scale DNA sequences from different databases, including EPD, Genbank and human chromosome 22. The proposed method consistently outperforms PromoterInspector and Dragon Promoter Finder.
Original language | English |
---|---|
Title of host publication | Proceedings of the 5th Asia-Pacific Bioinformatics Conference, APBC 2007 |
Pages | 37-46 |
Number of pages | 10 |
Volume | 5 |
Publication status | Published - 1 Dec 2007 |
Event | 5th Asia-Pacific Bioinformatics Conference, APBC 2007 - Hong Kong, Hong Kong Duration: 15 Jan 2007 → 17 Jan 2007 |
Conference
Conference | 5th Asia-Pacific Bioinformatics Conference, APBC 2007 |
---|---|
Country/Territory | Hong Kong |
City | Hong Kong |
Period | 15/01/07 → 17/01/07 |
ASJC Scopus subject areas
- Bioengineering
- Information Systems