Penalized feature selection and classification in bioinformatics

Shuangge Ma, Jian Huang

Research output: Journal article publicationReview articleAcademic researchpeer-review

152 Citations (Scopus)

Abstract

In bioinformatics studies, supervised classification with high-dimensional input variables is frequently encountered. Examples routinely arise in genomic, epigenetic and proteomic studies. Feature selection can be employed along with classifier construction to avoid over-fitting, to generate more reliable classifier and to provide more insights into the underlying causal relationships. In this article, we provide a review of several recently developed penalized feature selection and classification techniques - which belong to the family of embedded feature selection methods - for bioinformatics studies with high-dimensional input. Classification objective functions, penalty functions and computational algorithms are discussed. Our goal is to make interested researchers aware of these feature selection and classification methods that are applicable to high-dimensional bioinformatics data. Published by Oxford University Press.
Original languageEnglish
Pages (from-to)392-403
Number of pages12
JournalBriefings in Bioinformatics
Volume9
Issue number5
DOIs
Publication statusPublished - 27 Aug 2008
Externally publishedYes

Keywords

  • Bioinformatics application
  • Feature selection
  • Penalization

ASJC Scopus subject areas

  • Information Systems
  • Molecular Biology

Cite this