Utilizing Both Topological and Attribute Information for Protein Complex Identification in PPI Networks

Allen L. Hu, Chun Chung Chan

Research output: Journal article publicationJournal articleAcademic researchpeer-review

27 Citations (Scopus)

Abstract

� 2013 IEEE. Many computational approaches developed to identify protein complexes in protein-protein interaction (PPI) networks perform their tasks based only on network topologies. The attributes of the proteins in the networks are usually ignored. As protein attributes within a complex may also be related to each other, we have developed a PCIA algorithm to take into consideration both such information and network topology in the identification process of protein complexes. Given a PPI network, PCIA first finds information about the attributes of the proteins in a PPI network in the Gene Ontology databases and uses such information for the identification of protein complexes. PCIA then computes a Degree of Association measure for each pair of interacting proteins to quantitatively determine how much their attribute values associate with each other. Based on this association measure, PCIA is able to discover dense graph clusters consisting of proteins whose attribute values are significantly closer associated with each other. PCIA has been tested with real data and experimental results seem to indicate that attributes of the proteins in the same complex do have some association with each other and, therefore, that protein complexes can be more accurately identified when protein attributes are taken into consideration.
Original languageEnglish
Article number6513225
Pages (from-to)780-792
Number of pages13
JournalIEEE/ACM Transactions on Computational Biology and Bioinformatics
Volume10
Issue number3
DOIs
Publication statusPublished - 1 May 2013

Keywords

  • gene ontology
  • graph clustering
  • Markov clustering
  • PPI networks
  • protein complex

ASJC Scopus subject areas

  • Biotechnology
  • Genetics
  • Applied Mathematics

Cite this