Abstract
A multi-labeled phylogenetic tree, or MUL-tree, is a generalization of a phylogenetic tree that allows each leaf label to be used many times. MUL-trees have applications in biogeography, the study of host-parasite cospeciation, gene evolution studies, and computer science. Here, we consider the problem of inferring a consensus MUL-tree that summarizes a given set of conflicting MUL-trees, and present the first polynomial-time algorithms for solving it. In particular, we give a straightforward, fast algorithm for building a strict consensus MUL-tree for any input set of MUL-trees with identical leaf label multisets, as well as a polynomial-time algorithm for building a majority rule consensus MUL-tree for the special case where every leaf label occurs at most twice. We also show that, although it is NP-hard to find a majority rule consensus MUL-tree in general, the variant that we call the singular majority rule consensus MUL-tree can be constructed efficiently whenever it exists.
| Original language | English |
|---|---|
| Pages (from-to) | 1073-1088 |
| Number of pages | 16 |
| Journal | Journal of Computational Biology |
| Volume | 19 |
| Issue number | 9 |
| DOIs | |
| Publication status | Published - 1 Sept 2012 |
| Externally published | Yes |
Keywords
- algorithm
- cluster
- computational complexity
- consensus tree
- MUL-tree.
- multi-labeled phylogenetic tree multiset
ASJC Scopus subject areas
- Molecular Biology
- Genetics
- Computational Mathematics
- Modelling and Simulation
- Computational Theory and Mathematics