Tags

Type your tag names separated by a space and hit enter

Performance of maximum parsimony and likelihood phylogenetics when evolution is heterogeneous.
Nature. 2004 Oct 21; 431(7011):980-4.Nat

Abstract

All inferences in comparative biology depend on accurate estimates of evolutionary relationships. Recent phylogenetic analyses have turned away from maximum parsimony towards the probabilistic techniques of maximum likelihood and bayesian Markov chain Monte Carlo (BMCMC). These probabilistic techniques represent a parametric approach to statistical phylogenetics, because their criterion for evaluating a topology--the probability of the data, given the tree--is calculated with reference to an explicit evolutionary model from which the data are assumed to be identically distributed. Maximum parsimony can be considered nonparametric, because trees are evaluated on the basis of a general metric--the minimum number of character state changes required to generate the data on a given tree--without assuming a specific distribution. The shift to parametric methods was spurred, in large part, by studies showing that although both approaches perform well most of the time, maximum parsimony is strongly biased towards recovering an incorrect tree under certain combinations of branch lengths, whereas maximum likelihood is not. All these evaluations simulated sequences by a largely homogeneous evolutionary process in which data are identically distributed. There is ample evidence, however, that real-world gene sequences evolve heterogeneously and are not identically distributed. Here we show that maximum likelihood and BMCMC can become strongly biased and statistically inconsistent when the rates at which sequence sites evolve change non-identically over time. Maximum parsimony performs substantially better than current parametric methods over a wide range of conditions tested, including moderate heterogeneity and phylogenetic problems not normally considered difficult.

Authors+Show Affiliations

Department of Computer and Information Science, University of Oregon, Eugene, Oregon 97403, USA.No affiliation info available

Pub Type(s)

Journal Article
Research Support, U.S. Gov't, Non-P.H.S.

Language

eng

PubMed ID

15496922

Citation

Kolaczkowski, Bryan, and Joseph W. Thornton. "Performance of Maximum Parsimony and Likelihood Phylogenetics when Evolution Is Heterogeneous." Nature, vol. 431, no. 7011, 2004, pp. 980-4.
Kolaczkowski B, Thornton JW. Performance of maximum parsimony and likelihood phylogenetics when evolution is heterogeneous. Nature. 2004;431(7011):980-4.
Kolaczkowski, B., & Thornton, J. W. (2004). Performance of maximum parsimony and likelihood phylogenetics when evolution is heterogeneous. Nature, 431(7011), 980-4.
Kolaczkowski B, Thornton JW. Performance of Maximum Parsimony and Likelihood Phylogenetics when Evolution Is Heterogeneous. Nature. 2004 Oct 21;431(7011):980-4. PubMed PMID: 15496922.
* Article titles in AMA citation format should be in sentence-case
TY - JOUR T1 - Performance of maximum parsimony and likelihood phylogenetics when evolution is heterogeneous. AU - Kolaczkowski,Bryan, AU - Thornton,Joseph W, PY - 2004/05/17/received PY - 2004/08/06/accepted PY - 2004/10/22/pubmed PY - 2004/12/16/medline PY - 2004/10/22/entrez SP - 980 EP - 4 JF - Nature JO - Nature VL - 431 IS - 7011 N2 - All inferences in comparative biology depend on accurate estimates of evolutionary relationships. Recent phylogenetic analyses have turned away from maximum parsimony towards the probabilistic techniques of maximum likelihood and bayesian Markov chain Monte Carlo (BMCMC). These probabilistic techniques represent a parametric approach to statistical phylogenetics, because their criterion for evaluating a topology--the probability of the data, given the tree--is calculated with reference to an explicit evolutionary model from which the data are assumed to be identically distributed. Maximum parsimony can be considered nonparametric, because trees are evaluated on the basis of a general metric--the minimum number of character state changes required to generate the data on a given tree--without assuming a specific distribution. The shift to parametric methods was spurred, in large part, by studies showing that although both approaches perform well most of the time, maximum parsimony is strongly biased towards recovering an incorrect tree under certain combinations of branch lengths, whereas maximum likelihood is not. All these evaluations simulated sequences by a largely homogeneous evolutionary process in which data are identically distributed. There is ample evidence, however, that real-world gene sequences evolve heterogeneously and are not identically distributed. Here we show that maximum likelihood and BMCMC can become strongly biased and statistically inconsistent when the rates at which sequence sites evolve change non-identically over time. Maximum parsimony performs substantially better than current parametric methods over a wide range of conditions tested, including moderate heterogeneity and phylogenetic problems not normally considered difficult. SN - 1476-4687 UR - https://www.unboundmedicine.com/medline/citation/15496922/Performance_of_maximum_parsimony_and_likelihood_phylogenetics_when_evolution_is_heterogeneous_ DB - PRIME DP - Unbound Medicine ER -