Tags

Type your tag names separated by a space and hit enter

Effects of branch length uncertainty on Bayesian posterior probabilities for phylogenetic hypotheses.
Mol Biol Evol. 2007 Sep; 24(9):2108-18.MB

Abstract

In Bayesian phylogenetics, confidence in evolutionary relationships is expressed as posterior probability--the probability that a tree or clade is true given the data, evolutionary model, and prior assumptions about model parameters. Model parameters, such as branch lengths, are never known in advance; Bayesian methods incorporate this uncertainty by integrating over a range of plausible values given an assumed prior probability distribution for each parameter. Little is known about the effects of integrating over branch length uncertainty on posterior probabilities when different priors are assumed. Here, we show that integrating over uncertainty using a wide range of typical prior assumptions strongly affects posterior probabilities, causing them to deviate from those that would be inferred if branch lengths were known in advance; only when there is no uncertainty to integrate over does the average posterior probability of a group of trees accurately predict the proportion of correct trees in the group. The pattern of branch lengths on the true tree determines whether integrating over uncertainty pushes posterior probabilities upward or downward. The magnitude of the effect depends on the specific prior distributions used and the length of the sequences analyzed. Under realistic conditions, however, even extraordinarily long sequences are not enough to prevent frequent inference of incorrect clades with strong support. We found that across a range of conditions, diffuse priors--either flat or exponential distributions with moderate to large means--provide more reliable inferences than small-mean exponential priors. An empirical Bayes approach that fixes branch lengths at their maximum likelihood estimates yields posterior probabilities that more closely match those that would be inferred if the true branch lengths were known in advance and reduces the rate of strongly supported false inferences compared with fully Bayesian integration.

Authors+Show Affiliations

Center for Ecology and Evolutionary Biology, University of Oregon, USA.No affiliation info available

Pub Type(s)

Journal Article
Research Support, N.I.H., Extramural
Research Support, Non-U.S. Gov't
Research Support, U.S. Gov't, Non-P.H.S.

Language

eng

PubMed ID

17636043

Citation

Kolaczkowski, Bryan, and Joseph W. Thornton. "Effects of Branch Length Uncertainty On Bayesian Posterior Probabilities for Phylogenetic Hypotheses." Molecular Biology and Evolution, vol. 24, no. 9, 2007, pp. 2108-18.
Kolaczkowski B, Thornton JW. Effects of branch length uncertainty on Bayesian posterior probabilities for phylogenetic hypotheses. Mol Biol Evol. 2007;24(9):2108-18.
Kolaczkowski, B., & Thornton, J. W. (2007). Effects of branch length uncertainty on Bayesian posterior probabilities for phylogenetic hypotheses. Molecular Biology and Evolution, 24(9), 2108-18.
Kolaczkowski B, Thornton JW. Effects of Branch Length Uncertainty On Bayesian Posterior Probabilities for Phylogenetic Hypotheses. Mol Biol Evol. 2007;24(9):2108-18. PubMed PMID: 17636043.
* Article titles in AMA citation format should be in sentence-case
TY - JOUR T1 - Effects of branch length uncertainty on Bayesian posterior probabilities for phylogenetic hypotheses. AU - Kolaczkowski,Bryan, AU - Thornton,Joseph W, Y1 - 2007/07/17/ PY - 2007/7/20/pubmed PY - 2007/11/8/medline PY - 2007/7/20/entrez SP - 2108 EP - 18 JF - Molecular biology and evolution JO - Mol Biol Evol VL - 24 IS - 9 N2 - In Bayesian phylogenetics, confidence in evolutionary relationships is expressed as posterior probability--the probability that a tree or clade is true given the data, evolutionary model, and prior assumptions about model parameters. Model parameters, such as branch lengths, are never known in advance; Bayesian methods incorporate this uncertainty by integrating over a range of plausible values given an assumed prior probability distribution for each parameter. Little is known about the effects of integrating over branch length uncertainty on posterior probabilities when different priors are assumed. Here, we show that integrating over uncertainty using a wide range of typical prior assumptions strongly affects posterior probabilities, causing them to deviate from those that would be inferred if branch lengths were known in advance; only when there is no uncertainty to integrate over does the average posterior probability of a group of trees accurately predict the proportion of correct trees in the group. The pattern of branch lengths on the true tree determines whether integrating over uncertainty pushes posterior probabilities upward or downward. The magnitude of the effect depends on the specific prior distributions used and the length of the sequences analyzed. Under realistic conditions, however, even extraordinarily long sequences are not enough to prevent frequent inference of incorrect clades with strong support. We found that across a range of conditions, diffuse priors--either flat or exponential distributions with moderate to large means--provide more reliable inferences than small-mean exponential priors. An empirical Bayes approach that fixes branch lengths at their maximum likelihood estimates yields posterior probabilities that more closely match those that would be inferred if the true branch lengths were known in advance and reduces the rate of strongly supported false inferences compared with fully Bayesian integration. SN - 0737-4038 UR - https://www.unboundmedicine.com/medline/citation/17636043/Effects_of_branch_length_uncertainty_on_Bayesian_posterior_probabilities_for_phylogenetic_hypotheses_ L2 - https://academic.oup.com/mbe/article-lookup/doi/10.1093/molbev/msm141 DB - PRIME DP - Unbound Medicine ER -