Tags

Type your tag names separated by a space and hit enter

A protocol for large-scale rRNA sequence analysis: towards a detailed phylogeny of Coleoptera.
Mol Phylogenet Evol. 2008 Apr; 47(1):289-301.MP

Abstract

Large-scale phylogenetic analyses involving thousands of rRNA sequences are complicated due to length variability which compounds the already complex problem of large tree searches. Here, we generated a large data matrix and test phylogenetic procedures for large-scale analysis in the Coleoptera (beetles), as a resource for evolutionary biology and identification of this hugely diverse group. The analysis included nearly 1200 species, including representatives of 126 (75%) families, all 18 superfamilies of Polyphaga, and the four suborders. Alignments were obtained by a fragment-extension method derived from the BLAST algorithm using the BlastAlign script [Belshaw, R., Katzourakis, A., 2005. BlastAlign: a program that uses blast to align problematic nucleotide sequences. Bioinformatics 21, 122-123], followed by fast parsimony and maximum likelihood searches. Trees were assessed against the existing classification, using a formal procedure for coding the hierarchical position of taxa and establishing taxonomic congruence. We found that the BlastAlign procedure greatly exceeded the performance of standard progressive alignment methods such as Clustal. The resulting trees, when used as guide tree, also greatly improved the Clustal-based alignments. Long-branch attraction potentially affecting the quality of the tree was reduced by the systematic removal of all branches longer than a 95% interval of the distribution of branch lengths. We applied this protocol to the test for monophyly of major proposed lineages of Coleoptera, including Crowson's 18 superfamilies in the hyperdiverse suborder Polyphaga. While searches for very large trees remained challenging and details of the tree topology were not always satisfactory, the strategy for alignment and tree searches used here makes large-scale phylogenetics of super-diverse groups such as Coleoptera amenable to desktop computing.

Authors+Show Affiliations

Department of Entomology, Natural History Museum, London SW7 5BD, UK.No affiliation info available

Pub Type(s)

Journal Article
Research Support, Non-U.S. Gov't

Language

eng

PubMed ID

18191591

Citation

Hunt, Toby, and Alfried P. Vogler. "A Protocol for Large-scale rRNA Sequence Analysis: Towards a Detailed Phylogeny of Coleoptera." Molecular Phylogenetics and Evolution, vol. 47, no. 1, 2008, pp. 289-301.
Hunt T, Vogler AP. A protocol for large-scale rRNA sequence analysis: towards a detailed phylogeny of Coleoptera. Mol Phylogenet Evol. 2008;47(1):289-301.
Hunt, T., & Vogler, A. P. (2008). A protocol for large-scale rRNA sequence analysis: towards a detailed phylogeny of Coleoptera. Molecular Phylogenetics and Evolution, 47(1), 289-301. https://doi.org/10.1016/j.ympev.2007.11.029
Hunt T, Vogler AP. A Protocol for Large-scale rRNA Sequence Analysis: Towards a Detailed Phylogeny of Coleoptera. Mol Phylogenet Evol. 2008;47(1):289-301. PubMed PMID: 18191591.
* Article titles in AMA citation format should be in sentence-case
TY - JOUR T1 - A protocol for large-scale rRNA sequence analysis: towards a detailed phylogeny of Coleoptera. AU - Hunt,Toby, AU - Vogler,Alfried P, Y1 - 2007/12/07/ PY - 2007/08/09/received PY - 2007/11/02/revised PY - 2007/11/17/accepted PY - 2008/1/15/pubmed PY - 2008/8/1/medline PY - 2008/1/15/entrez SP - 289 EP - 301 JF - Molecular phylogenetics and evolution JO - Mol Phylogenet Evol VL - 47 IS - 1 N2 - Large-scale phylogenetic analyses involving thousands of rRNA sequences are complicated due to length variability which compounds the already complex problem of large tree searches. Here, we generated a large data matrix and test phylogenetic procedures for large-scale analysis in the Coleoptera (beetles), as a resource for evolutionary biology and identification of this hugely diverse group. The analysis included nearly 1200 species, including representatives of 126 (75%) families, all 18 superfamilies of Polyphaga, and the four suborders. Alignments were obtained by a fragment-extension method derived from the BLAST algorithm using the BlastAlign script [Belshaw, R., Katzourakis, A., 2005. BlastAlign: a program that uses blast to align problematic nucleotide sequences. Bioinformatics 21, 122-123], followed by fast parsimony and maximum likelihood searches. Trees were assessed against the existing classification, using a formal procedure for coding the hierarchical position of taxa and establishing taxonomic congruence. We found that the BlastAlign procedure greatly exceeded the performance of standard progressive alignment methods such as Clustal. The resulting trees, when used as guide tree, also greatly improved the Clustal-based alignments. Long-branch attraction potentially affecting the quality of the tree was reduced by the systematic removal of all branches longer than a 95% interval of the distribution of branch lengths. We applied this protocol to the test for monophyly of major proposed lineages of Coleoptera, including Crowson's 18 superfamilies in the hyperdiverse suborder Polyphaga. While searches for very large trees remained challenging and details of the tree topology were not always satisfactory, the strategy for alignment and tree searches used here makes large-scale phylogenetics of super-diverse groups such as Coleoptera amenable to desktop computing. SN - 1055-7903 UR - https://www.unboundmedicine.com/medline/citation/18191591/A_protocol_for_large_scale_rRNA_sequence_analysis:_towards_a_detailed_phylogeny_of_Coleoptera_ DB - PRIME DP - Unbound Medicine ER -