Tags

Type your tag names separated by a space and hit enter

A database of phylogenetically atypical genes in archaeal and bacterial genomes, identified using the DarkHorse algorithm.
BMC Bioinformatics. 2008 Oct 07; 9:419.BB

Abstract

BACKGROUND

The process of horizontal gene transfer (HGT) is believed to be widespread in Bacteria and Archaea, but little comparative data is available addressing its occurrence in complete microbial genomes. Collection of high-quality, automated HGT prediction data based on phylogenetic evidence has previously been impractical for large numbers of genomes at once, due to prohibitive computational demands. DarkHorse, a recently described statistical method for discovering phylogenetically atypical genes on a genome-wide basis, provides a means to solve this problem through lineage probability index (LPI) ranking scores. LPI scores inversely reflect phylogenetic distance between a test amino acid sequence and its closest available database matches. Proteins with low LPI scores are good horizontal gene transfer candidates; those with high scores are not.

DESCRIPTION

The DarkHorse algorithm has been applied to 955 microbial genome sequences, and the results organized into a web-searchable relational database, called the DarkHorse HGT Candidate Resource http://darkhorse.ucsd.edu. Users can select individual genomes or groups of genomes to screen by LPI score, search for protein functions by descriptive annotation or amino acid sequence similarity, or select proteins with unusual G+C composition in their underlying coding sequences. The search engine reports LPI scores for match partners as well as query sequences, providing the opportunity to explore whether potential HGT donor sequences are phylogenetically typical or atypical within their own genomes. This information can be used to predict whether or not sufficient information is available to build a well-supported phylogenetic tree using the potential donor sequence.

CONCLUSION

The DarkHorse HGT Candidate database provides a powerful, flexible set of tools for identifying phylogenetically atypical proteins, allowing researchers to explore both individual HGT events in single genomes, and large-scale HGT patterns among protein families and genome groups. Although the DarkHorse algorithm cannot, by itself, provide definitive proof of horizontal gene transfer, it is a flexible, powerful tool that can be combined with slower, more rigorous methods in situations where these other methods could not otherwise be applied.

Authors+Show Affiliations

Marine Biology Research Division, Scripps Institution of Oceanography University of California at San Diego, La Jolla, CA 92093 USA. spodell@ucsd.eduNo affiliation info availableNo affiliation info available

Pub Type(s)

Journal Article
Research Support, Non-U.S. Gov't

Language

eng

PubMed ID

18840280

Citation

Podell, Sheila, et al. "A Database of Phylogenetically Atypical Genes in Archaeal and Bacterial Genomes, Identified Using the DarkHorse Algorithm." BMC Bioinformatics, vol. 9, 2008, p. 419.
Podell S, Gaasterland T, Allen EE. A database of phylogenetically atypical genes in archaeal and bacterial genomes, identified using the DarkHorse algorithm. BMC Bioinformatics. 2008;9:419.
Podell, S., Gaasterland, T., & Allen, E. E. (2008). A database of phylogenetically atypical genes in archaeal and bacterial genomes, identified using the DarkHorse algorithm. BMC Bioinformatics, 9, 419. https://doi.org/10.1186/1471-2105-9-419
Podell S, Gaasterland T, Allen EE. A Database of Phylogenetically Atypical Genes in Archaeal and Bacterial Genomes, Identified Using the DarkHorse Algorithm. BMC Bioinformatics. 2008 Oct 7;9:419. PubMed PMID: 18840280.
* Article titles in AMA citation format should be in sentence-case
TY - JOUR T1 - A database of phylogenetically atypical genes in archaeal and bacterial genomes, identified using the DarkHorse algorithm. AU - Podell,Sheila, AU - Gaasterland,Terry, AU - Allen,Eric E, Y1 - 2008/10/07/ PY - 2008/05/14/received PY - 2008/10/07/accepted PY - 2008/10/9/pubmed PY - 2009/2/7/medline PY - 2008/10/9/entrez SP - 419 EP - 419 JF - BMC bioinformatics JO - BMC Bioinformatics VL - 9 N2 - BACKGROUND: The process of horizontal gene transfer (HGT) is believed to be widespread in Bacteria and Archaea, but little comparative data is available addressing its occurrence in complete microbial genomes. Collection of high-quality, automated HGT prediction data based on phylogenetic evidence has previously been impractical for large numbers of genomes at once, due to prohibitive computational demands. DarkHorse, a recently described statistical method for discovering phylogenetically atypical genes on a genome-wide basis, provides a means to solve this problem through lineage probability index (LPI) ranking scores. LPI scores inversely reflect phylogenetic distance between a test amino acid sequence and its closest available database matches. Proteins with low LPI scores are good horizontal gene transfer candidates; those with high scores are not. DESCRIPTION: The DarkHorse algorithm has been applied to 955 microbial genome sequences, and the results organized into a web-searchable relational database, called the DarkHorse HGT Candidate Resource http://darkhorse.ucsd.edu. Users can select individual genomes or groups of genomes to screen by LPI score, search for protein functions by descriptive annotation or amino acid sequence similarity, or select proteins with unusual G+C composition in their underlying coding sequences. The search engine reports LPI scores for match partners as well as query sequences, providing the opportunity to explore whether potential HGT donor sequences are phylogenetically typical or atypical within their own genomes. This information can be used to predict whether or not sufficient information is available to build a well-supported phylogenetic tree using the potential donor sequence. CONCLUSION: The DarkHorse HGT Candidate database provides a powerful, flexible set of tools for identifying phylogenetically atypical proteins, allowing researchers to explore both individual HGT events in single genomes, and large-scale HGT patterns among protein families and genome groups. Although the DarkHorse algorithm cannot, by itself, provide definitive proof of horizontal gene transfer, it is a flexible, powerful tool that can be combined with slower, more rigorous methods in situations where these other methods could not otherwise be applied. SN - 1471-2105 UR - https://www.unboundmedicine.com/medline/citation/18840280/A_database_of_phylogenetically_atypical_genes_in_archaeal_and_bacterial_genomes_identified_using_the_DarkHorse_algorithm_ L2 - https://bmcbioinformatics.biomedcentral.com/articles/10.1186/1471-2105-9-419 DB - PRIME DP - Unbound Medicine ER -