Abstract

The pragmatic species concept for Bacteria and Archaea is ultimately based on DNA-DNA hybridization (DDH). While this technique enables the taxonomist, in principle, to estimate the overall similarity between the genomes of two strains, it is tedious and error-prone and cannot be used to incrementally build up a comparative database. Recent technological progress in genome sequencing calls for bioinformatics methods to replace wet-lab DDH with in silico genome-to-genome comparison. Here we investigate state-of-the-art methods for inferring whole-genome distances for their ability to mimic DDH. Algorithms that efficiently determine high-scoring segment pairs or maximally unique matches perform well as a basis for inferring intergenomic distances. The examined distance functions, which are able to cope with heavily reduced genomes and repetitive sequence regions, outperform previously described ones in both correlation with DDH and error ratios when emulating it. Simulation of incompletely sequenced genomes indicates that some distance formulas are very robust against missing fractions of genomic information. Digitally derived genome-to-genome distances show a better correlation with 16S rRNA gene sequence distances than DDH values do. The future perspectives of genome-informed taxonomy are discussed, and the investigated methods are made available as a web service for genome-based species delineation.
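
The abstract does not state the distance formulas themselves. As a rough illustration only, the sketch below shows two simple ways an intergenomic distance could be derived from a set of high-scoring segment pairs (HSPs). The `HSP` record, the function names, and both formulas are assumptions made for this example and are not taken from the paper.

```python
from typing import NamedTuple, Sequence


class HSP(NamedTuple):
    """Hypothetical record for one high-scoring segment pair."""
    length: int      # aligned length in base pairs
    identities: int  # identical positions within the alignment


def identity_distance(hsps: Sequence[HSP]) -> float:
    """1 - (identical positions / aligned length), summed over all HSPs.

    Assumed formula: it looks only at matched regions, so the genome
    lengths do not enter the calculation.
    """
    total_len = sum(h.length for h in hsps)
    if total_len == 0:
        return 1.0  # no matches at all: treat as maximally distant
    return 1.0 - sum(h.identities for h in hsps) / total_len


def coverage_distance(hsps: Sequence[HSP], len_a: int, len_b: int) -> float:
    """1 - (2 * aligned length / (len_a + len_b)).

    Assumed formula: requires non-overlapping HSPs and depends on the
    genome lengths, so missing sequence changes the result directly.
    """
    total_len = sum(h.length for h in hsps)
    return 1.0 - min(1.0, 2 * total_len / (len_a + len_b))


if __name__ == "__main__":
    hsps = [HSP(length=1000, identities=900), HSP(length=500, identities=500)]
    print(identity_distance(hsps))
    print(coverage_distance(hsps, len_a=2000, len_b=1000))
```

In practice the HSP lengths and identity counts would be parsed from the output of a local alignment tool. A ratio such as `identity_distance` uses only the matched regions, which is one plausible reason a formula of that kind might tolerate incompletely sequenced genomes better than a coverage-based one.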

Keywords

Genome, Biology, Computational biology, DNA sequencing, Genome project, Reference genome, Whole genome sequencing, Genetics, Bacterial genome size, Comparative genomics, Genomics, Gene

Publication Info

Year: 2010
Type: article
Volume: 2
Issue: 1
Pages: 117-134
Citations: 1630
Access: Closed

Cite This

Alexander F. Auch, Mathias von Jan, Hans‐Peter Klenk et al. (2010). Digital DNA-DNA hybridization for microbial species delineation by means of genome-to-genome sequence comparison. Standards in Genomic Sciences, 2(1), 117-134. https://doi.org/10.4056/sigs.531120

Identifiers

DOI
10.4056/sigs.531120