A simple shortcut to unsupervised alignment-free phylogenetic genome groupings, even from unassembled sequencing reads

A simple shortcut to unsupervised alignment-free phylogenetic genome groupings, even from unassembled sequencing reads
Title:
A simple shortcut to unsupervised alignment-free phylogenetic genome groupings, even from unassembled sequencing reads
Other Titles:
Journal of Bioinformatics and Computational Biology
Keywords:
Publication Date:
02 December 2013
Citation:
SEBASTIAN MAURER-STROH et al, J. Bioinform. Comput. Biol. 11, 1343005 (2013)
Abstract:
We propose an extension to alignment-free approaches that can produce reasonably accurate phylogenetic groupings starting from unaligned genomes, for example, as fast as 1 min on a standard desktop computer for 25 bacterial genomes. A 6-fold speed-up and 11-fold reduction in memory requirements compared to previous alignment-free methods is achieved by reducing the comparison space to a representative sample of k-mers of optimal length and with specific tag motifs. This approach was applied to the test case of fitting the enterohemorrhagic O104:H4 E.coli strain from the 2011 outbreak in Germany into the phylogenetic network of previously known E.coli-related strains and extend the method to allow assigning any new strain to the correct phylogenetic group even directly from unassembled short sequence reads from next generation sequencing data. Hence, this approach is also useful to quickly identify the most suitable reference genome for subsequent assembly steps.
License type:
PublisherCopyrights
Funding Info:
Description:
ISSN:
0219-7200
Files uploaded: