What can we learn from over 100,000 Escherichia coli genomes?

Kaleb Z Abram, Zulema Udaondo, Carissa Bleker, Visanu Wanchai, Trudy M Wassenaar, Michael S Robeson, David W Ussery

bioRxiv 708131; doi: https://doi.org/10.1101/708131

Abstract

The explosion of microbial genome sequences in public databases allows for large-scale population genomic studies of bacterial species, such as Escherichia coli. In this study, we examine and classify more than one hundred thousand E. coli and Shigella genomes. After removing outliers, a semi-automated Mash-based analysis of 10,667 assembled genomes reveals 14 distinct phylogroups. A representative genome or medoid identified for each phylogroup serves as a proxy to classify more than 95,000 unassembled genomes. This analysis shows that most sequenced E. coli genomes belong to 4 phylogroups (A, C, B1 and E2(O157)). Authenticity of the 14 phylogroups described is supported by pangenomic and phylogenetic analyses, which show differences in gene preservation between phylogroups. A phylogenetic tree constructed with 2,613 single copy core genes along with a matrix of phylogenetic profiles is used to confirm that the 14 phylogroups change at different rates of gene gain/loss/duplication. The methodology used in this work is able to identify previously uncharacterized phylogroups in E. coli species. Some of these new phylogroups harbor clonal strains that have undergone a process of genomic adaptation to the acquisition of new genomic elements related to virulence or antibiotic resistance. This is, to our knowledge, the largest E. coli genome dataset analyzed to date and provides valuable insights into the population structure of the species.
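To illustrate the medoid idea mentioned in the abstract, here is a minimal Python sketch. It is not the authors' pipeline: the function names `find_medoid` and `classify` are hypothetical, and the distances are made-up toy values standing in for pairwise genome distances such as those a Mash-based analysis would produce. The sketch assumes a medoid is the group member with the smallest total distance to the other members, and that a new genome is assigned to the phylogroup whose medoid is nearest.

```python
from typing import Dict, Sequence


def find_medoid(members: Sequence[str], dist: Dict[str, Dict[str, float]]) -> str:
    """Return the member with the smallest summed distance to all members."""
    return min(members, key=lambda g: sum(dist[g][h] for h in members))


def classify(query_to_medoid: Dict[str, float]) -> str:
    """Return the phylogroup label whose medoid is closest to the query."""
    return min(query_to_medoid, key=lambda label: query_to_medoid[label])


# Toy symmetric distance matrix for three genomes in one hypothetical group.
toy_dist = {
    "g1": {"g1": 0.0, "g2": 0.01, "g3": 0.03},
    "g2": {"g1": 0.01, "g2": 0.0, "g3": 0.015},
    "g3": {"g1": 0.03, "g2": 0.015, "g3": 0.0},
}

medoid = find_medoid(["g1", "g2", "g3"], toy_dist)

# Toy distances from one unclassified genome to two phylogroup medoids.
assigned = classify({"A": 0.02, "B1": 0.005})

print(medoid, assigned)
```

Comparing each new genome only against one representative per phylogroup keeps the number of distance computations proportional to the number of phylogroups rather than to the number of reference genomes.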

Read the publication here: https://www.biorxiv.org/content/10.1101/708131v2
