Menu
July 7, 2019

The Harvest suite for rapid core-genome alignment and visualization of thousands of intraspecific microbial genomes.

Whole-genome sequences are now available for many microbial species and clades, however, existing whole-genome alignment methods are limited in their ability to perform sequence comparisons of multiple sequences simultaneously. Here we present the Harvest suite of core-genome alignment and visualization tools for the rapid and simultaneous analysis of thousands of intraspecific microbial strains. Harvest includes Parsnp, a fast core-genome multi-aligner, and Gingr, a dynamic visual platform. Together they provide interactive core-genome alignments, variant calls, recombination detection, and phylogenetic trees. Using simulated and real data we demonstrate that our approach exhibits unrivaled speed while maintaining the accuracy of existing methods. The Harvest suite is open-source and freely available from: http://github.com/marbl/harvest.


July 7, 2019

Quality scores for 32,000 genomes.

More than 80% of the microbial genomes in GenBank are of ‘draft’ quality (12,553 draft vs. 2,679 finished, as of October, 2013). We have examined all the microbial DNA sequences available for complete, draft, and Sequence Read Archive genomes in GenBank as well as three other major public databases, and assigned quality scores for more than 30,000 prokaryotic genome sequences.Scores were assigned using four categories: the completeness of the assembly, the presence of full-length rRNA genes, tRNA composition and the presence of a set of 102 conserved genes in prokaryotes. Most (~88%) of the genomes had quality scores of 0.8 or better and can be safely used for standard comparative genomics analysis. We compared genomes across factors that may influence the score. We found that although sequencing depth coverage of over 100x did not ensure a better score, sequencing read length was a better indicator of sequencing quality. With few exceptions, most of the 30,000 genomes have nearly all the 102 essential genes.The score can be used to set thresholds for screening data when analyzing “all published genomes” and reference data is either not available or not applicable. The scores highlighted organisms for which commonly used tools do not perform well. This information can be used to improve tools and to serve a broad group of users as more diverse organisms are sequenced. Unexpectedly, the comparison of predicted tRNAs across 15,000 high quality genomes showed that anticodons beginning with an ‘A’ (codons ending with a ‘U’) are almost non-existent, with the exception of one arginine codon (CGU); this has been noted previously in the literature for a few genomes, but not with the depth found here.


July 7, 2019

The genome of the anaerobic fungus Orpinomyces sp. strain C1A reveals the unique evolutionary history of a remarkable plant biomass degrader.

Anaerobic gut fungi represent a distinct early-branching fungal phylum (Neocallimastigomycota) and reside in the rumen, hindgut, and feces of ruminant and nonruminant herbivores. The genome of an anaerobic fungal isolate, Orpinomyces sp. strain C1A, was sequenced using a combination of Illumina and PacBio single-molecule real-time (SMRT) technologies. The large genome (100.95 Mb, 16,347 genes) displayed extremely low G+C content (17.0%), large noncoding intergenic regions (73.1%), proliferation of microsatellite repeats (4.9%), and multiple gene duplications. Comparative genomic analysis identified multiple genes and pathways that are absent in Dikarya genomes but present in early-branching fungal lineages and/or nonfungal Opisthokonta. These included genes for posttranslational fucosylation, the production of specific intramembrane proteases and extracellular protease inhibitors, the formation of a complete axoneme and intraflagellar trafficking machinery, and a near-complete focal adhesion machinery. Analysis of the lignocellulolytic machinery in the C1A genome revealed an extremely rich repertoire, with evidence of horizontal gene acquisition from multiple bacterial lineages. Experimental analysis indicated that strain C1A is a remarkable biomass degrader, capable of simultaneous saccharification and fermentation of the cellulosic and hemicellulosic fractions in multiple untreated grasses and crop residues examined, with the process significantly enhanced by mild pretreatments. This capability, acquired during its separate evolutionary trajectory in the rumen, along with its resilience and invasiveness compared to prokaryotic anaerobes, renders anaerobic fungi promising agents for consolidated bioprocessing schemes in biofuels production.


July 7, 2019

Next generation sequencing technologies and the changing landscape of phage genomics.

The dawn of next generation sequencing technologies has opened up exciting possibilities for whole genome sequencing of a plethora of organisms. The 2nd and 3rd generation sequencing technologies, based on cloning-free, massively parallel sequencing, have enabled the generation of a deluge of genomic sequences of both prokaryotic and eukaryotic origin in the last seven years. However, whole genome sequencing of bacterial viruses has not kept pace with this revolution, despite the fact that their genomes are orders of magnitude smaller in size compared with bacteria and other organisms. Sequencing phage genomes poses several challenges; (1) obtaining pure phage genomic material, (2) PCR amplification biases and (3) complex nature of their genetic material due to features such as methylated bases and repeats that are inherently difficult to sequence and assemble. Here we describe conclusions drawn from our efforts in sequencing hundreds of bacteriophage genomes from a variety of Gram-positive and Gram-negative bacteria using Sanger, 454, Illumina and PacBio technologies. Based on our experience we propose several general considerations regarding sample quality, the choice of technology and a “blended approach” for generating reliable whole genome sequences of phages.


July 7, 2019

Analysis of a unique Clostridium botulinum strain from the Southern hemisphere producing a novel type E botulinum neurotoxin subtype.

Clostridium botulinum strains that produce botulinum neurotoxin type E (BoNT/E) are most commonly isolated from botulism cases, marine environments, and animals in regions of high latitude in the Northern hemisphere. A strain of C. botulinum type E (CDC66177) was isolated from soil in Chubut, Argentina. Previous studies showed that the amino acid sequences of BoNT/E produced by various strains differ by < 6% and that the type E neurotoxin gene cluster inserts into the rarA operon.Genetic and mass spectral analysis demonstrated that the BoNT/E produced by CDC66177 is a novel toxin subtype (E9). Toxin gene sequencing indicated that BoNT/E9 differed by nearly 11% at the amino acid level compared to BoNT/E1. Mass spectrometric analysis of BoNT/E9 revealed that its endopeptidase substrate cleavage site was identical to other BoNT/E subtypes. Further analysis of this strain demonstrated that its 16S rRNA sequence clustered with other Group II C. botulinum (producing BoNT types B, E, and F) strains. Genomic DNA isolated from strain CDC66177 hybridized with fewer probes using a Group II C. botulinum subtyping microarray compared to other type E strains examined. Whole genome shotgun sequencing of strain CDC66177 revealed that while the toxin gene cluster inserted into the rarA operon similar to other type E strains, its overall genome content shared greater similarity with a Group II C. botulinum type B strain (17B).These results expand our understanding of the global distribution of C. botulinum type E strains and suggest that the type E toxin gene cluster may be able to insert into C. botulinum strains with a more diverse genetic background than previously recognized.


July 7, 2019

Bacteriophage P70: unique morphology and unrelatedness to other Listeria bacteriophages.

Listeria monocytogenes is an important food-borne pathogen, and its bacteriophages find many uses in detection and biocontrol of its host. The novel broad-host-range virulent phage P70 has a unique morphology with an elongated capsid. Its genome sequence was determined by a hybrid sequencing strategy employing Sanger and PacBio techniques. The P70 genome contains 67,170 bp and 119 open reading frames (ORFs). Our analyses suggest that P70 represents an archetype of virus unrelated to other known Listeria bacteriophages.


July 7, 2019

Enabling the democratization of the genomics revolution with a fully integrated web-based bioinformatics platform.

Continued advancements in sequencing technologies have fueled the development of new sequencing applications and promise to flood current databases with raw data. A number of factors prevent the seamless and easy use of these data, including the breadth of project goals, the wide array of tools that individually perform fractions of any given analysis, the large number of associated software/hardware dependencies, and the detailed expertise required to perform these analyses. To address these issues, we have developed an intuitive web-based environment with a wide assortment of integrated and cutting-edge bioinformatics tools in pre-configured workflows. These workflows, coupled with the ease of use of the environment, provide even novice next-generation sequencing users with the ability to perform many complex analyses with only a few mouse clicks and, within the context of the same environment, to visualize and further interrogate their results. This bioinformatics platform is an initial attempt at Empowering the Development of Genomics Expertise (EDGE) in a wide range of applications for microbial research.© The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.


July 7, 2019

Genomic analysis of 495 vancomycin-resistant Enterococcus faecium reveals broad dissemination of a vanA plasmid in more than 19 clones from Copenhagen, Denmark.

From 2012 to 2014, there has been a huge increase in vancomycin-resistant (vanA) Enterococcus faecium (VREfm) in Copenhagen, Denmark, with 602 patients infected or colonized with VREfm in 2014 compared with just 22 in 2012. The objective of this study was to describe the genetic epidemiology of VREfm to assess the contribution of clonal spread and horizontal transfer of the vanA transposon (Tn1546) and plasmid in the dissemination of VREfm in hospitals.VREfm from Copenhagen, Denmark (2012-14) were whole-genome sequenced. The clonal structure was determined and the structure of Tn1546-like transposons was characterized. One VREfm isolate belonging to the largest clonal group was sequenced using long-read technology to close a 37 kb vanA plasmid.Phylogeny revealed a polyclonal structure where 495 VREfm isolates were divided into 13 main groups and 7 small groups. The majority of the isolates were located in three groups (n?=?44, 100 and 218) and clonal spread of VREfm between wards and hospitals was identified. Five Tn1546-like transposon types were identified. A dominant truncated transposon (type 4, 92%) was spread across all but one VREfm group. The closed vanA plasmid was highly covered by reads from isolates containing the type 4 transposon.This study suggests that it was the dissemination of the type 4 Tn1546-like transposon and plasmid via horizontal transfer to multiple populations of E. faecium, followed by clonal spread of new VREfm clones, that contributed to the increase in and diversity of VREfm in Danish hospitals.© The Author 2016. Published by Oxford University Press on behalf of the British Society for Antimicrobial Chemotherapy. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.


July 7, 2019

Detection, isolation and characterization of Fusobacterium gastrosuis sp. nov. colonizing the stomach of pigs.

Nine strains of a novel Fusobacterium sp. were isolated from the stomach of 6-8 months old and adult pigs. The isolates were obligately anaerobic, although they endured 2h exposure to air. Phylogenetic analysis based on 16S rRNA and gyrase B genes demonstrated that the isolates showed high sequence similarity with Fusobacterium mortiferum, Fusobacterium ulcerans, Fusobacterium varium, Fusobacterium russii and Fusobacterium necrogenes, but formed a distinct lineage in the genus Fusobacterium. Comparative analysis of the genome of the type strain of this novel Fusobacterium sp. confirmed that it is different from other recognized Fusobacterium spp. DNA-DNA hybridization, fingerprinting and genomic %GC determination further supported the conclusion that the isolates belong to a new, distinct species. The isolates were also distinguishable from these and other Fusobacterium spp. by phenotypical characterization. The strains produced indole and exhibited proline arylamidase and glutamic acid decarboxylase activity. They did not hydrolyse esculin, did not exhibit pyroglutamic acid arylamidase, valine arylamidase, a-galactosidase, ß-galactosidase, ß-galactosidase-6-phosphate or a-glucosidase activity nor produced acid from cellobiose, glucose, lactose, mannitol, mannose, maltose, raffinose, saccharose, salicin or trehalose. The major fatty acids were C16:0 and C18:1?9c. The name Fusobacterium gastrosuis sp. nov. is proposed for the novel isolates with the type strain CDW1(T) (=DSM 101753(T)=LMG 29236(T)). We also demonstrated that Clostridium rectum and mortiferum Fusobacterium represent the same species, with nomenclatural priority for the latter. Copyright © 2016 Elsevier GmbH. All rights reserved.


July 7, 2019

Evolutionary origins of the emergent ST796 clone of vancomycin resistant Enterococcus faecium.

From early 2012, a novel clone of vancomycin resistant Enterococcus faecium (assigned the multi locus sequence type ST796) was simultaneously isolated from geographically separate hospitals in south eastern Australia and New Zealand. Here we describe the complete genome sequence of Ef_aus0233, a representative ST796 E. faecium isolate. We used PacBio single molecule real-time sequencing to establish a high quality, fully assembled genome comprising a circular chromosome of 2,888,087 bp and five plasmids. Comparison of Ef_aus0233 to other E. faecium genomes shows Ef_aus0233 is a member of the epidemic hospital-adapted lineage and has evolved from an ST555-like ancestral progenitor by the accumulation or modification of five mosaic plasmids and five putative prophage, acquisition of two cryptic genomic islands, accrued chromosomal single nucleotide polymorphisms and a 80 kb region of recombination, also gaining Tn1549 and Tn916, transposons conferring resistance to vancomycin and tetracycline respectively. The genomic dissection of this new clone presented here underscores the propensity of the hospital E. faecium lineage to change, presumably in response to the specific conditions of hospital and healthcare environments.


July 7, 2019

Complete genome sequence of Lutibacter profundi LP1T isolated from an Arctic deep-sea hydrothermal vent system

Lutibacter profundi LP1T within the family Flavobacteriaceae was isolated from a biofilm growing on the surface of a black smoker chimney at the Loki’s Castle vent field, located on the Arctic Mid-Ocean Ridge. The complete genome of L. profundi LP1T is the first genome to be published within the genus Lutibacter. L. profundi LP1T consists of a single 2,966,978 bp circular chromosome with a GC content of 29.8%. The genome comprises 2,537 protein-coding genes, 40 tRNA species and 2 rRNA operons. The microaerophilic, organotrophic isolate contains genes for all central carbohydrate metabolic pathways. However, genes for the oxidative branch of the pentose-phosphate-pathway, the glyoxylate shunt of the tricarboxylic acid cycle and the ATP citrate lyase for reverse TCA are not present. L. profundi LP1T utilizes starch, sucrose and diverse proteinous carbon sources. In accordance, the genome harbours 130 proteases and 104 carbohydrate-active enzymes, indicating a specialization in degrading organic matter. Among a small arsenal of 24 glycosyl hydrolases, which offer the possibility to hydrolyse diverse poly- and oligosaccharides, a starch utilization cluster was identified. Furthermore, a variety of enzymes may be secreted via T9SS and contribute to the hydrolytic variety of the microorganism. Genes for gliding motility are present, which may enable the bacteria to move within the biofilm. A substantial number of genes encoding for extracellular polysaccharide synthesis pathways, curli fibres and attachment to surfaces could mediate adhesion in the biofilm and may contribute to the biofilm formation. In addition to aerobic respiration, the complete denitrification pathway and genes for sulphide oxidation e.g. sulphide:quinone reductase are present in the genome. sulphide:quinone reductase and denitrification may serve as detoxification systems allowing L. profundi LP1T to thrive in a sulphide and nitrate enriched environment. The information gained from the genome gives a greater insight in the functional role of L. profundi LP1T in the biofilm and its adaption strategy in an extreme environment.


July 7, 2019

The secondary resistome of multidrug-resistant Klebsiella pneumoniae.

Klebsiella pneumoniae causes severe lung and bloodstream infections that are difficult to treat due to multidrug resistance. We hypothesized that antimicrobial resistance can be reversed by targeting chromosomal non-essential genes that are not responsible for acquired resistance but essential for resistant bacteria under therapeutic concentrations of antimicrobials. Conditional essentiality of individual genes to antimicrobial resistance was evaluated in an epidemic multidrug-resistant clone of K. pneumoniae (ST258). We constructed a high-density transposon mutant library of >430,000 unique Tn5 insertions and measured mutant depletion upon exposure to three clinically relevant antimicrobials (colistin, imipenem or ciprofloxacin) by Transposon Directed Insertion-site Sequencing (TraDIS). Using this high-throughput approach, we defined three sets of chromosomal non-essential genes essential for growth during exposure to colistin (n?=?35), imipenem (n?=?1) or ciprofloxacin (n?=?1) in addition to known resistance determinants, collectively termed the “secondary resistome”. As proof of principle, we demonstrated that inactivation of a non-essential gene not previously found linked to colistin resistance (dedA) restored colistin susceptibility by reducing the minimum inhibitory concentration from 8 to 0.5?µg/ml, 4-fold below the susceptibility breakpoint (S?=?2?µg/ml). This finding suggests that the secondary resistome is a potential target for developing antimicrobial “helper” drugs that restore the efficacy of existing antimicrobials.


July 7, 2019

Antibiotic discovery throughout the Small World Initiative: A molecular strategy to identify biosynthetic gene clusters involved in antagonistic activity.

The emergence of bacterial pathogens resistant to all known antibiotics is a global health crisis. Adding to this problem is that major pharmaceutical companies have shifted away from antibiotic discovery due to low profitability. As a result, the pipeline of new antibiotics is essentially dry and many bacteria now resist the effects of most commonly used drugs. To address this global health concern, citizen science through the Small World Initiative (SWI) was formed in 2012. As part of SWI, students isolate bacteria from their local environments, characterize the strains, and assay for antibiotic production. During the 2015 fall semester at Bowling Green State University, students isolated 77 soil-derived bacteria and genetically characterized strains using the 16S rRNA gene, identified strains exhibiting antagonistic activity, and performed an expanded SWI workflow using transposon mutagenesis to identify a biosynthetic gene cluster involved in toxigenic compound production. We identified one mutant with loss of antagonistic activity and through subsequent whole-genome sequencing and linker-mediated PCR identified a 24.9 kb biosynthetic gene locus likely involved in inhibitory activity in that mutant. Further assessment against human pathogens demonstrated the inhibition of Bacillus cereus, Listeria monocytogenes, and methicillin-resistant Staphylococcus aureus in the presence of this compound, thus supporting our molecular strategy as an effective research pipeline for SWI antibiotic discovery and genetic characterization.© 2017 The Authors. MicrobiologyOpen published by John Wiley & Sons Ltd.


Talk with an expert

If you have a question, need to check the status of an order, or are interested in purchasing an instrument, we're here to help.