Single Molecule, Real-Time (SMRT) Sequencing holds promise for addressing new frontiers to understand molecular mechanisms in evolution and gain insight into adaptive strategies. With read lengths exceeding 10 kb, we are able to sequence high-quality, closed microbial genomes with associated plasmids, and investigate large genome complexities, such as long, highly repetitive, low-complexity regions and multiple tandem-duplication events. Improved genome quality, observed at 99.9999% (QV60) consensus accuracy, and significant reduction of gap regions in reference genomes (up to and beyond 50%) allow researchers to better understand coding sequences with high confidence, investigate potential regulatory mechanisms in noncoding regions, and make inferences about evolutionary strategies that are otherwise missed by the coverage biases associated with short- read sequencing technologies. Additional benefits afforded by SMRT Sequencing include the simultaneous capability to detect epigenomic modifications and obtain full-length cDNA transcripts that obsolete the need for assembly. With direct sequencing of DNA in real-time, this has resulted in the identification of numerous base modifications and motifs, which genome-wide profiles have linked to specific methyltransferase activities. Our new offering, the Iso-Seq Application, allows for the accurate differentiation between transcript isoforms that are difficult to resolve with short-read technologies. PacBio reads easily span transcripts such that both 5’/3’ primers for cDNA library generation and the poly-A tail are observed. As such, exon configuration and intron retention events can be analyzed without ambiguity. This technological advance is useful for characterizing transcript diversity and improving gene structure annotations in reference genomes. We review solutions available with SMRT Sequencing, from targeted sequencing efforts to obtaining reference genomes (>100 Mb). This includes strategies for identifying microsatellites and conducting phylogenetic comparisons with targeted gene families. We highlight how to best leverage our long reads that have exceeded 20 kb in length for research investigations, as well as currently available bioinformatics strategies for analysis. Benefits for these applications are further realized with consistent use of size selection of input sample using the BluePippin™ device from Sage Science as demonstrated in our genome improvement projects. Using the latest P5-C3 chemistry on model organisms, these efforts have yielded an observed contig N50 of ~6 Mb, with the longest contig exceeding 12.5 Mb and an average base quality of QV50.
Complete genome sequence provides insights into the quorum sensing-related spoilage potential of Shewanella baltica 128 isolated from spoiled shrimp.
Shewanella baltica 128 is a specific spoilage organism (SSO) isolated from the refrigerated shrimp that results in shrimp spoilage. This study reported the complete genome sequencing of this strain, with the primary annotations associated with amino acid transport and metabolism (8.66%), indicating that S. baltica 128 has good potential for degrading proteins. In vitro experiments revealed Shewanella baltica 128 could adapt to the stress conditions by regulating its growth and biofilm formation. Genes that related to the spoilage-related metabolic pathways, including trimethylamine metabolism (torT), sulfur metabolism (cysM), putrescine metabolism (speC), biofilm formation (rpoS) and serine protease production (degS), were identified. Genes (LuxS, pfs, LuxR and qseC) that related to the specific QS system were also identified. Complete genome sequence of S. baltica 128 provide insights into the QS-related spoilage potential, which might provide novel information for the development of new approaches for spoilage detection and prevention based on QS target.Copyright © 2019. Published by Elsevier Inc.
Citrobacter rodentium strain DBS100 causes an infection of the intestines in mice. It provides an important model for human gastrointestinal pathogens, such as enteropathogenic and enterohemorrhagic Escherichia coli, which cause life-threatening infections. To identify the genetic determinants that are common across the enteropathogenic bacteria, we sequenced the DBS100 genome.Copyright © 2019 Popov et al.
The Complete Genome of the Atypical Enteropathogenic Escherichia coli Archetype Isolate E110019 Highlights a Role for Plasmids in Dissemination of the Type III Secreted Effector EspT.
Enteropathogenic Escherichia coli (EPEC) is a leading cause of moderate to severe diarrhea among young children in developing countries, and EPEC isolates can be subdivided into two groups. Typical EPEC (tEPEC) bacteria are characterized by the presence of both the locus of enterocyte effacement (LEE) and the plasmid-encoded bundle-forming pilus (BFP), which are involved in adherence and translocation of type III effectors into the host cells. Atypical EPEC (aEPEC) bacteria also contain the LEE but lack the BFP. In the current report, we describe the complete genome of outbreak-associated aEPEC isolate E110019, which carries four plasmids. Comparative genomic analysis demonstrated that the type III secreted effector EspT gene, an autotransporter gene, a hemolysin gene, and putative fimbrial genes are all carried on plasmids. Further investigation of 65 espT-containing E. coli genomes demonstrated that different espT alleles are associated with multiple plasmids that differ in their overall gene content from the E110019 espT-containing plasmid. EspT has been previously described with respect to its role in the ability of E110019 to invade host cells. While other type III secreted effectors of E. coli have been identified on insertion elements and prophages of the chromosome, we demonstrated in the current study that the espT gene is located on multiple unique plasmids. These findings highlight a role of plasmids in dissemination of a unique E. coli type III secreted effector that is involved in host invasion and severe diarrheal illness.Copyright © 2019 American Society for Microbiology.
SMRT sequencing reveals differential patterns of methylation in two O111:H- STEC isolates from a hemolytic uremic syndrome outbreak in Australia.
In 1995 a severe haemolytic-uremic syndrome (HUS) outbreak in Adelaide occurred. A recent genomic analysis of Shiga toxigenic Escherichia coli (STEC) O111:H- strains 95JB1 and 95NR1 from this outbreak found that the more virulent isolate, 95NR1, harboured two additional copies of the Shiga toxin 2 (Stx2) genes encoded within prophage regions. The structure of the Stx2-converting prophages could not be fully resolved using short-read sequence data alone and it was not clear if there were other genomic differences between 95JB1 and 95NR1. In this study we have used Pacific Biosciences (PacBio) single molecule real-time (SMRT) sequencing to characterise the genome and methylome of 95JB1 and 95NR1. We completely resolved the structure of all prophages including two, tandemly inserted, Stx2-converting prophages in 95NR1 that were absent from 95JB1. Furthermore we defined all insertion sequences and found an additional IS1203 element in the chromosome of 95JB1. Our analysis of the methylome of 95NR1 and 95JB1 identified hemi-methylation of a novel motif (5′-CTGCm6AG-3′) in more than 4000 sites in the 95NR1 genome. These sites were entirely unmethylated in the 95JB1 genome, and included at least 177 potential promoter regions that could contribute to regulatory differences between the strains. IS1203 mediated deactivation of a novel type IIG methyltransferase in 95JB1 is the likely cause of the observed differential patterns of methylation between 95NR1 and 95JB1. This study demonstrates the capability of PacBio SMRT sequencing to resolve complex prophage regions and reveal the genetic and epigenetic heterogeneity within a clonal population of bacteria.
Differential transcriptome analysis of enterohemorrhagic Escherichia coli strains reveals differences in response to plant-derived compounds.
Several serious vegetable-associated outbreaks of enterohemorrhagic Escherichia coli (EHEC) infections have occurred during the last decades. In this context, vegetables have been suggested to function as secondary reservoirs for EHEC strains. Increased knowledge about the interaction of EHEC with plants including gene expression patterns in response to plant-derived compounds is required. In the current study, EHEC O157:H7 strain Sakai, EHEC O157:H- strain 3072/96, and the EHEC/enteroaggregative E. coli (EAEC) hybrid O104:H4 strain C227-11fcu were grown in lamb’s lettuce medium and in M9 minimal medium to study the differential transcriptional response of these strains to plant-derived compounds with RNA-Seq technology.Many genes involved in carbohydrate degradation and peptide utilization were similarly upregulated in all three strains, suggesting that the lamb’s lettuce medium provides sufficient nutrients for proliferation. In particular, the genes galET and rbsAC involved in galactose metabolism and D-ribose catabolism, respectively, were uniformly upregulated in the investigated strains. The most prominent differences in shared genome transcript levels were observed for genes involved in the expression of flagella. Transcripts of all three classes of the flagellar hierarchy were highly abundant in strain C227-11fcu. Strain Sakai expressed only genes encoding the basal flagellar structure. In addition, both strains showed increased motility in presence of lamb’s lettuce extract. Moreover, strain 3072/96 showed increased transcription activity for genes encoding the type III secretion system (T3SS) including effectors, and was identified as a powerful biofilm-producer in M9 minimal medium.The current study provides clear evidence that EHEC and EHEC/EAEC strains are able to adjust their gene expression patterns towards metabolization of plant-derived compounds, demonstrating that they may proliferate well in a plant-associated environment. Moreover, we propose that flagella and other surface structures play a fundamental role in the interaction of EHEC and EHEC/EAEC with plants.
Comparative genomics reveals structural and functional features specific to the genome of a foodborne Escherichia coli O157:H7.
Escherichia coli O157:H7 (O157) has been linked to numerous foodborne disease outbreaks. The ability to rapidly sequence and analyze genomes is important for understanding epidemiology, virulence, survival, and evolution of outbreak strains. In the current study, we performed comparative genomics to determine structural and functional features of the genome of a foodborne O157 isolate NADC 6564 and infer its evolutionary relationship to other O157 strains.The chromosome of NADC 6564 contained 5466?kb compared to reference strains Sakai (5498?kb) and EDL933 (5547?kb) and shared 41 of its 43 Linear Conserved Blocks (LCB) with the reference strains. However, 18 of 41 LCB had inverse orientation in NADC 6564 compared to the reference strains. NADC 6564 shared 18 of 19 bacteriophages with reference strains except that the chromosomal positioning of some of the phages differed among these strains. The additional phage (P19) of NADC 6564 was located on a 39-kb insertion element (IE) encoding several hypothetical proteins, an integrase, transposases, transcriptional regulators, an adhesin, and a phosphoethanolamine transferase (PEA). The complete homologs of the 39-kb?IE were found in E. coli PCN061 of porcine origin. The IE-encoded PEA showed low homology (32-33%) to four other PEA in NADC 6564 and PEA linked to mobilizable colistin resistance in E. coli but was highly homologous (95%) to a PEA of uropathogenic, avian pathogenic, and enteroaggregative E. coli. NADC 6564 showed slightly higher minimum inhibitory concentration of colistin compared to the reference strains. The 39-kb?IE also contained dndBCDE and dptFGH operons encoding DNA S-modification and a restriction pathway, linked to oxidative stress tolerance and self-defense against foreign DNA, respectively. Evolutionary tree analysis grouped NADC 6564 with lineage I O157 strains.These results indicated that differential phage counts and different chromosomal positioning of many bacteriophages and genomic islands might have resulted in recombination events causing altered chromosomal organization in NADC 6564. Evolutionary analysis grouped NADC 6564 with lineage I strains and suggested its earlier divergence from these strains. The ability to perform S-DNA modification might affect tolerance of NADC 6564 to various stressors.