July 7, 2019

Evaluation and validation of assembling corrected PacBio long reads for microbial genome completion via hybrid approaches.

Despite the ever-increasing output of next-generation sequencing data and the continuing development of assemblers, dozens to hundreds of gaps still exist in de novo microbial assemblies due to uneven coverage and large genomic repeats. Third-generation single-molecule, real-time (SMRT) sequencing technology avoids amplification artifacts and generates kilobase-long reads with the potential to complete microbial genome assembly. However, due to the low accuracy (~85%) of third-generation sequences, a considerable number of long reads (>50X) is required for self-correction and for subsequent de novo assembly. Recently developed hybrid approaches, using next-generation sequencing data and as few as 5X long reads, have been proposed to improve the completeness of microbial assembly. In this study we have evaluated the contemporary hybrid approaches and demonstrated that assembling corrected long reads (by runCA) produced the best assembly compared to long-read scaffolding (e.g., AHA, Cerulean and SSPACE-LongRead) and gap-filling (SPAdes). For generating corrected long reads, we further examined long-read correction tools, such as ECTools, LSC, LoRDEC, PBcR pipeline and proovread. We have demonstrated that three microbial genomes including Escherichia coli K12 MG1655, Meiothermus ruber DSM1279 and Pedobacter heparinus DSM2366 were successfully hybrid assembled by runCA into near-perfect assemblies using ECTools-corrected long reads. In addition, we developed a tool, Patch, which takes corrected long reads and pre-assembled contigs as inputs, to enhance microbial genome assemblies. With the additional 20X long reads, short reads of S. cerevisiae W303 were hybrid assembled into 115 contigs using the verified strategy, ECTools + runCA. Patch was subsequently applied to upgrade the assembly to a 35-contig draft genome.
Our evaluation of the hybrid approaches shows that assembling the ECTools-corrected long reads via runCA generates near complete microbial genomes, suggesting that genome assembly could benefit from re-analyzing the available hybrid datasets that were not assembled in an optimal fashion.
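
The coverage figures quoted in this abstract (>50X long reads for self-correction, as few as 5X for hybrid approaches) can be turned into an approximate sequencing yield, since coverage is total sequenced bases divided by genome size. The following is only a minimal sketch: the genome length of 4,641,652 bp for E. coli K12 MG1655 is an assumption taken from the commonly cited reference sequence, not from the abstract, and the function name is hypothetical.

```python
import math

# Assumption: reference length of E. coli K12 MG1655; not stated in the abstract.
ECOLI_MG1655_BP = 4_641_652


def required_bases(coverage: float, genome_size_bp: int) -> int:
    """Return the total number of sequenced bases needed for a given fold coverage."""
    if coverage < 0 or genome_size_bp <= 0:
        raise ValueError("coverage must be >= 0 and genome size must be > 0")
    return math.ceil(coverage * genome_size_bp)


if __name__ == "__main__":
    # 50X and 5X are the coverage levels mentioned in the abstract.
    for fold in (50, 5):
        bases = required_bases(fold, ECOLI_MG1655_BP)
        print(f"{fold}X coverage needs about {bases / 1e6:.1f} Mb of long reads")
```

Under these assumptions the hybrid strategy needs roughly one tenth of the long-read yield that self-correction does, which is the practical point the abstract makes.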


July 7, 2019

Global insights into acetic acid resistance mechanisms and genetic stability of Acetobacter pasteurianus strains by comparative genomics.

Acetobacter pasteurianus (Ap) CICC 20001 and CGMCC 1.41 are two acetic acid bacteria strains that, because of their strong abilities to produce and tolerate high concentrations of acetic acid, have been widely used to brew vinegar in China. To globally understand their fermentation characteristics, acid-tolerance mechanisms and genetic stability, their genomes were sequenced. Genomic comparisons with 9 other sequenced Ap strains revealed that their chromosomes were evolutionarily conserved, whereas the plasmids were unique compared with other Ap strains. Analysis of the acid-tolerant metabolic pathway at the genomic level indicated that the metabolism of some amino acids and the known mechanisms of acetic acid tolerance might collaboratively contribute to acetic acid resistance in Ap strains. The balance of instability factors and stability factors in the genomes of Ap CICC 20001 and CGMCC 1.41 strains might be the basis for their genetic stability, consistent with their stable industrial performances. These observations provide important insights into the acid resistance mechanism and the genetic stability of Ap strains and lay a foundation for future genetic manipulation and engineering of these two strains.


July 7, 2019

SMRT sequencing of the Campylobacter coli BfR-CA-9557 genome sequence reveals unique methylation motifs.

Campylobacter species are the most prevalent bacterial pathogens causing acute enteritis worldwide. In contrast to Campylobacter jejuni, about 5 % of Campylobacter coli strains exhibit susceptibility to restriction endonuclease digestion by DpnI, which specifically cuts 5′-G(m)ATC-3′ motifs. This indicates significant differences in DNA methylation between the two species. The goal of the study was to analyze the methylome of a C. coli strain susceptible to DpnI digestion, to identify its methylation motifs and restriction modification systems (RM-systems), and compare them to related organisms like C. jejuni and Helicobacter pylori. Using one SMRT cell and the PacBio RS sequencing technology followed by PacBio Modification and Motif Analysis, the complete genome of the DpnI susceptible strain C. coli BfR-CA-9557 was sequenced to 500-fold coverage and assembled into a single contig of 1.7 Mbp. The genome contains a CJIE1-like element prophage and is phylogenetically closer to C. coli clade 1 isolates than clade 3. 45,881 6-methylated adenines (ca. 2.7 % of genome positions) that are predominantly arranged in eight different methylation motifs and 1,788 4-methylated cytosines (ca. 0.1 %) have been detected. Only two of these motifs correspond to known restriction modification motifs. A characteristic of this methylome was the very high fraction of methylated motif sites, mostly above 99 %. Only five dominant methylation motifs have been identified in C. jejuni, which have been associated with known RM-systems. C. coli BfR-CA-9557 shares one (RAATTY) of these, but four ORFs could be assigned to putative Type I RM-systems, seven ORFs to Type II RM-systems and three ORFs to Type IV RM-systems. In accordance with DpnI prescreening RM-system IIP, methylation of GATC motifs was detected in C. coli BfR-CA-9557. A homologous IIP RM-system has been described for H. pylori. The remaining methylation motifs are specific for C. coli BfR-CA-9557 and have been detected neither in C. jejuni nor in H. pylori. The results of this study give us new insights into epigenetics of Campylobacteraceae and provide the groundwork to resolve the function of RM-systems in C. coli.
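
The percentages quoted in this abstract follow from the reported site counts and the assembled genome size. A minimal sketch of that arithmetic, assuming a genome length of exactly 1,700,000 bp because the abstract gives only the rounded value of 1.7 Mbp; the function and constant names are hypothetical.

```python
# Counts reported in the abstract.
M6A_SITES = 45_881   # 6-methylated adenines
M4C_SITES = 1_788    # 4-methylated cytosines

# Assumption: only the rounded size ("1.7 Mbp") is given, so the exact
# genome length is not known here.
GENOME_SIZE_BP = 1_700_000


def percent_of_genome(sites: int, genome_size_bp: int) -> float:
    """Return the share of genome positions carrying a modification, in percent."""
    if genome_size_bp <= 0:
        raise ValueError("genome size must be positive")
    return 100.0 * sites / genome_size_bp


if __name__ == "__main__":
    for label, count in (("m6A", M6A_SITES), ("m4C", M4C_SITES)):
        share = percent_of_genome(count, GENOME_SIZE_BP)
        print(f"{label}: {share:.1f} % of genome positions")
```

With the rounded genome size, the results agree with the approximate values of 2.7 % and 0.1 % given in the abstract.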


July 7, 2019

Whole-genome sequence of an evolved Clostridium pasteurianum strain reveals Spo0A deficiency responsible for increased butanol production and superior growth.

Biodiesel production results in crude glycerol waste from the transesterification of fatty acids (10 % w/w). The solventogenic Clostridium pasteurianum, an anaerobic Firmicute, can produce butanol from glycerol as the sole carbon source. Coupling butanol fermentation with biodiesel production can improve the overall economic viability of biofuels. However, crude glycerol contains growth-inhibiting byproducts which reduce feedstock consumption and solvent production. To obtain a strain with improved characteristics, a random mutagenesis and directed evolution selection technique was used. A wild-type C. pasteurianum (ATCC 6013) culture was chemically mutagenized, and the resulting population underwent 10 days of selection in increasing concentrations of crude glycerol (80-150 g/L). The best-performing mutant (M150B) showed a 91 % increase in butanol production in 100 g/L crude glycerol compared to the wild-type strain, as well as increased growth rate, a higher final optical density, and less production of the side product PDO (1,3-propanediol). Wild-type and M150B strains were sequenced via Single Molecule Real-Time (SMRT) sequencing. Mutations introduced to the M150B genome were identified by sequence comparison to the wild-type and published closed sequences. A major mutation (a deletion) in the gene of the master transcriptional regulator of sporulation, Spo0A, was identified. A spo0A single gene knockout strain was constructed using a double-crossover genome-editing method. The Spo0A-deficient strain showed similar tolerance to crude glycerol as the evolved mutant strain M150B. Methylation patterns on genomic DNA identified by SMRT sequencing were used to transform plasmid DNA to overcome the native C. pasteurianum restriction endonuclease. Solvent production in the absence of Spo0A shows that C. pasteurianum differs in solvent-production regulation from other solventogenic Clostridium species. Growth-associated butanol production shows C. pasteurianum to be an attractive option for further engineering as it may prove a better candidate for butanol production through continuous fermentation.


July 7, 2019

Methylome diversification through changes in DNA methyltransferase sequence specificity.

Epigenetic modifications such as DNA methylation have large effects on gene expression and genome maintenance. Helicobacter pylori, a human gastric pathogen, has a large number of DNA methyltransferase genes, with different strains having unique repertoires. Previous genome comparisons suggested that these methyltransferases often change DNA sequence specificity through domain movement, that is, the movement between and within genes of coding sequences of target recognition domains. Using single-molecule real-time sequencing technology, which detects N6-methyladenines and N4-methylcytosines with single-base resolution, we studied methylated DNA sites throughout the H. pylori genome for several closely related strains. Overall, the methylome was highly variable among closely related strains. Hypermethylated regions were found, for example, in the rpoB gene for RNA polymerase. We identified DNA sequence motifs for methylation and then assigned each of them to a specific homology group of the target recognition domains in the specificity-determining genes for Type I and other restriction-modification systems. These results supported proposed mechanisms for sequence-specificity changes in DNA methyltransferases. Knocking out one of the Type I specificity genes led to transcriptome changes, which suggested its role in gene expression. These results are consistent with the concept of evolution driven by DNA methylation, in which changes in the methylome lead to changes in the transcriptome and potentially to changes in phenotype, providing targets for natural or artificial selection.


July 7, 2019

Complete genome sequences of Salmonella enterica serovar Heidelberg strains associated with a multistate food-borne illness investigation.

Next-generation sequencing is being evaluated for use with food-borne illness investigations, especially when the outbreak strains produce patterns that cannot be discriminated from non-outbreak strains using conventional procedures. Here we report complete genome assemblies of two Salmonella enterica serovar Heidelberg strains with a common pulsed-field gel electrophoresis pattern isolated during an outbreak investigation.


July 7, 2019

Strain Kaplan of pseudorabies virus genome sequenced by PacBio single-molecule real-time sequencing technology.

Pseudorabies virus (PRV) is a neurotropic herpesvirus that causes Aujeszky’s disease in pigs. PRV strains are widely used as transsynaptic tracers for mapping neural circuits. We present here the complete and fully annotated genome sequence of strain Kaplan of PRV, determined by Pacific Biosciences RSII long-read sequencing technology. Copyright © 2014 Tombácz et al.


July 7, 2019

Complete genome sequences of eight Helicobacter pylori strains with different virulence factor genotypes and methylation profiles, isolated from patients with diverse gastrointestinal diseases on Okinawa Island, Japan, determined using PacBio Single-Molecule Real-Time Technology.

We report the complete genome sequences of eight Helicobacter pylori strains isolated from patients with gastrointestinal diseases in Okinawa, Japan. Whole-genome sequencing and DNA methylation detection were performed using the PacBio platform. De novo assembly determined a single, complete contig for each strain. Furthermore, methylation analysis identified virulence factor genotype-dependent motifs.


July 7, 2019

Novel giant siphovirus from Bacillus anthracis features unusual genome characteristics.

Here we present vB_BanS-Tsamsa, a novel temperate phage isolated from Bacillus anthracis, the agent responsible for anthrax infections in wildlife, livestock and humans. Tsamsa phage is a giant siphovirus (order Caudovirales), featuring a long, flexible and non-contractile tail of 440 nm (not including baseplate structure) and an isometric head of 82 nm in diameter. We induced Tsamsa phage in samples from two different carcass sites in Etosha National Park, Namibia. The Tsamsa phage genome is the largest sequenced Bacillus siphovirus, containing 168,876 bp and 272 ORFs. The genome features an integrase/recombinase enzyme, indicative of a temperate lifestyle. Among bacterial strains tested, the phage infected only certain members of the Bacillus cereus sensu lato group (B. anthracis, B. cereus and B. thuringiensis) and exhibited moderate specificity for B. anthracis. Tsamsa lysed seven out of 25 B. cereus strains, two out of five B. thuringiensis strains and six out of seven B. anthracis strains tested. It did not lyse B. anthracis PAK-1, an atypical strain that is also resistant to both gamma phage and cherry phage. The Tsamsa endolysin features a broader lytic spectrum than the phage host range, indicating possible use of the enzyme in Bacillus biocontrol.
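
The host-range counts in this abstract can be expressed as lysis rates, which makes the stated "moderate specificity for B. anthracis" easier to see. A minimal sketch using only the counts reported above; the dictionary and function names are hypothetical.

```python
# (strains lysed, strains tested) as reported in the abstract.
LYSIS_RESULTS = {
    "B. cereus": (7, 25),
    "B. thuringiensis": (2, 5),
    "B. anthracis": (6, 7),
}


def lysis_rates(results: dict[str, tuple[int, int]]) -> dict[str, float]:
    """Return the percentage of tested strains that were lysed, per species."""
    rates = {}
    for species, (lysed, tested) in results.items():
        if tested <= 0 or not 0 <= lysed <= tested:
            raise ValueError(f"invalid counts for {species}")
        rates[species] = 100.0 * lysed / tested
    return rates


if __name__ == "__main__":
    for species, rate in lysis_rates(LYSIS_RESULTS).items():
        print(f"{species}: {rate:.1f} % of tested strains lysed")
```

The sample sizes are small (five to 25 strains per species), so these rates are descriptive only and should not be read as precise estimates.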


July 7, 2019

Type I restriction enzymes and their relatives.

Type I restriction enzymes (REases) are large pentameric proteins with separate restriction (R), methylation (M) and DNA sequence-recognition (S) subunits. They were the first REases to be discovered and purified, but unlike the enormously useful Type II REases, they have yet to find a place in the enzymatic toolbox of molecular biologists. Type I enzymes have been difficult to characterize, but this is changing as genome analysis reveals their genes, and methylome analysis reveals their recognition sequences. Several Type I REases have been studied in detail and what has been learned about them invites greater attention. In this article, we discuss aspects of the biochemistry, biology and regulation of Type I REases, and of the mechanisms that bacteriophages and plasmids have evolved to evade them. Type I REases have a remarkable ability to change sequence specificity by domain shuffling and rearrangements. We summarize the classic experiments and observations that led to this discovery, and we discuss how this ability depends on the modular organizations of the enzymes and of their S subunits. Finally, we describe examples of Type II restriction-modification systems that have features in common with Type I enzymes, with emphasis on the varied Type IIG enzymes.


July 7, 2019

Whole-genome assembly of Klebsiella pneumoniae coproducing NDM-1 and OXA-232 carbapenemases using Single-Molecule, Real-Time Sequencing.

The whole-genome sequence of a carbapenem-resistant Klebsiella pneumoniae strain, PittNDM01, which coproduces NDM-1 and OXA-232 carbapenemases, was determined in this study. The use of single-molecule, real-time (SMRT) sequencing provided a closed genome in a single sequencing run. K. pneumoniae PittNDM01 has a single chromosome of 5,348,284 bp and four plasmids: pPKPN1 (283,371 bp), pPKPN2 (103,694 bp), pPKPN3 (70,814 bp), and pPKPN4 (6,141 bp). The contents of the chromosome were similar to those of the K. pneumoniae reference genome strain MGH 78578, with the exception of a large inversion spanning 23.3% of the chromosome. In contrast, three of the four plasmids are unique. The plasmid pPKPN1, an IncHI1B-like plasmid, carries the blaNDM-1, armA, and qnrB1 genes, along with tellurium and mercury resistance operons. blaNDM-1 is carried on a unique structure in which Tn125 is further bracketed by IS26 downstream of a class 1 integron. The IncFIA-like plasmid pPKPN3 also carries an array of resistance elements, including blaCTX-M-15 and a mercury resistance operon. The ColE-type plasmid pPKPN4 carrying blaOXA-232 is identical to a plasmid previously reported from France. SMRT sequencing was useful in resolving the complex bacterial genomic structures in the de novo assemblies. Copyright © 2014, American Society for Microbiology. All Rights Reserved.
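
The replicon sizes in this abstract can be combined into a total genome size and an approximate length for the reported inversion. A minimal sketch using only the figures stated above; the variable and function names are hypothetical, and the inversion length is approximate because the abstract gives only a rounded percentage (23.3%).

```python
# Replicon sizes in bp, as reported in the abstract.
CHROMOSOME_BP = 5_348_284
PLASMIDS_BP = {
    "pPKPN1": 283_371,
    "pPKPN2": 103_694,
    "pPKPN3": 70_814,
    "pPKPN4": 6_141,
}
INVERSION_FRACTION = 0.233  # 23.3% of the chromosome, rounded in the abstract


def total_genome_bp(chromosome_bp: int, plasmids_bp: dict[str, int]) -> int:
    """Return the combined length of the chromosome and all plasmids."""
    return chromosome_bp + sum(plasmids_bp.values())


def inversion_length_bp(chromosome_bp: int, fraction: float) -> int:
    """Return the approximate inversion length, rounded to the nearest bp."""
    if not 0.0 <= fraction <= 1.0:
        raise ValueError("fraction must be between 0 and 1")
    return round(chromosome_bp * fraction)


if __name__ == "__main__":
    print(f"Total genome: {total_genome_bp(CHROMOSOME_BP, PLASMIDS_BP):,} bp")
    inversion = inversion_length_bp(CHROMOSOME_BP, INVERSION_FRACTION)
    print(f"Inversion: roughly {inversion / 1e6:.2f} Mbp")
```

An inversion of this size, on the order of 1.2 Mbp, is the kind of large structural feature that the abstract credits long reads with resolving.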


July 7, 2019

Methyltransferases acquired by lactococcal 936-type phage provide protection against restriction endonuclease activity.

BACKGROUND: So-called 936-type phages are among the most frequently isolated phages in dairy facilities utilising Lactococcus lactis starter cultures. Despite extensive efforts to control phage proliferation and decades of research, these phages continue to negatively impact cheese production in terms of the final product quality and, consequently, monetary return. RESULTS: Whole genome sequencing and in silico analysis of three 936-type phage genomes identified several putative (orphan) methyltransferase (MTase)-encoding genes located within the packaging and replication regions of the genome. Utilising SMRT sequencing, methylome analysis was performed on all three phages, allowing the identification of adenine modifications consistent with N-6 methyladenine sequence methylation, which in some cases could be attributed to these phage-encoded MTases. Heterologous gene expression revealed that M.Phi145I/M.Phi93I and M.Phi93DAM, encoded by genes located within the packaging module, provide protection against the restriction enzymes HphI and DpnII, respectively, representing the first functional MTases identified in members of 936-type phages. CONCLUSIONS: SMRT sequencing technology enabled the identification of the target motifs of MTases encoded by the genomes of three lytic 936-type phages and these MTases represent the first functional MTases identified in this species of phage. The presence of these MTase-encoding genes on 936-type phage genomes is assumed to represent an adaptive response to circumvent host encoded restriction-modification systems thereby increasing the fitness of the phages in a dynamic dairy environment.


July 7, 2019

Genome sequence of Porphyromonas gingivalis strain HG66 (DSM 28984).

Porphyromonas gingivalis is considered a major etiologic agent in adult periodontitis. Gingipains are among its most important virulence factors, but their release is unique in strain HG66. We present the genome sequence of HG66 with a single contig of 2,441,680 bp and a G+C content of 48.1%. Copyright © 2014 Siddiqui et al.

