July 19, 2019

The epigenomic landscape of prokaryotes.

DNA methylation acts in concert with restriction enzymes to protect the integrity of prokaryotic genomes. Studies in a limited number of organisms suggest that methylation also contributes to prokaryotic genome regulation, but the prevalence and properties of such non-restriction-associated methylation systems remain poorly understood. Here, we used single molecule, real-time sequencing to map DNA modifications including m6A, m4C, and m5C across the genomes of 230 diverse bacterial and archaeal species. We observed DNA methylation in nearly all (93%) organisms examined, and identified a total of 834 distinct reproducibly methylated motifs. These data enabled annotation of the DNA binding specificities of 620 DNA methyltransferases (MTases), doubling known specificities for previously hard-to-study Type I, IIG and III MTases, and revealing their extraordinary diversity. Strikingly, 48% of organisms harbor active Type II MTases with no apparent cognate restriction enzyme. These active ‘orphan’ MTases are present in diverse bacterial and archaeal phyla and show motif specificities and methylation patterns consistent with functions in gene regulation and DNA replication. Our results reveal the pervasive presence of DNA methylation throughout the prokaryotic kingdoms, as well as the diversity of sequence specificities and potential functions of DNA methylation systems.


July 19, 2019

Phase variation of a Type IIG restriction-modification enzyme alters site-specific methylation patterns and gene expression in Campylobacter jejuni strain NCTC11168.

Phase-variable restriction-modification systems are a feature of a diverse range of bacterial species. Stochastic, reversible switches in expression of the methyltransferase produce variation in methylation of specific sequences. Phase-variable methylation by both Type I and Type III methyltransferases is associated with altered gene expression and phenotypic variation. One phase-variable gene of Campylobacter jejuni encodes a homologue of an unusual Type IIG restriction-modification system in which the endonuclease and methyltransferase are encoded by a single gene. Using both inhibition of restriction and PacBio-derived methylome analyses of mutants and phase-variants, the cj0031c allele in C. jejuni strain NCTC11168 was demonstrated to specifically methylate adenine in 5’CCCGA and 5’CCTGA sequences. Alterations in the levels of specific transcripts were detected using RNA-Seq in phase-variants and mutants of cj0031c but these changes did not correlate with observed differences in phenotypic behaviour. Alterations in restriction of phage growth were also associated with phase variation (PV) of cj0031c and correlated with presence of sites in the genomes of these phages. We conclude that PV of a Type IIG restriction-modification system causes changes in site-specific methylation patterns and gene expression patterns that may indirectly change adaptive traits. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.
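The correlation between restriction and the "presence of sites" in phage genomes can be illustrated with a small site counter. This is only a minimal sketch, not the authors' analysis: the function names are hypothetical, the sequence is treated as linear (sites spanning the junction of a circular genome are ignored), and both strands are scanned because the two motifs named in the abstract are not palindromic.

```python
# Hypothetical helper names; a sketch, not the method used in the study.

COMPLEMENT = {"A": "T", "C": "G", "G": "C", "T": "A", "N": "N"}


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence."""
    return "".join(COMPLEMENT[base] for base in reversed(seq.upper()))


def count_sites(genome: str, motifs=("CCCGA", "CCTGA")) -> int:
    """Count motif occurrences on both strands of a linear sequence."""
    forward = genome.upper()
    total = 0
    for strand in (forward, reverse_complement(forward)):
        for motif in motifs:
            start = strand.find(motif)
            while start != -1:
                total += 1
                # Advance by one base so overlapping sites are also found.
                start = strand.find(motif, start + 1)
    return total


if __name__ == "__main__":
    print(count_sites("AACCCGATT"))  # one site on the forward strand
```

A phage genome with zero sites would be expected to escape restriction regardless of the phase state, which is the kind of relationship the abstract describes.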


July 19, 2019

Large genomic differences between Moraxella bovoculi isolates acquired from the eyes of cattle with infectious bovine keratoconjunctivitis versus the deep nasopharynx of asymptomatic cattle.

Moraxella bovoculi is a recently described bacterium that is associated with infectious bovine keratoconjunctivitis (IBK) or “pinkeye” in cattle. In this study, closed circularized genomes were generated for seven M. bovoculi isolates: three that originated from the eyes of clinical IBK bovine cases and four from the deep nasopharynx of asymptomatic cattle. Isolates that originated from the eyes of IBK cases profoundly differed from those that originated from the nasopharynx of asymptomatic cattle in genome structure, gene content and polymorphism diversity and were consequently placed into two distinct phylogenetic groups. These results suggest that there are genetically distinct strains of M. bovoculi that may not associate with IBK.


July 19, 2019

Highly efficient CRISPR/Cas9-mediated cloning and functional characterization of gastric cancer-derived Epstein-Barr virus strains.

The Epstein-Barr virus (EBV) is etiologically linked to approximately 10% of gastric cancers, in which viral genomes are maintained as multicopy episomes. EBV-positive gastric cancer cells are incompetent for progeny virus production, making viral DNA cloning extremely difficult. Here we describe a highly efficient strategy for obtaining bacterial artificial chromosome (BAC) clones of EBV episomes by utilizing a CRISPR/Cas9-mediated strand break of the viral genome and subsequent homology-directed repair. EBV strains maintained in two gastric cancer cell lines (SNU719 and YCCEL1) were cloned, and their complete viral genome sequences were determined. Infectious viruses of gastric cancer cell-derived EBVs were reconstituted, and the viruses established stable latent infections in immortalized keratinocytes. While Ras oncoprotein overexpression caused massive vacuolar degeneration and cell death in control keratinocytes, EBV-infected keratinocytes survived in the presence of Ras expression. These results implicate EBV infection in predisposing epithelial cells to malignant transformation by inducing resistance to oncogene-induced cell death. Recent progress in DNA-sequencing technology has accelerated EBV whole-genome sequencing, and the repertoire of sequenced EBV genomes is increasing progressively. Accordingly, the presence of EBV variant strains that may be relevant to EBV-associated diseases has begun to attract interest. Clearly, the determination of additional disease-associated viral genome sequences will facilitate the identification of any disease-specific EBV variants. We found that CRISPR/Cas9-mediated cleavage of EBV episomal DNA enabled the cloning of disease-associated viral strains with unprecedented efficiency. As a proof of concept, two gastric cancer cell-derived EBV strains were cloned, and the infection of epithelial cells with reconstituted viruses provided important clues about the mechanism of EBV-mediated epithelial carcinogenesis. 
This experimental system should contribute to establishing the relationship between viral genome variation and EBV-associated diseases. Copyright © 2016, American Society for Microbiology. All Rights Reserved.


July 19, 2019

The complete genome sequence of the murine pathobiont Helicobacter typhlonius.

Immuno-compromised mice infected with Helicobacter typhlonius are used to model microbially induced inflammatory bowel disease (IBD). The specific mechanism through which H. typhlonius induces and promotes IBD is not fully understood. Access to the genome sequence is essential to examine emergent properties of this organism, such as its pathogenicity. To this end, we present the complete genome sequence of H. typhlonius MIT 97-6810, obtained through single-molecule real-time sequencing. The genome was assembled into a single circularized contig measuring 1.92 Mbp with an average GC content of 38.8%. In total 2,117 protein-encoding genes and 43 RNA genes were identified. Numerous pathogenic features were found, including a putative pathogenicity island (PAI) containing components of type IV secretion system, virulence-associated proteins and cag PAI protein. We compared the genome of H. typhlonius to those of the murine pathobiont H. hepaticus and human pathobiont H. pylori. H. typhlonius resembles H. hepaticus most with 1,594 (75.3%) of its genes being orthologous to genes in H. hepaticus. Determination of the global methylation state revealed eight distinct recognition motifs for adenine and cytosine methylation. H. typhlonius shares four of its recognition motifs with H. pylori. The complete genome sequence of H. typhlonius MIT 97-6810 enabled us to identify many pathogenic features suggesting that H. typhlonius can act as a pathogen. Follow-up studies are necessary to evaluate the true nature of its pathogenic capabilities. We found many methylated sites and a plethora of restriction-modification systems. The genome, together with the methylome, will provide an essential resource for future studies investigating gene regulation, host interaction and pathogenicity of H. typhlonius. In turn, this work can contribute to unraveling the role of Helicobacter in enteric disease.


July 19, 2019

A role for the bacterial GATC methylome in antibiotic stress survival.

Antibiotic resistance is an increasingly serious public health threat. Understanding pathways allowing bacteria to survive antibiotic stress may unveil new therapeutic targets. We explore the role of the bacterial epigenome in antibiotic stress survival using classical genetic tools and single-molecule real-time sequencing to characterize genomic methylation kinetics. We find that Escherichia coli survival under antibiotic pressure is severely compromised without adenine methylation at GATC sites. Although the adenine methylome remains stable during drug stress, without GATC methylation, methyl-dependent mismatch repair (MMR) is deleterious and, fueled by the drug-induced error-prone polymerase Pol IV, overwhelms cells with toxic DNA breaks. In multiple E. coli strains, including pathogenic and drug-resistant clinical isolates, DNA adenine methyltransferase deficiency potentiates antibiotics from the β-lactam and quinolone classes. This work indicates that the GATC methylome provides structural support for bacterial survival during antibiotic stress and suggests targeting bacterial DNA methylation as a viable approach to enhancing antibiotic activity.


July 19, 2019

A method for near full-length amplification and sequencing for six hepatitis C virus genotypes.

Hepatitis C virus (HCV) is a rapidly evolving RNA virus that has been classified into seven genotypes. All HCV genotypes cause chronic hepatitis, which ultimately leads to liver diseases such as cirrhosis. The genotypes are unevenly distributed across the globe, with genotypes 1 and 3 being the most prevalent. Until recently, molecular epidemiological studies of HCV evolution within the host and at the population level have been limited to the analyses of partial viral genome segments, as it has been technically challenging to amplify and sequence the full-length of the 9.6 kb HCV genome. Although recent improvements have been made in full genome sequencing methodologies, these protocols are still either limited to a specific genotype or cost-inefficient. In this study we describe a genotype-specific protocol for the amplification and sequencing of the near-full length genome of all six major HCV genotypes. We applied this protocol to 122 HCV positive clinical samples, and had a successful genome amplification rate of 90%, when the viral load was greater than 15,000 IU/ml. The assay was shown to have a detection limit of 1-3 cDNA copies per reaction. The method was tested with both Illumina and PacBio single molecule, real-time (SMRT) sequencing technologies. Illumina sequencing resulted in deep coverage and allowed detection of rare variants as well as HCV co-infection with multiple genotypes. The application of the method with PacBio RS resulted in sequence reads greater than 9 kb that covered the near full-length HCV amplicon in a single read and enabled analysis of the near full-length quasispecies. The protocol described herein can be utilised for rapid amplification and sequencing of the near-full length HCV genome in a cost efficient manner suitable for a wide range of applications.


July 19, 2019

Towards better precision medicine: PacBio single-molecule long reads resolve the interpretation of HIV drug resistant mutation profiles at explicit quasispecies (haplotype) level.

Development of HIV-1 drug resistance mutations (HDRMs) is one of the major reasons for the clinical failure of antiretroviral therapy. Treatment success rates can be improved by applying personalized anti-HIV regimens based on a patient’s HDRM profile. However, the sensitivity and specificity of the HDRM profile are limited by the methods used for detection. Sanger-based sequencing technology has traditionally been used for determining HDRM profiles at the single nucleotide variant (SNV) level, but with a sensitivity of only ≥ 20% in the HIV population of a patient. Next Generation Sequencing (NGS) technologies offer greater detection sensitivity (~ 1%) and larger scope (hundreds of samples per run). However, NGS technologies produce reads that are too short to enable the detection of the physical linkages of individual SNVs across the haplotype of each HIV strain present. In this article, we demonstrate that the single-molecule long reads generated using the Third Generation Sequencer (TGS), PacBio RS II, along with the appropriate bioinformatics analysis method, can resolve the HDRM profile at a more advanced quasispecies level. The case studies on patients’ HIV samples showed that the quasispecies view produced using the PacBio method offered greater detection sensitivity and was more comprehensive for understanding HDRM situations, which is complementary to both Sanger and NGS technologies. In conclusion, the PacBio method, providing a promising new quasispecies level of HDRM profiling, may effect an important change in the field of HIV drug resistance research.
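The difference between an SNV-level profile and a quasispecies (haplotype) view can be shown with a toy tally. The sketch below is illustrative only and is not the bioinformatics method used in the article: it assumes reads are already aligned to a common coordinate system and are long enough to span every position of interest, and the function names and example reads are hypothetical.

```python
from collections import Counter

# Hypothetical sketch: reads are assumed to be pre-aligned, equal-length strings.


def site_frequencies(reads, positions):
    """Per-position base frequencies (the SNV-level view; linkage is lost)."""
    result = {}
    for pos in positions:
        counts = Counter(read[pos] for read in reads)
        total = sum(counts.values())
        result[pos] = {base: n / total for base, n in counts.items()}
    return result


def haplotype_frequencies(reads, positions):
    """Frequencies of base combinations observed together on single reads."""
    counts = Counter("".join(read[pos] for pos in positions) for read in reads)
    total = sum(counts.values())
    return {hap: n / total for hap, n in counts.items()}


if __name__ == "__main__":
    toy_reads = ["AGCTAA", "AGCTAA", "ACCTTA", "ACCTAA"]
    print(site_frequencies(toy_reads, [1, 4]))
    print(haplotype_frequencies(toy_reads, [1, 4]))
```

In the toy data, the per-site view reports only that each position is variable, while the haplotype view shows which variants occur on the same molecule. That linkage is what short reads cannot provide when the positions are far apart.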


July 19, 2019

Comprehensive mutagenesis of the fimS promoter regulatory switch reveals novel regulation of type 1 pili in uropathogenic Escherichia coli.

Type 1 pili (T1P) are major virulence factors for uropathogenic Escherichia coli (UPEC), which cause both acute and recurrent urinary tract infections. T1P expression therefore is of direct relevance for disease. T1P are phase variable (both piliated and nonpiliated bacteria exist in a clonal population) and are controlled by an invertible DNA switch (fimS), which contains the promoter for the fim operon encoding T1P. Inversion of fimS is stochastic but may be biased by environmental conditions and other signals that ultimately converge at fimS itself. Previous studies of fimS sequences important for T1P phase variation have focused on laboratory-adapted E. coli strains and have been limited in the number of mutations or by alteration of the fimS genomic context. We surmounted these limitations by using saturating genomic mutagenesis of fimS coupled with accurate sequencing to detect both mutations and phase status simultaneously. In addition to the sequences known to be important for biasing fimS inversion, our method also identifies a previously unknown pair of 5′ UTR inverted repeats that act by altering the relative fimA levels to control phase variation. Thus we have uncovered an additional layer of T1P regulation potentially impacting virulence and the coordinate expression of multiple pilus systems.


July 19, 2019

Nested Russian doll-like genetic mobility drives rapid dissemination of the Carbapenem resistance gene blaKPC

The recent widespread emergence of carbapenem resistance in Enterobacteriaceae is a major public health concern, as carbapenems are a therapy of last resort against this family of common bacterial pathogens. Resistance genes can mobilize via various mechanisms, including conjugation and transposition; however, the importance of this mobility in short-term evolution, such as within nosocomial outbreaks, is unknown. Using a combination of short- and long-read whole-genome sequencing of 281 blaKPC-positive Enterobacteriaceae isolates from a single hospital over 5 years, we demonstrate rapid dissemination of this carbapenem resistance gene to multiple species, strains, and plasmids. Mobility of blaKPC occurs at multiple nested genetic levels, with transmission of blaKPC strains between individuals, frequent transfer of blaKPC plasmids between strains/species, and frequent transposition of blaKPC transposon Tn4401 between plasmids. We also identify a common insertion site for Tn4401 within various Tn2-like elements, suggesting that homologous recombination between Tn2-like elements has enhanced the spread of Tn4401 between different plasmid vectors. Furthermore, while short-read sequencing has known limitations for plasmid assembly, various studies have attempted to overcome this by the use of reference-based methods. We also demonstrate that, as a consequence of the genetic mobility observed in this study, plasmid structures can be extremely dynamic, and therefore these reference-based methods, as well as traditional partial typing methods, can produce very misleading conclusions. Overall, our findings demonstrate that nonclonal resistance gene dissemination can be extremely rapid, presenting significant challenges for public health surveillance and achieving effective control of antibiotic resistance. Copyright © 2016 Sheppard et al.


July 19, 2019

Accelerated cloning of a potato late blight-resistance gene using RenSeq and SMRT sequencing.

Global yields of potato and tomato crops have fallen owing to potato late blight disease, which is caused by Phytophthora infestans. Although most commercial potato varieties are susceptible to blight, many wild potato relatives show variation for resistance and are therefore a potential source of Resistance to P. infestans (Rpi) genes. Resistance breeding has exploited Rpi genes from closely related tuber-bearing potato relatives, but is laborious and slow. Here we report that the wild, diploid non-tuber-bearing Solanum americanum harbors multiple Rpi genes. We combine resistance (R) gene sequence capture (RenSeq) with single-molecule real-time (SMRT) sequencing (SMRT RenSeq) to clone Rpi-amr3i. This technology should enable de novo assembly of complete nucleotide-binding, leucine-rich repeat receptor (NLR) genes, their regulatory elements and complex multi-NLR loci from uncharacterized germplasm. SMRT RenSeq can be applied to rapidly clone multiple R genes for engineering pathogen-resistant crops.


July 19, 2019

SMRT RenSeq protocol

R gene enrichment and Sequencing (RenSeq, Jupe et al. 2013) is a genome complexity reduction method that enriches for nucleotide-binding, leucine-rich repeat (NLR) type plant disease resistance genes prior to sequencing. RenSeq was established and successfully used with Illumina platforms (Jupe et al. 2013, Andolfo et al. 2014); however, the repetitive nature of NLR genes hampered de novo assembly of this family. Here we describe a protocol for preparing long enriched libraries that are suitable for Pacific Biosciences Single-Molecule Real Time (SMRT) sequencing. Reads Of Inserts (ROI) generated with this protocol are around 3-4 kb in length (longer than the average NLR sequence). These long reads are especially well suited for de novo assembly of whole NLR genes including their regulatory elements.


July 19, 2019

Initial assessment of the molecular epidemiology of blaNDM-1 in Colombia.

We report complete genome sequences of four blaNDM-1-harboring Gram-negative multidrug resistant (MDR) isolates from Colombia. The blaNDM-1 genes were located on 193Kb-Inc FIA, 178Kb-Inc A/C2 and 47Kb (unknown Inc type) plasmids. MLST revealed that isolates belong to ST10 (Escherichia coli), ST392 (Klebsiella pneumoniae), and ST322 and ST464 (Acinetobacter baumannii and A. nosocomialis, respectively). Our analysis identified that the Inc A/C2 plasmid in E. coli contained a novel complex transposon (Tn125 and Tn5393 with 3 copies of blaNDM-1) and a recombination “hotspot” for the acquisition of new resistance determinants. Copyright © 2016, American Society for Microbiology. All Rights Reserved.


July 19, 2019

Genome structural diversity among 31 Bordetella pertussis isolates from two recent U.S. whooping cough statewide epidemics

During 2010 and 2012, California and Vermont, respectively, experienced statewide epidemics of pertussis with differences seen in the demographic affected, case clinical presentation, and molecular epidemiology of the circulating strains. To overcome limitations of the current molecular typing methods for pertussis, we utilized whole-genome sequencing to gain a broader understanding of how current circulating strains are causing large epidemics. Through the use of combined next-generation sequencing technologies, this study compared de novo, single-contig genome assemblies from 31 out of 33 Bordetella pertussis isolates collected during two separate pertussis statewide epidemics and 2 resequenced vaccine strains. Final genome architecture assemblies were verified with whole-genome optical mapping. Sixteen distinct genome rearrangement profiles were observed in epidemic isolate genomes, all of which were distinct from the genome structures of the two resequenced vaccine strains. These rearrangements appear to be mediated by repetitive sequence elements, such as high-copy-number mobile genetic elements and rRNA operons. Additionally, novel and previously identified single nucleotide polymorphisms were detected in 10 virulence-related genes in the epidemic isolates. Whole-genome variation analysis identified state-specific variants, and coding regions bearing nonsynonymous mutations were classified into functional annotated orthologous groups. Comprehensive studies on whole genomes are needed to understand the resurgence of pertussis and develop novel tools to better characterize the molecular epidemiology of evolving B. pertussis populations. IMPORTANCE Pertussis, or whooping cough, is the most poorly controlled vaccine-preventable bacterial disease in the United States, which has experienced a resurgence for more than a decade. 
Once viewed as a monomorphic pathogen, B. pertussis strains circulating during epidemics exhibit diversity visible on a genome structural level, previously undetectable by traditional sequence analysis using short-read technologies. For the first time, we combine short- and long-read sequencing platforms with restriction optical mapping for single-contig, de novo assembly of 31 isolates to investigate two geographically and temporally independent U.S. pertussis epidemics. These complete genomes reshape our understanding of B. pertussis evolution and strengthen molecular epidemiology toward one day understanding the resurgence of pertussis.


July 19, 2019

Complete telomere-to-telomere de novo assembly of the Plasmodium falciparum genome through long-read (>11 kb), single molecule, real-time sequencing.

The application of next-generation sequencing to estimate genetic diversity of Plasmodium falciparum, the most lethal malaria parasite, has proved challenging due to the skewed AT-richness [~80.6% (A + T)] of its genome and the lack of technology to assemble highly polymorphic subtelomeric regions that contain clonally variant, multigene virulence families (e.g., var and rifin). To address this, we performed amplification-free, single molecule, real-time sequencing of P. falciparum genomic DNA and generated reads of average length 12 kb, with 50% of the reads between 15.5 and 50 kb in length. Next, using the Hierarchical Genome Assembly Process, we assembled the P. falciparum genome de novo and successfully compiled all 14 nuclear chromosomes telomere-to-telomere. We also accurately resolved centromeres [~90-99% (A + T)] and subtelomeric regions and identified large insertions and duplications that add extra var and rifin genes to the genome, along with smaller structural variants such as homopolymer tract expansions. Overall, we show that amplification-free, long-read sequencing combined with de novo assembly overcomes major challenges inherent to studying the P. falciparum genome. Indeed, this technology may not only identify the polymorphic and repetitive subtelomeric sequences of parasite populations from endemic areas but may also evaluate structural variation linked to virulence, drug resistance and disease transmission. © The Author 2016. Published by Oxford University Press on behalf of Kazusa DNA Research Institute.
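The (A + T) percentages quoted above are simple base-composition measures. As a minimal sketch (not the authors' pipeline, and with hypothetical function names), the fraction can be computed for a whole sequence or in fixed windows. Windowing is one way that extremely AT-rich stretches, such as the centromeres described in the abstract, would stand out against the genome-wide average.

```python
# Hypothetical sketch; window and step sizes are illustrative choices.


def at_fraction(seq: str) -> float:
    """Fraction of called bases (A, C, G, T) that are A or T."""
    called = [base for base in seq.upper() if base in "ACGT"]
    if not called:
        return 0.0
    return sum(base in "AT" for base in called) / len(called)


def windowed_at(seq: str, window: int, step: int) -> list:
    """AT fraction for each full window of the given size along the sequence."""
    return [
        at_fraction(seq[start:start + window])
        for start in range(0, len(seq) - window + 1, step)
    ]


if __name__ == "__main__":
    print(at_fraction("AATTG"))
    print(windowed_at("AAAAGGGG", window=4, step=4))
```

Ambiguous bases such as N are excluded from the denominator here. That is a design choice, and a real analysis might handle them differently.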

