July 19, 2019

Long read sequencing technology to solve complex genomic regions assembly in plants

Background: Numerous completed or on-going whole genome sequencing projects have highlighted the fact that obtaining a high quality genome sequence is necessary to address comparative genomics questions such as structural variations among genotypes and gain or loss of specific function. Despite the spectacular progress that has been made in sequencing technologies, obtaining accurate and reliable data is still a challenge, both at the whole genome scale and when targeting specific genomic regions. These problems are even more noticeable for complex plant genomes. Most plant genomes are known to be particularly challenging due to their size, high density of repetitive elements and various levels of ploidy. To overcome these problems, we have developed a strategy to reduce genome complexity by using the large insert BAC libraries combined with next generation sequencing technologies. Results: We compared two different technologies (Roche-454 and Pacific Biosciences PacBio RS II) to sequence pools of BAC clones in order to obtain the best quality sequence. We targeted nine BAC clones from different species (maize, wheat, strawberry, barley, sugarcane and sunflower) known to be complex in terms of sequence assembly. We sequenced the pools of the nine BAC clones with both technologies. We compared assembly results and highlighted differences due to the sequencing technologies used. Conclusions: We demonstrated that the long reads obtained with the PacBio RS II technology serve to obtain a better and more reliable assembly, notably by preventing errors due to duplicated or repetitive sequences in the same region.


July 19, 2019

Living apart together: crosstalk between the core and supernumerary genomes in a fungal plant pathogen.

Eukaryotes display remarkable genome plasticity, which can include supernumerary chromosomes that differ markedly from the core chromosomes. Despite the widespread occurrence of supernumerary chromosomes in fungi, their origin, relation to the core genome and the reason for their divergent characteristics are still largely unknown. The complexity of genome assembly due to the presence of repetitive DNA partially accounts for this. Here we use single-molecule real-time (SMRT) sequencing to assemble the genome of a prominent fungal wheat pathogen, Fusarium poae, including at least one supernumerary chromosome. The core genome contains limited transposable elements (TEs) and no gene duplications, while the supernumerary genome holds up to 25% TEs and multiple gene duplications. The core genome shows all hallmarks of repeat-induced point mutation (RIP), a defense mechanism against TEs, specific for fungi. The absence of RIP on the supernumerary genome accounts for the differences between the two (sub)genomes, and results in a functional crosstalk between them. The supernumerary genome is a reservoir for TEs that migrate to the core genome, and even large blocks of supernumerary sequence (>200 kb) have recently translocated to the core. Vice versa, the supernumerary genome acts as a refuge for genes that are duplicated from the core genome. For the first time, a mechanism was determined that explains the differences that exist between the core and supernumerary genome in fungi. Different biology rather than origin was shown to be responsible. A “living apart together” crosstalk exists between the core and supernumerary genome, accelerating chromosomal and organismal evolution.


July 19, 2019

Biosynthesis and function of modified bases in bacteria and their viruses.

Naturally occurring modification of the canonical A, G, C, and T bases can be found in the DNA of cellular organisms and viruses from all domains of life. Bacterial viruses (bacteriophages) are a particularly rich but still underexploited source of such modified variant nucleotides. The modifications conserve the coding and base-pairing functions of DNA, but add regulatory and protective functions. In prokaryotes, modified bases appear primarily to be part of an arms race between bacteriophages (and other genomic parasites) and their hosts, although, as in eukaryotes, some modifications have been adapted to convey epigenetic information. The first half of this review catalogs the identification and diversity of DNA modifications found in bacteria and bacteriophages. What is known about the biogenesis, context, and function of these modifications is also described. The second part of the review places these DNA modifications in the context of the arms race between bacteria and bacteriophages. It focuses particularly on the defense and counter-defense strategies that turn on direct recognition of the presence of a modified base. Where modification has been shown to affect other DNA transactions, such as expression and chromosome segregation, that is summarized, with reference to recent reviews.


July 19, 2019

High throughput random mutagenesis and Single Molecule Real Time Sequencing of the muscle nicotinic acetylcholine receptor.

High throughput random mutagenesis is a powerful tool to identify which residues are important for the function of a protein, and gain insight into its structure-function relation. The human muscle nicotinic acetylcholine receptor was used to test whether this technique previously used for monomeric receptors can be applied to a pentameric ligand-gated ion channel. A mutant library for the α1 subunit of the channel was generated by error-prone PCR, and full length sequences of all 2816 mutants were retrieved using single molecule real time sequencing. Each α1 mutant was co-transfected with wildtype β1, δ, and ε subunits, and the channel function characterized by an ion flux assay. To test whether the strategy could map the structure-function relation of this receptor, we attempted to identify mutations that conferred resistance to competitive antagonists. Mutant hits were defined as receptors that responded to the nicotinic agonist epibatidine, but were not inhibited by either α-bungarotoxin or tubocurarine. Eight α1 subunit mutant hits were identified, six of which contained mutations at position Y233 or V275 in the transmembrane domain. Three single point mutations (Y233N, Y233H, and V275M) were studied further, and found to enhance the potencies of five channel agonists tested. This suggests that the mutations made the channel resistant to the antagonists, not by impairing antagonist binding, but rather by producing a gain-of-function phenotype, e.g. increased agonist sensitivity. Our data show that random high throughput mutagenesis is applicable to multimeric proteins to discover novel functional mutants, and outline the benefits of using single molecule real time sequencing with regard to quality control of the mutant library as well as downstream mutant data interpretation.


July 19, 2019

IncFIIk plasmid harbouring an amplification of 16S rRNA methyltransferase-encoding gene rmtH associated with mobile element ISCR2.

To investigate the resistance mechanisms and genetic support underlying the high resistance level of the Klebsiella pneumoniae strain CMUL78 to aminoglycoside and β-lactam antibiotics. Antibiotic susceptibility was assessed by the disc diffusion method and MICs were determined by the microdilution method. Antibiotic resistance genes and their genetic environment were characterized by PCR and Sanger sequencing. Plasmid contents were analysed in the clinical strain and transconjugants obtained by mating-out assays. Complete plasmid sequencing was performed with PacBio and Illumina technology. Strain CMUL78 co-produced the 16S rRNA methyltransferase (RMTase) RmtH, carbapenemase OXA-48 and ESBL SHV-12. The rmtH- and blaSHV-12-encoding genes were harboured by a novel ~115 kb IncFIIk plasmid designated pRmtH, and blaOXA-48 by a ~62 kb IncL/M plasmid related to pOXA-48a. pRmtH plasmid possessed seven different stability modules, one of which is a novel hybrid toxin-antitoxin system. Interestingly, pRmtH plasmid harboured a 4-fold amplification of an rmtH-ISCR2 unit arranged in tandem and inserted within a novel IS26-based composite transposon designated Tn6329. This is the first known report of the 16S RMTase-encoding gene rmtH in a plasmid. The rmtH-ISCR2 unit was inserted in a composite transposon as a 4-fold tandem repeat, a scarcely reported organization. © The Author 2016. Published by Oxford University Press on behalf of the British Society for Antimicrobial Chemotherapy. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.


July 19, 2019

The deep origin and recent loss of venom toxin genes in rattlesnakes.

The genetic origin of novel traits is a central but challenging puzzle in evolutionary biology. Among snakes, phospholipase A2 (PLA2)-related toxins have evolved in different lineages to function as potent neurotoxins, myotoxins, or hemotoxins. Here, we traced the genomic origin and evolution of PLA2 toxins by examining PLA2 gene number, organization, and expression in both neurotoxic and non-neurotoxic rattlesnakes. We found that even though most North American rattlesnakes do not produce neurotoxins, the genes of a specialized heterodimeric neurotoxin predate the origin of rattlesnakes and were present in their last common ancestor (~22 mya). The neurotoxin genes were then deleted independently in the lineages leading to the Western Diamondback (Crotalus atrox) and Eastern Diamondback (C. adamanteus) rattlesnakes (~6 mya), while a PLA2 myotoxin gene retained in C. atrox was deleted from the neurotoxic Mojave rattlesnake (C. scutulatus; ~4 mya). The rapid evolution of PLA2 gene number appears to be due to transposon invasion that provided a template for non-allelic homologous recombination. Copyright © 2016 Elsevier Ltd. All rights reserved.


July 19, 2019

Single-molecule sequencing revealing the presence of distinct JC polyomavirus populations in patients with progressive multifocal leukoencephalopathy.

Progressive multifocal leukoencephalopathy (PML) is a fatal disease caused by reactivation of JC polyomavirus (JCPyV) in immunosuppressed individuals and lytic infection by neurotropic JCPyV in glial cells. The exact content of neurotropic mutations within individual JCPyV strains has not been studied to our knowledge. We exploited the capacity of single-molecule real-time sequencing technology to determine the sequence of complete JCPyV genomes in single reads. The method was used to precisely characterize individual neurotropic JCPyV strains of 3 patients with PML without the bias caused by assembly of short sequence reads. In the cerebrospinal fluid sample of a 73-year-old woman with rapid PML onset, 3 distinct JCPyV populations could be identified. All viral populations were characterized by rearrangements within the noncoding regulatory region (NCCR) and 1 point mutation, S267L in the VP1 gene, suggestive of neurotropic strains. One patient with PML had a single neurotropic strain with rearranged NCCR, and 1 patient had a single strain with small NCCR alterations. We report here, for the first time, full characterization of individual neurotropic JCPyV strains in the cerebrospinal fluid of patients with PML. It remains to be established whether PML pathogenesis is driven by one or several neurotropic strains in an individual.


July 19, 2019

Host genome integration and giant virus-induced reactivation of the virophage mavirus.

Endogenous viral elements are increasingly found in eukaryotic genomes, yet little is known about their origins, dynamics, or function. Here we provide a compelling example of a DNA virus that readily integrates into a eukaryotic genome where it acts as an inducible antiviral defence system. We found that the virophage mavirus, a parasite of the giant Cafeteria roenbergensis virus (CroV), integrates at multiple sites within the nuclear genome of the marine protozoan Cafeteria roenbergensis. The endogenous mavirus is structurally and genetically similar to eukaryotic DNA transposons and endogenous viruses of the Maverick/Polinton family. Provirophage genes are not constitutively expressed, but are specifically activated by superinfection with CroV, which induces the production of infectious mavirus particles. Virophages can inhibit the replication of mimivirus-like giant viruses and an anti-viral protective effect of provirophages on their hosts has been hypothesized. We find that provirophage-carrying cells are not directly protected from CroV; however, lysis of these cells releases infectious mavirus particles that are then able to suppress CroV replication and enhance host survival during subsequent rounds of infection. The microbial host-parasite interaction described here involves an altruistic aspect and suggests that giant-virus-induced activation of provirophages might be ecologically relevant in natural protist populations.


July 19, 2019

CGG repeat-induced FMR1 silencing depends on the expansion size in human iPSCs and neurons carrying unmethylated full mutations.

In fragile X syndrome (FXS), CGG repeat expansion greater than 200 triplets is believed to trigger FMR1 gene silencing and disease etiology. However, FXS siblings have been identified with more than 200 CGGs, termed unmethylated full mutation (UFM) carriers, without gene silencing and disease symptoms. Here, we show that hypomethylation of the FMR1 promoter is maintained in induced pluripotent stem cells (iPSCs) derived from two UFM individuals. However, a subset of iPSC clones with large CGG expansions carries silenced FMR1. Furthermore, we demonstrate de novo silencing upon expansion of the CGG repeat size. FMR1 does not undergo silencing during neuronal differentiation of UFM iPSCs, and expression of large unmethylated CGG repeats has phenotypic consequences resulting in neurodegenerative features. Our data suggest that UFM individuals do not lack the cell-intrinsic ability to silence FMR1 and that inter-individual variability in the CGG repeat size required for silencing exists in the FXS population. Copyright © 2016 The Author(s). Published by Elsevier Inc. All rights reserved.


July 19, 2019

Exploiting members of the BAHD acyltransferase family to synthesize multiple hydroxycinnamate and benzoate conjugates in yeast.

BAHD acyltransferases, named after the first four biochemically characterized enzymes of the group, are plant-specific enzymes that catalyze the transfer of coenzyme A-activated donors onto various acceptor molecules. They are responsible for the synthesis in plants of a myriad of secondary metabolites, some of which are beneficial for humans either as therapeutics or as specialty chemicals such as flavors and fragrances. The production of pharmaceutical, nutraceutical and commodity chemicals using engineered microbes is an alternative, green route to energy-intensive chemical syntheses that consume petroleum-based precursors. However, identification of appropriate enzymes and validation of their functional expression in heterologous hosts is a prerequisite for the design and implementation of metabolic pathways in microbes for the synthesis of such target chemicals. For the synthesis of valuable metabolites in the yeast Saccharomyces cerevisiae, we selected BAHD acyltransferases based on their preferred donor and acceptor substrates. In particular, BAHDs that use hydroxycinnamoyl-CoAs and/or benzoyl-CoA as donors were targeted because a large number of molecules beneficial to humans belong to this family of hydroxycinnamate and benzoate conjugates. The selected BAHD coding sequences were synthesized and cloned individually on a vector containing the Arabidopsis gene At4CL5, which encodes a promiscuous 4-coumarate:CoA ligase active on hydroxycinnamates and benzoates. The various S. cerevisiae strains obtained for co-expression of At4CL5 with the different BAHDs effectively produced a wide array of valuable hydroxycinnamate and benzoate conjugates upon addition of adequate combinations of donors and acceptor molecules. In particular, we report here for the first time the production in yeast of rosmarinic acid and its derivatives, quinate hydroxycinnamate esters such as chlorogenic acid, and glycerol hydroxycinnamate esters. Similarly, we achieved for the first time the microbial production of polyamine hydroxycinnamate amides; monolignol, malate and fatty alcohol hydroxycinnamate esters; tropane alkaloids; and benzoate/caffeate alcohol esters. In some instances, the additional expression of Flavobacterium johnsoniae tyrosine ammonia-lyase (FjTAL) allowed the synthesis of p-coumarate conjugates and eliminated the need to supplement the culture media with 4-hydroxycinnamate. We demonstrate in this study the effectiveness of expressing members of the plant BAHD acyltransferase family in yeast for the synthesis of numerous valuable hydroxycinnamate and benzoate conjugates.


July 19, 2019

Methylome analysis of two Xanthomonas spp. using Single-Molecule Real-Time Sequencing.

Single-molecule real-time (SMRT) sequencing allows identification of methylated DNA bases and methylation patterns/motifs at the genome level. Using SMRT sequencing, diverse bacterial methylomes including those of Helicobacter pylori, Lactobacillus spp., and Escherichia coli have been determined, and previously unreported DNA methylation motifs have been identified. However, the methylomes of Xanthomonas species, which belong to the most important plant pathogenic bacterial genus, have not been documented. Here, we report the methylomes of Xanthomonas axonopodis pv. glycines (Xag) strain 8ra and X. campestris pv. vesicatoria (Xcv) strain 85-10. We identified N(6)-methyladenine (6mA) and N(4)-methylcytosine (4mC) modification in both genomes. In addition, we assigned putative DNA methylation motifs including previously unreported methylation motifs via REBASE and MotifMaker, and compared methylation patterns in both species. Although Xag and Xcv belong to the same genus, their methylation patterns were dramatically different. The number of 4mC DNA bases in Xag (66,682) was significantly higher (29 fold) than in Xcv (2,321). In contrast, the number of 6mA DNA bases (4,147) in Xag was comparable to the number in Xcv (5,491). Strikingly, there were no common or shared motifs in the 10 most frequently methylated motifs of both strains, indicating they possess unique species- or strain-specific methylation motifs. Among the 20 most frequent motifs from both strains, for 9 motifs at least 1% of the methylated bases were located in putative promoter regions. Methylome analysis by SMRT sequencing technology is the first step toward understanding the biology and functions of DNA methylation in this genus.
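
The fold difference quoted above can be checked from the reported counts. The following is a minimal sketch, not part of the original study: the dictionary layout and the `fold_change` helper are illustrative names, and only the four counts come from the abstract.

```python
# Methylated-base counts as reported in the abstract above.
# The data structure and function name are illustrative assumptions.
counts = {
    "Xag": {"4mC": 66_682, "6mA": 4_147},
    "Xcv": {"4mC": 2_321, "6mA": 5_491},
}


def fold_change(mark: str, numerator: str = "Xag", denominator: str = "Xcv") -> float:
    """Ratio of methylated-base counts between two strains for one modification."""
    return counts[numerator][mark] / counts[denominator][mark]


fold_4mc = fold_change("4mC")  # 4mC is far more abundant in Xag than in Xcv
fold_6ma = fold_change("6mA")  # 6mA counts differ by less than a factor of two

print(f"4mC Xag/Xcv: {fold_4mc:.1f}-fold")
print(f"6mA Xag/Xcv: {fold_6ma:.2f}-fold")
```

Rounding the 4mC ratio to the nearest integer gives the 29-fold figure stated in the abstract, while the 6mA ratio stays below one, consistent with the counts being described as comparable.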


July 19, 2019

Mechanisms of evolution in high-consequence drug resistance plasmids.

The dissemination of resistance among bacteria has been facilitated by the fact that resistance genes are usually located on a diverse and evolving set of transmissible plasmids. However, the mechanisms generating diversity and enabling adaptation within highly successful resistance plasmids have remained obscure, despite their profound clinical significance. To understand these mechanisms, we have performed a detailed analysis of the mobilome (the entire mobile genetic element content) of a set of previously sequenced carbapenemase-producing Enterobacteriaceae (CPE) from the National Institutes of Health Clinical Center. This analysis revealed that plasmid reorganizations occurring in the natural context of colonization of human hosts were overwhelmingly driven by genetic rearrangements carried out by replicative transposons working in concert with the process of homologous recombination. A more complete understanding of the molecular mechanisms and evolutionary forces driving rearrangements in resistance plasmids may lead to fundamentally new strategies to address the problem of antibiotic resistance. The spread of antibiotic resistance among Gram-negative bacteria is a serious public health threat, as it can critically limit the types of drugs that can be used to treat infected patients. In particular, carbapenem-resistant members of the Enterobacteriaceae family are responsible for a significant and growing burden of morbidity and mortality. Here, we report on the mechanisms underlying the evolution of several plasmids carried by previously sequenced clinical Enterobacteriaceae isolates from the National Institutes of Health Clinical Center (NIH CC). Our ability to track genetic rearrangements that occurred within resistance plasmids was dependent on accurate annotation of the mobile genetic elements within the plasmids, which was greatly aided by access to long-read DNA sequencing data and knowledge of their mechanisms. Mobile genetic elements such as transposons and integrons have been strongly associated with the rapid spread of genes responsible for antibiotic resistance. Understanding the consequences of their actions allowed us to establish unambiguous evolutionary relationships between plasmids in the analysis set. Copyright © 2016 He et al.


July 19, 2019

Comprehensive genome analysis of carbapenemase-producing Enterobacter spp.: new insights into phylogeny, population structure and resistance mechanisms.

Knowledge regarding the genomic structure of Enterobacter spp., the second most prevalent carbapenemase-producing Enterobacteriaceae, remains limited. Here we sequenced 97 clinical Enterobacter species isolates that were both carbapenem susceptible and resistant from various geographic regions to decipher the molecular origins of carbapenem resistance and to understand the changing phylogeny of these emerging and drug-resistant pathogens. Of the carbapenem-resistant isolates, 30 possessed blaKPC-2, 40 had blaKPC-3, 2 had blaKPC-4, and 2 had blaNDM-1. Twenty-three isolates were carbapenem susceptible. Six genomes were sequenced to completion, and their sizes ranged from 4.6 to 5.1 Mbp. Phylogenomic analysis placed 96 of these genomes, 351 additional Enterobacter genomes downloaded from NCBI GenBank, and six newly sequenced type strains into 19 phylogenomic groups: 18 groups (A to R) in the Enterobacter cloacae complex and Enterobacter aerogenes. Diverse mechanisms underlying the molecular evolutionary trajectory of these drug-resistant Enterobacter spp. were revealed, including the acquisition of an antibiotic resistance plasmid, followed by clonal spread, horizontal transfer of blaKPC-harboring plasmids between different phylogenomic groups, and repeated transposition of the blaKPC gene among different plasmid backbones. Group A, which comprises multilocus sequence type 171 (ST171), was the most commonly identified (23% of isolates). Genomic analysis showed that ST171 isolates evolved from a common ancestor and formed two different major clusters; each acquiring unique blaKPC-harboring plasmids, followed by clonal expansion. The data presented here represent the first comprehensive study of phylogenomic interrogation and the relationship between antibiotic resistance and plasmid discrimination among carbapenem-resistant Enterobacter spp., demonstrating the genetic diversity and complexity of the molecular mechanisms driving antibiotic resistance in this genus. Enterobacter spp., especially carbapenemase-producing Enterobacter spp., have emerged as a clinically significant cause of nosocomial infections. However, only limited information is available on the distribution of carbapenem resistance across this genus. Augmenting this problem is an erroneous identification of Enterobacter strains because of ambiguous typing methods and imprecise taxonomy. In this study, we used a whole-genome-based comparative phylogenetic approach to (i) revisit and redefine the genus Enterobacter and (ii) unravel the emergence and evolution of the Klebsiella pneumoniae carbapenemase-harboring Enterobacter spp. Using genomic analysis of 447 sequenced strains, we developed an improved understanding of the species designations within this complex genus and identified the diverse mechanisms driving the molecular evolution of carbapenem resistance. The findings in this study provide a solid genomic framework that will serve as an important resource in the future development of molecular diagnostics and in supporting drug discovery programs. Copyright © 2016 Chavda et al.
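
The isolate counts in this abstract can be reconciled with a short tally. This is only a sketch under two assumptions that the abstract does not state explicitly: that each carbapenem-resistant isolate carries exactly one of the listed carbapenemase genes, and that the figure of 447 strains corresponds to the 96 study genomes plus the 351 GenBank genomes, with the six type strains counted separately. All variable names are illustrative.

```python
# Carbapenemase gene counts among resistant isolates, as reported above.
# Assumption: one listed gene per resistant isolate.
carbapenemase_counts = {
    "blaKPC-2": 30,
    "blaKPC-3": 40,
    "blaKPC-4": 2,
    "blaNDM-1": 2,
}
susceptible_isolates = 23

resistant_isolates = sum(carbapenemase_counts.values())
total_isolates = resistant_isolates + susceptible_isolates

# Genomes placed in the phylogenomic analysis.
# Assumption: the six type strains are not part of the 447 total.
study_genomes = 96
genbank_genomes = 351
analysed_strains = study_genomes + genbank_genomes

print(f"resistant: {resistant_isolates}, total isolates: {total_isolates}")
print(f"strains in genomic analysis: {analysed_strains}")
```

Under these assumptions the tally reproduces both the 97 sequenced isolates and the 447 strains mentioned later in the abstract.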


July 19, 2019

Examining sources of error in PCR by single-molecule sequencing.

Next-generation sequencing technology has enabled the detection of rare genetic or somatic mutations and contributed to our understanding of disease progression and evolution. However, many next-generation sequencing technologies first rely on DNA amplification, via the Polymerase Chain Reaction (PCR), as part of sample preparation workflows. Mistakes made during PCR appear in sequencing data and contribute to false mutations that can ultimately confound genetic analysis. In this report, a single-molecule sequencing assay was used to comprehensively catalog the different types of errors introduced during PCR, including polymerase misincorporation, structure-induced template-switching, PCR-mediated recombination and DNA damage. In addition to well-characterized polymerase base substitution errors, other sources of error were found to be equally prevalent. PCR-mediated recombination by Taq polymerase was observed at the single-molecule level, and surprisingly found to occur as frequently as polymerase base substitution errors, suggesting it may be an underappreciated source of error for multiplex amplification reactions. Inverted repeat structural elements in lacZ caused polymerase template-switching between the top and bottom strands during replication, and the frequency of these events was measured for different polymerases. For very accurate polymerases, DNA damage introduced during temperature cycling, and not polymerase base substitution errors, appeared to be the major contributor toward mutations occurring in amplification products. In total, we analyzed PCR products at the single-molecule level and present here a more complete picture of the types of mistakes that occur during DNA amplification.


July 19, 2019

Genetic stability of genome-scale deoptimized RNA virus vaccine candidates under selective pressure.

Recoding viral genomes by numerous synonymous but suboptimal substitutions provides live attenuated vaccine candidates. These vaccine candidates should have a low risk of deattenuation because of the many changes involved. However, their genetic stability under selective pressure is largely unknown. We evaluated phenotypic reversion of deoptimized human respiratory syncytial virus (RSV) vaccine candidates in the context of strong selective pressure. Codon pair deoptimized (CPD) versions of RSV were attenuated and temperature-sensitive. During serial passage at progressively increasing temperature, a CPD RSV containing 2,692 synonymous mutations in 9 of 11 ORFs did not lose temperature sensitivity, remained genetically stable, and was restricted at temperatures of 34 °C/35 °C and above. However, a CPD RSV containing 1,378 synonymous mutations solely in the polymerase L ORF quickly lost substantial attenuation. Comprehensive sequence analysis of virus populations identified many different potentially deattenuating mutations in the L ORF as well as, surprisingly, many appearing in other ORFs. Phenotypic analysis revealed that either of two competing mutations in the virus transcription antitermination factor M2-1, outside of the CPD area, substantially reversed defective transcription of the CPD L gene and substantially restored virus fitness in vitro and, in the case of one of these two mutations, also in vivo. Paradoxically, the introduction into Min L of one mutation each in the M2-1, N, P, and L proteins resulted in a virus with increased attenuation in vivo but increased immunogenicity. Thus, in addition to providing insights on the adaptability of genome-scale deoptimized RNA viruses, stability studies can yield improved synthetic RNA virus vaccine candidates.

