Menu
July 19, 2019

CGG repeat-induced FMR1 silencing depends on the expansion size in human iPSCs and neurons carrying unmethylated full mutations.

In fragile X syndrome (FXS), CGG repeat expansion greater than 200 triplets is believed to trigger FMR1 gene silencing and disease etiology. However, FXS siblings have been identified with more than 200 CGGs, termed unmethylated full mutation (UFM) carriers, without gene silencing and disease symptoms. Here, we show that hypomethylation of the FMR1 promoter is maintained in induced pluripotent stem cells (iPSCs) derived from two UFM individuals. However, a subset of iPSC clones with large CGG expansions carries silenced FMR1. Furthermore, we demonstrate de novo silencing upon expansion of the CGG repeat size. FMR1 does not undergo silencing during neuronal differentiation of UFM iPSCs, and expression of large unmethylated CGG repeats has phenotypic consequences resulting in neurodegenerative features. Our data suggest that UFM individuals do not lack the cell-intrinsic ability to silence FMR1 and that inter-individual variability in the CGG repeat size required for silencing exists in the FXS population. Copyright © 2016 The Author(s). Published by Elsevier Inc. All rights reserved.


July 19, 2019

Mechanisms of evolution in high-consequence drug resistance plasmids.

The dissemination of resistance among bacteria has been facilitated by the fact that resistance genes are usually located on a diverse and evolving set of transmissible plasmids. However, the mechanisms generating diversity and enabling adaptation within highly successful resistance plasmids have remained obscure, despite their profound clinical significance. To understand these mechanisms, we have performed a detailed analysis of the mobilome (the entire mobile genetic element content) of a set of previously sequenced carbapenemase-producing Enterobacteriaceae (CPE) from the National Institutes of Health Clinical Center. This analysis revealed that plasmid reorganizations occurring in the natural context of colonization of human hosts were overwhelmingly driven by genetic rearrangements carried out by replicative transposons working in concert with the process of homologous recombination. A more complete understanding of the molecular mechanisms and evolutionary forces driving rearrangements in resistance plasmids may lead to fundamentally new strategies to address the problem of antibiotic resistance.The spread of antibiotic resistance among Gram-negative bacteria is a serious public health threat, as it can critically limit the types of drugs that can be used to treat infected patients. In particular, carbapenem-resistant members of the Enterobacteriaceae family are responsible for a significant and growing burden of morbidity and mortality. Here, we report on the mechanisms underlying the evolution of several plasmids carried by previously sequenced clinical Enterobacteriaceae isolates from the National Institutes of Health Clinical Center (NIH CC). Our ability to track genetic rearrangements that occurred within resistance plasmids was dependent on accurate annotation of the mobile genetic elements within the plasmids, which was greatly aided by access to long-read DNA sequencing data and knowledge of their mechanisms. Mobile genetic elements such as transposons and integrons have been strongly associated with the rapid spread of genes responsible for antibiotic resistance. Understanding the consequences of their actions allowed us to establish unambiguous evolutionary relationships between plasmids in the analysis set. Copyright © 2016 He et al.


July 19, 2019

Comprehensive genome analysis of carbapenemase-producing Enterobacter spp.: new insights into phylogeny, population structure and resistance mechanisms.

Knowledge regarding the genomic structure of Enterobacter spp., the second most prevalent carbapenemase-producing Enterobacteriaceae, remains limited. Here we sequenced 97 clinical Enterobacter species isolates that were both carbapenem susceptible and resistant from various geographic regions to decipher the molecular origins of carbapenem resistance and to understand the changing phylogeny of these emerging and drug-resistant pathogens. Of the carbapenem-resistant isolates, 30 possessed blaKPC-2, 40 had blaKPC-3, 2 had blaKPC-4, and 2 had blaNDM-1 Twenty-three isolates were carbapenem susceptible. Six genomes were sequenced to completion, and their sizes ranged from 4.6 to 5.1 Mbp. Phylogenomic analysis placed 96 of these genomes, 351 additional Enterobacter genomes downloaded from NCBI GenBank, and six newly sequenced type strains into 19 phylogenomic groups-18 groups (A to R) in the Enterobacter cloacae complex and Enterobacter aerogenes Diverse mechanisms underlying the molecular evolutionary trajectory of these drug-resistant Enterobacter spp. were revealed, including the acquisition of an antibiotic resistance plasmid, followed by clonal spread, horizontal transfer of blaKPC-harboring plasmids between different phylogenomic groups, and repeated transposition of the blaKPC gene among different plasmid backbones. Group A, which comprises multilocus sequence type 171 (ST171), was the most commonly identified (23% of isolates). Genomic analysis showed that ST171 isolates evolved from a common ancestor and formed two different major clusters; each acquiring unique blaKPC-harboring plasmids, followed by clonal expansion. The data presented here represent the first comprehensive study of phylogenomic interrogation and the relationship between antibiotic resistance and plasmid discrimination among carbapenem-resistant Enterobacter spp., demonstrating the genetic diversity and complexity of the molecular mechanisms driving antibiotic resistance in this genus.Enterobacter spp., especially carbapenemase-producing Enterobacter spp., have emerged as a clinically significant cause of nosocomial infections. However, only limited information is available on the distribution of carbapenem resistance across this genus. Augmenting this problem is an erroneous identification of Enterobacter strains because of ambiguous typing methods and imprecise taxonomy. In this study, we used a whole-genome-based comparative phylogenetic approach to (i) revisit and redefine the genus Enterobacter and (ii) unravel the emergence and evolution of the Klebsiella pneumoniae carbapenemase-harboring Enterobacter spp. Using genomic analysis of 447 sequenced strains, we developed an improved understanding of the species designations within this complex genus and identified the diverse mechanisms driving the molecular evolution of carbapenem resistance. The findings in this study provide a solid genomic framework that will serve as an important resource in the future development of molecular diagnostics and in supporting drug discovery programs. Copyright © 2016 Chavda et al.


July 19, 2019

Examining sources of error in PCR by single-molecule sequencing.

Next-generation sequencing technology has enabled the detection of rare genetic or somatic mutations and contributed to our understanding of disease progression and evolution. However, many next-generation sequencing technologies first rely on DNA amplification, via the Polymerase Chain Reaction (PCR), as part of sample preparation workflows. Mistakes made during PCR appear in sequencing data and contribute to false mutations that can ultimately confound genetic analysis. In this report, a single-molecule sequencing assay was used to comprehensively catalog the different types of errors introduced during PCR, including polymerase misincorporation, structure-induced template-switching, PCR-mediated recombination and DNA damage. In addition to well-characterized polymerase base substitution errors, other sources of error were found to be equally prevalent. PCR-mediated recombination by Taq polymerase was observed at the single-molecule level, and surprisingly found to occur as frequently as polymerase base substitution errors, suggesting it may be an underappreciated source of error for multiplex amplification reactions. Inverted repeat structural elements in lacZ caused polymerase template-switching between the top and bottom strands during replication and the frequency of these events were measured for different polymerases. For very accurate polymerases, DNA damage introduced during temperature cycling, and not polymerase base substitution errors, appeared to be the major contributor toward mutations occurring in amplification products. In total, we analyzed PCR products at the single-molecule level and present here a more complete picture of the types of mistakes that occur during DNA amplification.


July 19, 2019

The history of Bordetella pertussis genome evolution includes structural rearrangement.

Despite high pertussis vaccine coverage, reported cases of whooping cough (pertussis) have increased over the last decade in the United States and other developed countries. Although Bordetella pertussis is well known for its limited gene sequence variation, recent advances in long-read sequencing technology have begun to reveal genomic structural heterogeneity among otherwise indistinguishable isolates, even within geographically or temporally defined epidemics. We have compared rearrangements among complete genome assemblies from 257 B. pertussis isolates to examine the potential evolution of the chromosomal structure in a pathogen with minimal gene nucleotide sequence diversity. Discrete changes in gene order were identified that differentiated genomes from vaccine reference strains and clinical isolates of various genotypes, frequently along phylogenetic boundaries defined by single nucleotide polymorphisms. The observed rearrangements were primarily large inversions centered on the replication origin or terminus and flanked by IS481, a mobile genetic element with >240 copies per genome and previously suspected to mediate rearrangements and deletions by homologous recombination. These data illustrate that structural genome evolution in B. pertussis is not limited to reduction but also includes rearrangement. Therefore, although genomes of clinical isolates are structurally diverse, specific changes in gene order are conserved, perhaps due to positive selection, providing novel information for investigating disease resurgence and molecular epidemiology.IMPORTANCE Whooping cough, primarily caused by Bordetella pertussis, has resurged in the United States even though the coverage with pertussis-containing vaccines remains high. The rise in reported cases has included increased disease rates among all vaccinated age groups, provoking questions about the pathogen’s evolution. The chromosome of B. pertussis includes a large number of repetitive mobile genetic elements that obstruct genome analysis. However, these mobile elements facilitate large rearrangements that alter the order and orientation of essential protein-encoding genes, which otherwise exhibit little nucleotide sequence diversity. By comparing the complete genome assemblies from 257 isolates, we show that specific rearrangements have been conserved throughout recent evolutionary history, perhaps by eliciting changes in gene expression, which may also provide useful information for molecular epidemiology. Copyright © 2017 American Society for Microbiology.


July 19, 2019

Comparative genomics reveals the diversity of restriction-modification systems and DNA methylation sites in Listeria monocytogenes.

Listeria monocytogenes is a bacterial pathogen that is found in a wide variety of anthropogenic and natural environments. Genome sequencing technologies are rapidly becoming a powerful tool in facilitating our understanding of how genotype, classification phenotypes, and virulence phenotypes interact to predict the health risks of individual bacterial isolates. Currently, 57 closed L. monocytogenes genomes are publicly available, representing three of the four phylogenetic lineages, and they suggest that L. monocytogenes has high genomic synteny. This study contributes an additional 15 closed L. monocytogenes genomes that were used to determine the associations between the genome and methylome with host invasion magnitude. In contrast to previous findings, large chromosomal inversions and rearrangements were detected in five isolates at the chromosome terminus and within rRNA genes, including a previously undescribed inversion within rRNA-encoding regions. Each isolate’s epigenome contained highly diverse methyltransferase recognition sites, even within the same serotype and methylation pattern. Eleven strains contained a single chromosomally encoded methyltransferase, one strain contained two methylation systems (one system on a plasmid), and three strains exhibited no methylation, despite the occurrence of methyltransferase genes. In three isolates a new, unknown DNA modification was observed in addition to diverse methylation patterns, accompanied by a novel methylation system. Neither chromosome rearrangement nor strain-specific patterns of epigenome modification observed within virulence genes were correlated with serotype designation, clonal complex, or in vitro infectivity. These data suggest that genome diversity is larger than previously considered in L. monocytogenes and that as more genomes are sequenced, additional structure and methylation novelty will be observed in this organism.Listeria monocytogenes is the causative agent of listeriosis, a disease which manifests as gastroenteritis, meningoencephalitis, and abortion. Among Salmonella, Escherichia coli, Campylobacter, and Listeria-causing the most prevalent foodborne illnesses-infection by L. monocytogenes carries the highest mortality rate. The ability of L. monocytogenes to regulate its response to various harsh environments enables its persistence and transmission. Small-scale comparisons of L. monocytogenes focusing solely on genome contents reveal a highly syntenic genome yet fail to address the observed diversity in phenotypic regulation. This study provides a large-scale comparison of 302 L. monocytogenes isolates, revealing the importance of the epigenome and restriction-modification systems as major determinants of L. monocytogenes phylogenetic grouping and subsequent phenotypic expression. Further examination of virulence genes of select outbreak strains reveals an unprecedented diversity in methylation statuses despite high degrees of genome conservation. Copyright © 2017 American Society for Microbiology.


July 19, 2019

Genomic confirmation of vancomycin-resistant Enterococcus transmission from deceased donor to liver transplant recipient.

In a liver transplant recipient with vancomycin-resistant Enterococcus (VRE) surgical site and bloodstream infection, a combination of pulsed-field gel electrophoresis, multilocus sequence typing, and whole genome sequencing identified that donor and recipient VRE isolates were highly similar when compared to time-matched hospital isolates. Comparison of de novo assembled isolate genomes was highly suggestive of transplant transmission rather than hospital-acquired transmission and also identified subtle internal rearrangements between donor and recipient missed by other genomic approaches. Given the improved resolution, whole-genome assembly of pathogen genomes is likely to become an essential tool for investigation of potential organ transplant transmissions.


July 19, 2019

Canu: scalable and accurate long-read assembly via adaptive k-mer weighting and repeat separation.

Long-read single-molecule sequencing has revolutionized de novo genome assembly and enabled the automated reconstruction of reference-quality genomes. However, given the relatively high error rates of such technologies, efficient and accurate assembly of large repeats and closely related haplotypes remains challenging. We address these issues with Canu, a successor of Celera Assembler that is specifically designed for noisy single-molecule sequences. Canu introduces support for nanopore sequencing, halves depth-of-coverage requirements, and improves assembly continuity while simultaneously reducing runtime by an order of magnitude on large genomes versus Celera Assembler 8.2. These advances result from new overlapping and assembly algorithms, including an adaptive overlapping strategy based on tf-idf weighted MinHash and a sparse assembly graph construction that avoids collapsing diverged repeats and haplotypes. We demonstrate that Canu can reliably assemble complete microbial genomes and near-complete eukaryotic chromosomes using either PacBio or Oxford Nanopore technologies, and achieves a contig NG50 of greater than 21 Mbp on both human and Drosophila melanogaster PacBio datasets. For assembly structures that cannot be linearly represented, Canu provides graph-based assembly outputs in graphical fragment assembly (GFA) format for analysis or integration with complementary phasing and scaffolding techniques. The combination of such highly resolved assembly graphs with long-range scaffolding information promises the complete and automated assembly of complex genomes. Published by Cold Spring Harbor Laboratory Press.


July 19, 2019

Single-molecule sequencing (PacBio) of the Staphylococcus capitis NRCS-A clone reveals the basis of multidrug resistance and adaptation to the Neonatal Intensive Care Unit environment.

The multi-resistant Staphylococcus capitis clone NRCS-A has recently been described as a major pathogen causing nosocomial, late-onset sepsis (LOS) in preterm neonates worldwide. NRCS-A representatives exhibit an atypical antibiotic resistance profile. Here, the complete closed genome (chromosomal and plasmid sequences) of NRCS-A prototype strain CR01 and the draft genomes of three other clinical NRCS-A strains from Australia, Belgium and the United Kingdom are annotated and compared to available non-NRCS-A S. capitis genomes. Our goal was to delineate the uniqueness of the NRCS-A clone with respect to antibiotic resistance, virulence factors and mobile genetic elements. We identified 6 antimicrobial resistance genes, all carried by mobile genetic elements. Previously described virulence genes present in the NRCS-A genomes are shared with the six non-NRCS-A S. capitis genomes. Overall, 63 genes are specific to the NRCS-A lineage, including 28 genes located in the methicillin-resistance cassette SCCmec. Among the 35 remaining genes, 25 are of unknown function, and 9 correspond to an additional type I restriction modification system (n = 3), a cytosine methylation operon (n = 2), and a cluster of genes related to the biosynthesis of teichoic acids (n = 4). Interestingly, a tenth gene corresponds to a resistance determinant for nisin (nsr gene), a bacteriocin secreted by potential NRCS-A strain niche competitors in the gut microbiota. The genomic characteristics presented here emphasize the contribution of mobile genetic elements to the emergence of multidrug resistance in the S. capitis NRCS-A clone. No NRCS-A-specific known virulence determinant was detected, which does not support a role for virulence as a driving force of NRCS-A emergence in NICUs worldwide. However, the presence of a nisin resistance determinant on the NRCS-A chromosome, but not in other S. capitis strains and most coagulase-negative representatives, might confer a competitive advantage to NRCS-A strains during the early steps of gut colonization in neonates. This suggests that the striking adaptation of NRCS-A to the NICU environment might be related to its specific antimicrobial resistance and also to a possible enhanced ability to challenge competing bacteria in its ecological niche.


July 19, 2019

Tofacitinib relieves symptoms of stimulator of interferon genes (STING)-associated vasculopathy with onset in infancy caused by 2 de novo variants in TMEM173.

To the Editor: Stimulator of interferon genes (STING), which is encoded by transmembrane protein 173 (TMEM173), is an important mediator in initiating innate immune responses by detecting aberrant DNA species or cyclic di-GMP-AMP (cGAMP) in the cytosol and driving synthesis of type I interferon.1-3cGAMP molecules, which are produced by cyclic GMP-AMP synthase, bind to STING homodimers embedded in the endoplasmic reticulum membrane and eventually cause phosphorylation of interferon regulatory factor 3 by activating Tank-binding kinase 1 (TBK1). Patients with activating mutations of STING display early onset of chronic inflammation and vasculopathy caused by increased type I interferon signaling, a condition termed STING-associated vasculopathy with onset in infancy (SAVI).2,3Improved understanding of STING’s function and its implications in disease pathogenesis has suggested new potential avenues of disease treatment options through modulating STING signaling pathway components.


July 19, 2019

DNA target recognition domains in the Type I restriction and modification systems of Staphylococcus aureus.

Staphylococcus aureus displays a clonal population structure in which horizontal gene transfer between different lineages is extremely rare. This is due, in part, to the presence of a Type I DNA restriction–modification (RM) system given the generic name of Sau1, which maintains different patterns of methylation on specific target sequences on the genomes of different lineages. We have determined the target sequences recognized by the Sau1 Type I RM systems present in a wide range of the most prevalent S. aureus lineages and assigned the sequences recognized to particular target recognition domains within the RM enzymes. We used a range of biochemical assays on purified enzymes and single molecule real-time sequencing on genomic DNA to determine these target sequences and their patterns of methylation. Knowledge of the main target sequences for Sau1 will facilitate the synthesis of new vectors for transformation of the most prevalent lineages of this ‘untransformable’ bacterium.


July 19, 2019

Detecting PKD1 variants in polycystic kidney disease patients by single-molecule long-read sequencing.

A genetic diagnosis of autosomal-dominant polycystic kidney disease (ADPKD) is challenging due to allelic heterogeneity, high GC content, and homology of the PKD1 gene with six pseudogenes. Short-read next-generation sequencing approaches, such as whole-genome sequencing and whole-exome sequencing, often fail at reliably characterizing complex regions such as PKD1. However, long-read single-molecule sequencing has been shown to be an alternative strategy that could overcome PKD1 complexities and discriminate between homologous regions of PKD1 and its pseudogenes. In this study, we present the increased power of resolution for complex regions using long-read sequencing to characterize a cohort of 19 patients with ADPKD. Our approach provided high sensitivity in identifying PKD1 pathogenic variants, diagnosing 94.7% of the patients. We show that reliable screening of ADPKD patients in a single test without interference of PKD1 homologous sequences, commonly introduced by residual amplification of PKD1 pseudogenes, by direct long-read sequencing is now possible. This strategy can be implemented in diagnostics and is highly suitable to sequence and resolve complex genomic regions that are of clinical relevance. © 2017 The Authors. Human Mutation published by Wiley Periodicals, Inc.


July 19, 2019

New advances in sequence assembly

Extract It may be hard to believe, but the idea of sequence assembly is around 40 years old. Consider this pair of quotes from Rodger Staden (Staden 1979): “With modern fast sequencing techniques and suitable computer programs it is now possible to sequence whole genomes without the need of restriction maps.” “If the 5′ end of the sequence from one gel reading is the same as the 3′ end of the sequence from another the data is said to overlap. If the overlap is of sufficient length to distinguish it from being a repeat in the sequence the two sequences must be contiguous. The data from the two gel readings can then be joined to form one longer continuous sequence.” Replace “gel reading” with “read” and these sentences would go unnoticed in the introduction of any paper today. Here you can also see the birth of jargon that now pervades the field: overlaps between reads form contigs (contiguous sequences). Just a few months later, Gingeras et al. (1979) described “Computer programs for the assembly of DNA sequences.” It all sounds so modern, until the discussion mentions FORTRAN code stored on magnetic tapes. How, then, can we fill an entire special issue of Genome Research with “new advances” so many years later? To me, this reflects the beauty of the problem—simple enough to be stated in a single paragraph, yet complex enough to sustain a field of research for decades. This dichotomy is common to many famous computational problems; indeed, mathematical formulations of sequence assembly fall into a class of problems known as “NP-hard” that do not admit an easy solution (Medvedev et al. 2007). There is another reason for continued advances in sequence assembly—advances in sequencing technology. As evident from the Staden quotes above, the first assembly methods were …


July 19, 2019

Selective graft-versus-leukemia depends on magnitude and diversity of the alloreactive T cell response.

Patients with leukemia who receive a T cell-depleted allogeneic stem cell graft followed by postponed donor lymphocyte infusion (DLI) can experience graft-versus-leukemia (GVL) reactivity, with a lower risk of graft-versus-host disease (GVHD). Here, we have investigated the magnitude, diversity, and specificity of alloreactive CD8 T cells in patients who developed GVL reactivity after DLI in the absence or presence of GVHD. We observed a lower magnitude and diversity of CD8 T cells for minor histocompatibility antigens (MiHAs) in patients with selective GVL reactivity without GVHD. Furthermore, we demonstrated that MiHA-specific T cell clones from patients with selective GVL reactivity showed lower reactivity against nonhematopoietic cells, even when pretreated with inflammatory cytokines. Expression analysis of MiHA-encoding genes showed that similar types of antigens were recognized in both patient groups, but in patients who developed GVHD, T cell reactivity was skewed to target broadly expressed MiHAs. As an inflammatory environment can render nonhematopoietic cells susceptible to T cell recognition, prevention of such circumstances favors induction of selective GVL reactivity without development of GVHD.


July 19, 2019

Revealing complete complex KIR haplotypes phased by long-read sequencing technology

The killer cell immunoglobulin-like receptor (KIR) region of human chromosome 19 contains up to 16 genes for natural killer (NK) cell receptors that recognize human leukocyte antigen (HLA)/peptide complexes and other ligands. The KIR proteins fulfill functional roles in infections, pregnancy, autoimmune diseases and transplantation. However, their characterization remains a constant challenge. Not only are the genes highly homologous due to their recent evolution by tandem duplications, but the region is structurally dynamic due to frequent transposon-mediated recombination. A sequencing approach that precisely captures the complexity of KIR haplotypes for functional annotation is desirable. We present a unique approach to haplotype the KIR loci using single-molecule, real-time (SMRT) sequencing. Using this method, we have—for the first time—comprehensively sequenced and phased sixteen KIR haplotypes from eight individuals without imputation. The information revealed four novel haplotype structures, a novel gene-fusion allele, novel and confirmed insertion/deletion events, a homozygous individual, and overall diversity for the structural haplotypes and their alleles. These KIR haplotypes augment our existing knowledge by providing high-quality references, evolutionary informers, and source material for imputation. The haplotype sequences and gene annotations provide alternative loci for the KIR region in the human genome reference GrCh38.p8.


Talk with an expert

If you have a question, need to check the status of an order, or are interested in purchasing an instrument, we're here to help.