Menu
July 19, 2019

Separate F-type plasmids have shaped the evolution of the H30 subclone of Escherichia coli sequence type 131.

The extraintestinal pathogenic Escherichia coli (ExPEC) H30 subclone of sequence type 131 (ST131-H30) has emerged abruptly as a dominant lineage of ExPEC responsible for human disease. The ST131-H30 lineage has been well described phylogenetically, yet its plasmid complement is not fully understood. Here, single-molecule, real-time sequencing was used to generate the complete plasmid sequences of ST131-H30 isolates and those belonging to other ST131 clades. Comparative analyses revealed separate F-type plasmids that have shaped the evolution of the main fluoroquinolone-resistant ST131-H30 clades. Specifically, an F1:A2:B20 plasmid is strongly associated with the H30R/C1 clade, whereas an F2:A1:B- plasmid is associated with the H30Rx/C2 clade. A series of plasmid gene losses, gains, and rearrangements involving IS26 likely led to the current plasmid complements within each ST131-H30 sublineage, which contain several overlapping gene clusters with putative functions in virulence and fitness, suggesting plasmid-mediated convergent evolution. Evidence suggests that the H30Rx/C2-associated F2:A1:B- plasmid type was present in strains ancestral to the acquisition of fluoroquinolone resistance and prior to the introduction of a multidrug resistance-encoding gene cassette harboring bla CTX-M-15. In vitro experiments indicated a host strain-independent low frequency of plasmid transfer, differential levels of plasmid stability even between closely related ST131-H30 strains, and possible epistasis for carriage of these plasmids within the H30R/Rx lineages. IMPORTANCE A clonal lineage of Escherichia coli known as ST131 has emerged as a dominating strain type causing extraintestinal infections in humans. The evolutionary history of ST131 E. coli is now well understood. However, the role of plasmids in ST131’s evolutionary history is poorly defined. This study utilized real-time, single-molecule sequencing to compare plasmids from various current and historical lineages of ST131. From this work, it was determined that a series of plasmid gains, losses, and recombinational events has led to the currently circulating plasmids of ST131 strains. These plasmids appear to have evolved to acquire similar gene clusters on multiple occasions, suggesting possible plasmid-mediated convergent evolution leading to evolutionary success. These plasmids also appear to be better suited to exist in specific strains of ST131 due to coadaptive mutations. Overall, a series of events has enabled the evolution of ST131 plasmids, possibly contributing to the lineage’s success.


July 19, 2019

Rapid sequencing of complete env genes from primary HIV-1 samples

The ability to study rapidly evolving viral populations has been constrained by the read length of next-generation sequencing approaches and the sampling depth of single-genome amplification methods. Here, we develop and characterize a method using Pacific Biosciences Single Molecule, Real-Time (SMRT) sequencing technology to sequence multiple, intact full-length human immunodeficiency virus-1 env genes amplified from viral RNA populations circulating in blood, and provide computational tools for analyzing and visualizing these data.


July 19, 2019

Variation and evolution in the glutamine-rich repeat region of Drosophila argonaute-2.

RNA interference pathways mediate biological processes through Argonaute-family proteins, which bind small RNAs as guides to silence complementary target nucleic acids . In insects and crustaceans Argonaute-2 silences viral nucleic acids, and therefore acts as a primary effector of innate antiviral immunity. Although the function of the major Argonaute-2 domains, which are conserved across most Argonaute-family proteins, are known, many invertebrate Argonaute-2 homologs contain a glutamine-rich repeat (GRR) region of unknown function at the N-terminus . Here we combine long-read amplicon sequencing of Drosophila Genetic Reference Panel (DGRP) lines with publicly available sequence data from many insect species to show that this region evolves extremely rapidly and is hyper-variable within species. We identify distinct GRR haplotype groups in Drosophila melanogaster, and suggest that one of these haplotype groups has recently risen to high frequency in a North American population. Finally, we use published data from genome-wide association studies of viral resistance in D. melanogaster to test whether GRR haplotypes are associated with survival after virus challenge. We find a marginally significant association with survival after challenge with Drosophila C Virus in the DGRP, but we were unable to replicate this finding using lines from the Drosophila Synthetic Population Resource panel. Copyright © 2016 Palmer and Obbard.


July 19, 2019

Living apart together: crosstalk between the core and supernumerary genomes in a fungal plant pathogen.

Eukaryotes display remarkable genome plasticity, which can include supernumerary chromosomes that differ markedly from the core chromosomes. Despite the widespread occurrence of supernumerary chromosomes in fungi, their origin, relation to the core genome and the reason for their divergent characteristics are still largely unknown. The complexity of genome assembly due to the presence of repetitive DNA partially accounts for this.Here we use single-molecule real-time (SMRT) sequencing to assemble the genome of a prominent fungal wheat pathogen, Fusarium poae, including at least one supernumerary chromosome. The core genome contains limited transposable elements (TEs) and no gene duplications, while the supernumerary genome holds up to 25 % TEs and multiple gene duplications. The core genome shows all hallmarks of repeat-induced point mutation (RIP), a defense mechanism against TEs, specific for fungi. The absence of RIP on the supernumerary genome accounts for the differences between the two (sub)genomes, and results in a functional crosstalk between them. The supernumerary genome is a reservoir for TEs that migrate to the core genome, and even large blocks of supernumerary sequence (>200 kb) have recently translocated to the core. Vice versa, the supernumerary genome acts as a refuge for genes that are duplicated from the core genome.For the first time, a mechanism was determined that explains the differences that exist between the core and supernumerary genome in fungi. Different biology rather than origin was shown to be responsible. A “living apart together” crosstalk exists between the core and supernumerary genome, accelerating chromosomal and organismal evolution.


July 19, 2019

Biosynthesis and function of modified bases in bacteria and their viruses.

Naturally occurring modification of the canonical A, G, C, and T bases can be found in the DNA of cellular organisms and viruses from all domains of life. Bacterial viruses (bacteriophages) are a particularly rich but still underexploited source of such modified variant nucleotides. The modifications conserve the coding and base-pairing functions of DNA, but add regulatory and protective functions. In prokaryotes, modified bases appear primarily to be part of an arms race between bacteriophages (and other genomic parasites) and their hosts, although, as in eukaryotes, some modifications have been adapted to convey epigenetic information. The first half of this review catalogs the identification and diversity of DNA modifications found in bacteria and bacteriophages. What is known about the biogenesis, context, and function of these modifications are also described. The second part of the review places these DNA modifications in the context of the arms race between bacteria and bacteriophages. It focuses particularly on the defense and counter-defense strategies that turn on direct recognition of the presence of a modified base. Where modification has been shown to affect other DNA transactions, such as expression and chromosome segregation, that is summarized, with reference to recent reviews.


July 19, 2019

Condition-dependent co-regulation of genomic clusters of virulence factors in the grapevine trunk pathogen Neofusicoccum parvum.

The ascomycete Neofusicoccum parvum, one of the causal agents of Botryosphaeria dieback, is a destructive wood-infecting fungus and a serious threat to grape production worldwide. The capability to colonize woody tissue, combined with the secretion of phytotoxic compounds, is thought to underlie its pathogenicity and virulence. Here, we describe the repertoire of virulence factors and their transcriptional dynamics as the fungus feeds on different substrates and colonizes the woody stem. We assembled and annotated a highly contiguous genome using single-molecule real-time DNA sequencing. Transcriptome profiling by RNA sequencing determined the genome-wide patterns of expression of virulence factors both in vitro (potato dextrose agar or medium amended with grape wood as substrate) and in planta. Pairwise statistical testing of differential expression, followed by co-expression network analysis, revealed that physically clustered genes coding for putative virulence functions were induced depending on the substrate or stage of plant infection. Co-expressed gene clusters were significantly enriched not only in genes associated with secondary metabolism, but also in those associated with cell wall degradation, suggesting that dynamic co-regulation of transcriptional networks contributes to multiple aspects of N. parvum virulence. In most of the co-expressed clusters, all genes shared at least a common motif in their promoter region, indicative of co-regulation by the same transcription factor. Co-expression analysis also identified chromatin regulators with correlated expression with inducible clusters of virulence factors, suggesting a complex, multi-layered regulation of the virulence repertoire of N. parvum.© 2016 BSPP AND JOHN WILEY & SONS LTD.


July 19, 2019

Winding paths to simplicity: genome evolution in facultative insect symbionts.

Symbiosis between organisms is an important driving force in evolution. Among the diverse relationships described, extensive progress has been made in insect-bacteria symbiosis, which improved our understanding of the genome evolution in host-associated bacteria. Particularly, investigations on several obligate mutualists have pushed the limits of what we know about the minimal genomes for sustaining cellular life. To bridge the gap between those obligate symbionts with extremely reduced genomes and their non-host-restricted ancestors, this review focuses on the recent progress in genome characterization of facultative insect symbionts. Notable cases representing various types and stages of host associations, including those from multiple genera in the family Enterobacteriaceae (class Gammaproteobacteria), Wolbachia (Alphaproteobacteria) and Spiroplasma (Mollicutes), are discussed. Although several general patterns of genome reduction associated with the adoption of symbiotic relationships could be identified, extensive variation was found among these facultative symbionts. These findings are incorporated into the established conceptual frameworks to develop a more detailed evolutionary model for the discussion of possible trajectories. In summary, transitions from facultative to obligate symbiosis do not appear to be a universal one-way street; switches between hosts and lifestyles (e.g. commensalism, parasitism or mutualism) occur frequently and could be facilitated by horizontal gene transfer. © FEMS 2016.


July 19, 2019

De novo assembly and phasing of a Korean human genome.

Advances in genome assembly and phasing provide an opportunity to investigate the diploid architecture of the human genome and reveal the full range of structural variation across population groups. Here we report the de novo assembly and haplotype phasing of the Korean individual AK1 (ref. 1) using single-molecule real-time sequencing, next-generation mapping, microfluidics-based linked reads, and bacterial artificial chromosome (BAC) sequencing approaches. Single-molecule sequencing coupled with next-generation mapping generated a highly contiguous assembly, with a contig N50 size of 17.9?Mb and a scaffold N50 size of 44.8?Mb, resolving 8 chromosomal arms into single scaffolds. The de novo assembly, along with local assemblies and spanning long reads, closes 105 and extends into 72 out of 190 euchromatic gaps in the reference genome, adding 1.03?Mb of previously intractable sequence. High concordance between the assembly and paired-end sequences from 62,758 BAC clones provides strong support for the robustness of the assembly. We identify 18,210 structural variants by direct comparison of the assembly with the human reference, identifying thousands of breakpoints that, to our knowledge, have not been reported before. Many of the insertions are reflected in the transcriptome and are shared across the Asian population. We performed haplotype phasing of the assembly with short reads, long reads and linked reads from whole-genome sequencing and with short reads from 31,719 BAC clones, thereby achieving phased blocks with an N50 size of 11.6?Mb. Haplotigs assembled from single-molecule real-time reads assigned to haplotypes on phased blocks covered 89% of genes. The haplotigs accurately characterized the hypervariable major histocompatability complex region as well as demonstrating allele configuration in clinically relevant genes such as CYP2D6. This work presents the most contiguous diploid human genome assembly so far, with extensive investigation of unreported and Asian-specific structural variants, and high-quality haplotyping of clinically relevant alleles for precision medicine.


July 19, 2019

Multiple origins of the pathogenic yeast Candida orthopsilosis by separate hybridizations between two parental species.

Mating between different species produces hybrids that are usually asexual and stuck as diploids, but can also lead to the formation of new species. Here, we report the genome sequences of 27 isolates of the pathogenic yeast Candida orthopsilosis. We find that most isolates are diploid hybrids, products of mating between two unknown parental species (A and B) that are 5% divergent in sequence. Isolates vary greatly in the extent of homogenization between A and B, making their genomes a mosaic of highly heterozygous regions interspersed with homozygous regions. Separate phylogenetic analyses of SNPs in the A- and B-derived portions of the genome produces almost identical trees of the isolates with four major clades. However, the presence of two mutually exclusive genotype combinations at the mating type locus, and recombinant mitochondrial genomes diagnostic of inter-clade mating, shows that the species C. orthopsilosis does not have a single evolutionary origin but was created at least four times by separate interspecies hybridizations between parents A and B. Older hybrids have lost more heterozygosity. We also identify two isolates with homozygous genomes derived exclusively from parent A, which are pure non-hybrid strains. The parallel emergence of the same hybrid species from multiple independent hybridization events is common in plant evolution, but is much less documented in pathogenic fungi.


July 19, 2019

Short tandem repeats, segmental duplications, gene deletion, and genomic instability in a rapidly diversified immune gene family.

Genomic regions with repetitive sequences are considered unstable and prone to swift DNA diversification processes. A highly diverse immune gene family of the sea urchin (Strongylocentrotus purpuratus), called Sp185/333, is composed of clustered genes with similar sequence as well as several types of repeats ranging in size from short tandem repeats (STRs) to large segmental duplications. This repetitive structure may have been the basis for the incorrect assembly of this gene family in the sea urchin genome sequence. Consequently, we have resolved the structure of the family and profiled the members by sequencing selected BAC clones using Illumina and PacBio approaches.BAC insert assemblies identified 15 predicted genes that are organized into three clusters. Two of the gene clusters have almost identical flanking regions, suggesting that they may be non-matching allelic clusters residing at the same genomic locus. GA STRs surround all genes and appear in large stretches at locations of putatively deleted genes. GAT STRs are positioned at the edges of segmental duplications that include a subset of the genes. The unique locations of the STRs suggest their involvement in gene deletions and segmental duplications. Genomic profiling of the Sp185/333 gene diversity in 10 sea urchins shows that no gene repertoires are shared among individuals indicating a very high gene diversification rate for this family.The repetitive genomic structure of the Sp185/333 family that includes STRs in strategic locations may serve as platform for a controlled mechanism which regulates the processes of gene recombination, gene conversion, duplication and deletion. The outcome is genomic instability and allelic mismatches, which may further drive the swift diversification of the Sp185/333 gene family that may improve the immune fitness of the species.


July 19, 2019

IncFIIk plasmid harbouring an amplification of 16S rRNA methyltransferase-encoding gene rmtH associated with mobile element ISCR2.

To investigate the resistance mechanisms and genetic support underlying the high resistance level of the Klebsiella pneumoniae strain CMUL78 to aminoglycoside and ß-lactam antibiotics.Antibiotic susceptibility was assessed by the disc diffusion method and MICs were determined by the microdilution method. Antibiotic resistance genes and their genetic environment were characterized by PCR and Sanger sequencing. Plasmid contents were analysed in the clinical strain and transconjugants obtained by mating-out assays. Complete plasmid sequencing was performed with PacBio and Illumina technology.Strain CMUL78 co-produced the 16S rRNA methyltransferase (RMTase) RmtH, carbapenemase OXA-48 and ESBL SHV-12. The rmtH- and blaSHV-12-encoding genes were harboured by a novel ~115 kb IncFIIk plasmid designated pRmtH, and blaOXA-48 by a ~62 kb IncL/M plasmid related to pOXA-48a. pRmtH plasmid possessed seven different stability modules, one of which is a novel hybrid toxin-antitoxin system. Interestingly, pRmtH plasmid harboured a 4-fold amplification of an rmtH-ISCR2 unit arranged in tandem and inserted within a novel IS26-based composite transposon designated Tn6329.This is the first known report of the 16S RMTase-encoding gene rmtH in a plasmid. The rmtH-ISCR2 unit was inserted in a composite transposon as a 4-fold tandem repeat, a scarcely reported organization.© The Author 2016. Published by Oxford University Press on behalf of the British Society for Antimicrobial Chemotherapy. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.


July 19, 2019

Recent advances in inferring viral diversity from high-throughput sequencing data.

Rapidly evolving RNA viruses prevail within a host as a collection of closely related variants, referred to as viral quasispecies. Advances in high-throughput sequencing (HTS) technologies have facilitated the assessment of the genetic diversity of such virus populations at an unprecedented level of detail. However, analysis of HTS data from virus populations is challenging due to short, error-prone reads. In order to account for uncertainties originating from these limitations, several computational and statistical methods have been developed for studying the genetic heterogeneity of virus population. Here, we review methods for the analysis of HTS reads, including approaches to local diversity estimation and global haplotype reconstruction. Challenges posed by aligning reads, as well as the impact of reference biases on diversity estimates are also discussed. In addition, we address some of the experimental approaches designed to improve the biological signal-to-noise ratio. In the future, computational methods for the analysis of heterogeneous virus populations are likely to continue being complemented by technological developments. Copyright © 2016 The Authors. Published by Elsevier B.V. All rights reserved.


July 19, 2019

Single-molecule sequencing revealing the presence of distinct JC polyomavirus populations in patients with progressive multifocal leukoencephalopathy.

Progressive multifocal leukoencephalopathy (PML) is a fatal disease caused by reactivation of JC polyomavirus (JCPyV) in immunosuppressed individuals and lytic infection by neurotropic JCPyV in glial cells. The exact content of neurotropic mutations within individual JCPyV strains has not been studied to our knowledge.We exploited the capacity of single-molecule real-time sequencing technology to determine the sequence of complete JCPyV genomes in single reads. The method was used to precisely characterize individual neurotropic JCPyV strains of 3 patients with PML without the bias caused by assembly of short sequence reads.In the cerebrospinal fluid sample of a 73-year-old woman with rapid PML onset, 3 distinct JCPyV populations could be identified. All viral populations were characterized by rearrangements within the noncoding regulatory region (NCCR) and 1 point mutation, S267L in the VP1 gene, suggestive of neurotropic strains. One patient with PML had a single neurotropic strain with rearranged NCCR, and 1 patient had a single strain with small NCCR alterations.We report here, for the first time, full characterization of individual neurotropic JCPyV strains in the cerebrospinal fluid of patients with PML. It remains to be established whether PML pathogenesis is driven by one or several neurotropic strains in an individual.


July 19, 2019

Targeted capture and sequencing of gene-sized DNA molecules.

Targeted capture provides an efficient and sensitive means for sequencing specific genomic regions in a high-throughput manner. To date, this method has mostly been used to capture exons from the genome (the exome) using short insert libraries and short-read sequencing technology, enabling the identification of genetic variants or new members of large gene families. Sequencing larger molecules results in the capture of whole genes, including intronic and intergenic sequences that are typically more polymorphic and allow the resolution of the gene structure of homologous genes, which are often clustered together on the chromosome. Here, we describe an improved method for the capture and single-molecule sequencing of DNA molecules as large as 7 kb by means of size selection and optimized PCR conditions. Our approach can be used to capture, sequence, and distinguish between similar members of the NB-LRR gene family-key genes in plant immune systems.


July 19, 2019

The establishment and diversification of epidemic-associated serogroup W meningococcus in the African meningitis belt, 1994 to 2012.

Epidemics of invasive meningococcal disease (IMD) caused by meningococcal serogroup A have been eliminated from the sub-Saharan African so-called “meningitis belt” by the meningococcal A conjugate vaccine (MACV), and yet, other serogroups continue to cause epidemics. Neisseria meningitidis serogroup W remains a major cause of disease in the region, with most isolates belonging to clonal complex 11 (CC11). Here, the genetic variation within and between epidemic-associated strains was assessed by sequencing the genomes of 92 N. meningitidis serogroup W isolates collected between 1994 and 2012 from both sporadic and epidemic IMD cases, 85 being from selected meningitis belt countries. The sequenced isolates belonged to either CC175 (n = 9) or CC11 (n = 83). The CC11 N. meningitidis serogroup W isolates belonged to a single lineage comprising four major phylogenetic subclades. Separate CC11 N. meningitidis serogroup W subclades were associated with the 2002 and 2012 Burkina Faso epidemics. The subclade associated with the 2012 epidemic included isolates found in Burkina Faso and Mali during 2011 and 2012, which descended from a strain very similar to the Hajj (Islamic pilgrimage to Mecca)-related Saudi Arabian outbreak strain from 2000. The phylogeny of isolates from 2012 reflected their geographic origin within Burkina Faso, with isolates from the Malian border region being closely related to the isolates from Mali. Evidence of ongoing evolution, international transmission, and strain replacement stresses the importance of maintaining N. meningitidis surveillance in Africa following the MACV implementation. IMPORTANCE Meningococcal disease (meningitis and bloodstream infections) threatens millions of people across the meningitis belt of sub-Saharan Africa. A vaccine introduced in 2010 protects against Africa’s then-most common cause of meningococcal disease, N. meningitidis serogroup A. However, other serogroups continue to cause epidemics in the region-including serogroup W. The rapid identification of strains that have been associated with prior outbreaks can improve the assessment of outbreak risk and enable timely preparation of public health responses, including vaccination. Phylogenetic analysis of newly sequenced serogroup W strains isolated from 1994 to 2012 identified two groups of strains linked to large epidemics in Burkina Faso, one being descended from a strain that caused an outbreak during the Hajj pilgrimage in 2000. We find that applying whole-genome sequencing to meningococcal disease surveillance collections improves the discrimination among strains, even within a single nation-wide epidemic, which can be used to better understand pathogen spread.


Talk with an expert

If you have a question, need to check the status of an order, or are interested in purchasing an instrument, we're here to help.