Menu
July 19, 2019

Characterization of a large antibiotic resistance plasmid found in enteropathogenic Escherichia coli strain B171 and its relatedness to plasmids of diverse E. coli and Shigella.

Enteropathogenic Escherichia coli (EPEC) is a leading cause of severe infantile diarrhea in developing countries. Previous research has focused on the diversity of the EPEC virulence plasmid, whereas less is known regarding the genetic content and distribution of antibiotic resistance plasmids carried by EPEC. A previous study demonstrated that in addition to the virulence plasmid, reference EPEC strain B171 harbors a second, larger plasmid that confers antibiotic resistance. To further understand the genetic diversity and dissemination of antibiotic resistance plasmids among EPEC strains, we describe the complete sequence of an antibiotic resistance plasmid from EPEC strain B171. The resistance plasmid, pB171_90, has a completed sequence length of 90,229 bp, a GC content of 54.55%, and carries protein-encoding genes involved in conjugative transfer, resistance to tetracycline (tetA), sulfonamides (sulI), and mercury, as well as several virulence-associated genes, including the transcriptional regulator hha and the putative calcium sequestration inhibitor (csi). In silico detection of the pB171_90 genes among 4,798 publicly available E. coli genome assemblies indicates that the unique genes of pB171_90 (csi and traI) are primarily restricted to genomes identified as EPEC or enterotoxigenic E. coli However, conserved regions of the pB171_90 plasmid containing genes involved in replication, stability, and antibiotic resistance were identified among diverse E. coli pathotypes. Interestingly, pB171_90 also exhibited significant similarity with a sequenced plasmid from Shigella dysenteriae type I. Our findings demonstrate the mosaic nature of EPEC antibiotic resistance plasmids and highlight the need for additional sequence-based characterization of antibiotic resistance plasmids harbored by pathogenic E. coli. Copyright © 2017 American Society for Microbiology.


July 19, 2019

Sequencing the CYP2D6 gene: from variant allele discovery to clinical pharmacogenetic testing.

CYP2D6 is one of the most studied enzymes in the field of pharmacogenetics. The CYP2D6 gene is highly polymorphic with over 100 catalogued star (*) alleles, and clinical CYP2D6 testing is increasingly accessible and supported by practice guidelines. However, the degree of variation at the CYP2D6 locus and homology with its pseudogenes make interrogating CYP2D6 by short-read sequencing challenging. Moreover, accurate prediction of CYP2D6 metabolizer status necessitates analysis of duplicated alleles when an increased copy number is detected. These challenges have recently been overcome by long-read CYP2D6 sequencing; however, such platforms are not widely available. This review highlights the genomic complexities of CYP2D6, current sequencing methods and the evolution of CYP2D6 from allele discovery to clinical pharmacogenetic testing.


July 19, 2019

SMRT Gate: A method for validation of synthetic constructs on Pacific Biosciences sequencing platforms.

Current DNA assembly methods are prone to sequence errors, requiring rigorous quality control (QC) to identify incorrect assemblies or synthesized constructs. Such errors can lead to misinterpretation of phenotypes. Because of this intrinsic problem, routine QC analysis is generally performed on three or more clones using a combination of restriction endonuclease assays, colony PCR, and Sanger sequencing. However, as new automation methods emerge that enable high-throughput assembly, QC using these techniques has become a major bottleneck. Here, we describe a quick and affordable methodology for the QC of synthetic constructs. Our method involves a one-pot digestion-ligation DNA assembly reaction, based on the Golden Gate assembly methodology, that is coupled with Pacific Biosciences’ Single Molecule, Real-Time (PacBio SMRT) sequencing technology.


July 19, 2019

Genomic epidemiology of global Klebsiella pneumoniae carbapenemase (KPC)-producing Escherichia coli.

The dissemination of carbapenem resistance in Escherichia coli has major implications for the management of common infections. bla KPC, encoding a transmissible carbapenemase (KPC), has historically largely been associated with Klebsiella pneumoniae, a predominant plasmid (pKpQIL), and a specific transposable element (Tn4401, ~10?kb). Here we characterize the genetic features of bla KPC emergence in global E. coli, 2008-2013, using both long- and short-read whole-genome sequencing. Amongst 43/45 successfully sequenced bla KPC-E. coli strains, we identified substantial strain diversity (n?=?21 sequence types, 18% of annotated genes in the core genome); substantial plasmid diversity (=9 replicon types); and substantial bla KPC-associated, mobile genetic element (MGE) diversity (50% not within complete Tn4401 elements). We also found evidence of inter-species, regional and international plasmid spread. In several cases bla KPC was found on high copy number, small Col-like plasmids, previously associated with horizontal transmission of resistance genes in the absence of antimicrobial selection pressures. E. coli is a common human pathogen, but also a commensal in multiple environmental and animal reservoirs, and easily transmissible. The association of bla KPC with a range of MGEs previously linked to the successful spread of widely endemic resistance mechanisms (e.g. bla TEM, bla CTX-M) suggests that it may become similarly prevalent.


July 19, 2019

Multiple independent changes in mitochondrial genome conformation in chlamydomonadalean algae

Chlamydomonadalean green algae are no stranger to linear mitochondrial genomes, particularly members of the Reinhardtinia clade. At least nine different Reinhardtinia species are known to have linear mitochondrial DNAs (mtDNAs), including the model species Chlamydomonas reinhardtii. Thus, it is no surprise that some have suggested that the most recent common ancestor of the Reinhardtinia clade had a linear mtDNA. But the recent uncovering of circular-mapping mtDNAs in a range of Reinhardtinia algae, such as Volvox carteri and Tetrabaena socialis, has shed doubt on this hypothesis. Here, we explore mtDNA sequence and structure within the colonial Reinhardtinia algae Yamagishiella unicocca and Eudorina sp. NIES-3984, which occupy phylogenetically intermediate positions between species with opposing mtDNA mapping structures. Sequencing and gel electrophoresis data indicate that Y. unicocca has a linear monomeric mitochondrial genome with long (3?kb) palindromic telomeres. Conversely, the mtDNA of Eudorina sp., despite having an identical gene order to that of Y. unicocca, assembled as a circular-mapping molecule. Restriction digests of Eudorina sp. mtDNA supported its circular map, but also revealed a linear monomeric form with a matching architecture and gene order to the Y. unicocca mtDNA. Based on these data, we suggest that there have been at least three separate shifts in mtDNA conformation in the Reinhardtinia, and that the common ancestor of this clade had a linear monomeric mitochondrial genome with palindromic telomeres.


July 19, 2019

The complete genome sequence of the phytopathogenic fungus Sclerotinia sclerotiorum reveals insights into the genome architecture of broad host range pathogens.

Sclerotinia sclerotiorum is a phytopathogenic fungus with over 400 hosts including numerous economically important cultivated species. This contrasts many economically destructive pathogens that only exhibit a single or very few hosts. Many plant pathogens exhibit a “two-speed” genome. So described because their genomes contain alternating gene rich, repeat sparse and gene poor, repeat-rich regions. In fungi, the repeat-rich regions may be subjected to a process termed repeat-induced point mutation (RIP). Both repeat activity and RIP are thought to play a significant role in evolution of secreted virulence proteins, termed effectors. We present a complete genome sequence of S. sclerotiorum generated using Single Molecule Real-Time Sequencing technology with highly accurate annotations produced using an extensive RNA sequencing data set. We identified 70 effector candidates and have highlighted their in planta expression profiles. Furthermore, we characterized the genome architecture of S. sclerotiorum in comparison to plant pathogens that exhibit “two-speed” genomes. We show that there is a significant association between positions of secreted proteins and regions with a high RIP index in S. sclerotiorum but we did not detect a correlation between secreted protein proportion and GC content. Neither did we detect a negative correlation between CDS content and secreted protein proportion across the S. sclerotiorum genome. We conclude that S. sclerotiorum exhibits subtle signatures of enhanced mutation of secreted proteins in specific genomic compartments as a result of transposition and RIP activity. However, these signatures are not observable at the whole-genome scale.


July 19, 2019

How Single Molecule Real-Time Sequencing and haplotype phasing have enabled reference-grade diploid genome assembly of wine grapes.

Domesticated grapevines (Vitis vinifera) have relatively small genomes of about 500 Mb (Lodhi and Reisch, 1995; Jaillon et al., 2007; Velasco et al., 2007), which is similar to other small-genomes species like rice (430 Mb; Goff et al., 2002), medicago (500 Mb; Tang et al., 2014), and poplar (465 Mb; Tuskan et al., 2006). Despite their small genome size, the sequencing and assembling of grapevine genomes is difficult because of high levels of heterozygosity. The high heterozygosity in domesticated grapes may be due, in part, to their domestication from an obligately outcrossing, dioecious wild progenitor. Domesticated grapes can be selfed, in theory, because their mating system transitioned to hermaphroditic, self-fertile flowers during domestication. In practice, however, selfed progeny tend to be non-viable, presumably due to a high deleterious recessive load and resulting inbreeding depression. As a consequence of these fitness effects, most grape cultivars are crosses between distantly related parents (Strefeler et al., 1992; Ohmi et al., 1993; Bowers and Meredith, 1997; Sefc et al., 1998; Lopes et al., 1999; Di Gaspero et al., 2005; Tapia et al., 2007; Ibáñez et al., 2009; Cipriani et al., 2010; Myles et al., 2011; Lacombe et al., 2013).


July 19, 2019

New technologies boost genome quality.

Three years ago, Erich Jarvis helped mastermind a massive DNA sequenc- ing effort that netted genomes for more than 40 bird species and produced a better avian family tree. But when he tried to compare the avian genomes to those of other species to learn about the evolution and function of several key brain genes, he was stymied. His team found that gene sequences from most of the comparison species—even humans—were incomplete, missing, or misplaced in the larger genome. The group had to resequence sections of sev- eral genomes to get the needed data, delaying their project many months.


July 19, 2019

Re-sequencing transgenic plants revealed rearrangements at T-DNA inserts, and integration of a short T-DNA fragment, but no increase of small mutations elsewhere.

Transformation resulted in deletions and translocations at T-DNA inserts, but not in genome-wide small mutations. A tiny T-DNA splinter was detected that probably would remain undetected by conventional techniques. We investigated to which extent Agrobacterium tumefaciens-mediated transformation is mutagenic, on top of inserting T-DNA. To prevent mutations due to in vitro propagation, we applied floral dip transformation of Arabidopsis thaliana. We re-sequenced the genomes of five primary transformants, and compared these to genomic sequences derived from a pool of four wild-type plants. By genome-wide comparisons, we identified ten small mutations in the genomes of the five transgenic plants, not correlated to the positions or number of T-DNA inserts. This mutation frequency is within the range of spontaneous mutations occurring during seed propagation in A. thaliana, as determined earlier. In addition, we detected small as well as large deletions specifically at the T-DNA insert sites. Furthermore, we detected partial T-DNA inserts, one of these a tiny 50-bp fragment originating from a central part of the T-DNA construct used, inserted into the plant genome without flanking other T-DNA. Because of its small size, we named this fragment a T-DNA splinter. As far as we know this is the first report of such a small T-DNA fragment insert in absence of any T-DNA border sequence. Finally, we found evidence for translocations from other chromosomes, flanking T-DNA inserts. In this study, we showed that next-generation sequencing (NGS) is a highly sensitive approach to detect T-DNA inserts in transgenic plants.


July 19, 2019

First report of two complete Clostridium chauvoei genome sequences and detailed in silico genome analysis.

Clostridium (C.) chauvoei is a Gram-positive, spore forming, anaerobic bacterium. It causes black leg in ruminants, a typically fatal histotoxic myonecrosis. High quality circular genome sequences were generated for the C. chauvoei type strain DSM 7528(T) (ATCC 10092(T)) and a field strain 12S0467 isolated in Germany. The origin of replication (oriC) was comparable to that of Bacillus subtilis in structure with two regions containing DnaA boxes. Similar prophages were identified in the genomes of both C. chauvoei strains which also harbored hemolysin and bacterial spore formation genes. A CRISPR type I-B system with limited variations in the repeat number was identified. Sporulation and germination process related genes were homologous to that of the Clostridia cluster I group but novel variations for regulatory genes were identified indicative for strain specific control of regulatory events. Phylogenomics showed a higher relatedness to C. septicum than to other so far sequenced genomes of species belonging to the genus Clostridium. Comparative genome analysis of three C. chauvoei circular genome sequences revealed the presence of few inversions and translocations in locally collinear blocks (LCBs). The species genome also shows a large number of genes involved in proteolysis, genes for glycosyl hydrolases and metal iron transportation genes which are presumably involved in virulence and survival in the host. Three conserved flagellar genes (fliC) were identified in each of the circular genomes. In conclusion this is the first comparative analysis of circular genomes for the species C. chauvoei, enabling insights into genome composition and virulence factor variation. Copyright © 2017 The Authors. Published by Elsevier B.V. All rights reserved.


July 19, 2019

An improved Plasmodium cynomolgi genome assembly reveals an unexpected methyltransferase gene expansion.

Plasmodium cynomolgi, a non-human primate malaria parasite species, has been an important model parasite since its discovery in 1907. Similarities in the biology of P. cynomolgi to the closely related, but less tractable, human malaria parasite P. vivax make it the model parasite of choice for liver biology and vaccine studies pertinent to P. vivax malaria. Molecular and genome-scale studies of P. cynomolgi have relied on the current reference genome sequence, which remains highly fragmented with 1,649 unassigned scaffolds and little representation of the subtelomeres.  Methods: Using long-read sequence data (Pacific Biosciences SMRT technology), we assembled and annotated a new reference genome sequence, PcyM, sourced from an Indian rhesus monkey. We compare the newly assembled genome sequence with those of several other Plasmodium species, including a re-annotated P. coatneyi assembly.The new PcyM genome assembly is of significantly higher quality than the existing reference, comprising only 56 pieces, no gaps and an improved average gene length. Detailed manual curation has ensured a comprehensive annotation of the genome with 6,632 genes, nearly 1,000 more than previously attributed to P. cynomolgi. The new assembly also has an improved representation of the subtelomeric regions, which account for nearly 40% of the sequence. Within the subtelomeres, we identified more than 1300 Plasmodium interspersed repeat ( pir) genes, as well as a striking expansion of 36 methyltransferase pseudogenes that originated from a single copy on chromosome 9.The manually curated PcyM reference genome sequence is an important new resource for the malaria research community. The high quality and contiguity of the data have enabled the discovery of a novel expansion of methyltransferase in the subtelomeres, and illustrates the new comparative genomics capabilities that are being unlocked by complete reference genomes.


July 19, 2019

Defective HIV-1 proviruses are expressed and can be recognized by cytotoxic T lymphocytes, which shape the proviral landscape.

Despite antiretroviral therapy, HIV-1 persists in memory CD4(+) T cells, creating a barrier to cure. The majority of HIV-1 proviruses are defective and considered clinically irrelevant. Using cells from HIV-1-infected individuals and reconstructed patient-derived defective proviruses, we show that defective proviruses can be transcribed into RNAs that are spliced and translated. Proviruses with defective major splice donors (MSDs) can activate novel splice sites to produce HIV-1 transcripts, and cells with these proviruses can be recognized by HIV-1-specific cytotoxic T lymphocytes (CTLs). Further, cells with proviruses containing lethal mutations upstream of CTL epitopes can also be recognized by CTLs, potentially through aberrant translation. Thus, CTLs may change the landscape of HIV-1 proviruses by preferentially targeting cells with specific types of defective proviruses. Additionally, the expression of defective proviruses will need to be considered in the measurement of HIV-1 latency reversal. Copyright © 2017 Elsevier Inc. All rights reserved.


July 19, 2019

Evolutionary restoration of fertility in an interspecies hybrid yeast, by whole-genome duplication after a failed mating-type switch.

Many interspecies hybrids have been discovered in yeasts, but most of these hybrids are asexual and can replicate only mitotically. Whole-genome duplication has been proposed as a mechanism by which interspecies hybrids can regain fertility, restoring their ability to perform meiosis and sporulate. Here, we show that this process occurred naturally during the evolution of Zygosaccharomyces parabailii, an interspecies hybrid that was formed by mating between 2 parents that differed by 7% in genome sequence and by many interchromosomal rearrangements. Surprisingly, Z. parabailii has a full sexual cycle and is genetically haploid. It goes through mating-type switching and autodiploidization, followed by immediate sporulation. We identified the key evolutionary event that enabled Z. parabailii to regain fertility, which was breakage of 1 of the 2 homeologous copies of the mating-type (MAT) locus in the hybrid, resulting in a chromosomal rearrangement and irreparable damage to 1 MAT locus. This rearrangement was caused by HO endonuclease, which normally functions in mating-type switching. With 1 copy of MAT inactivated, the interspecies hybrid now behaves as a haploid. Our results provide the first demonstration that MAT locus damage is a naturally occurring evolutionary mechanism for whole-genome duplication and restoration of fertility to interspecies hybrids. The events that occurred in Z. parabailii strongly resemble those postulated to have caused ancient whole-genome duplication in an ancestor of Saccharomyces cerevisiae.


July 19, 2019

Discovery and biosynthesis of gladiolin: A Burkholderia gladioli antibiotic with promising activity against Mycobacterium tuberculosis.

An antimicrobial activity screen of Burkholderia gladioli BCC0238, a clinical isolate from a cystic fibrosis patient, led to the discovery of gladiolin, a novel macrolide antibiotic with potent activity against Mycobacterium tuberculosis H37Rv. Gladiolin is structurally related to etnangien, a highly unstable antibiotic from Sorangium cellulosum that is also active against Mycobacteria. Like etnangien, gladiolin was found to inhibit RNA polymerase, a validated drug target in M. tuberculosis. However, gladiolin lacks the highly labile hexaene moiety of etnangien and was thus found to possess significantly increased chemical stability. Moreover, gladiolin displayed low mammalian cytotoxicity and good activity against several M. tuberculosis clinical isolates, including four that are resistant to isoniazid and one that is resistant to both isoniazid and rifampicin. Overall, these data suggest that gladiolin may represent a useful starting point for the development of novel drugs to tackle multidrug-resistant tuberculosis. The B. gladioli BCC0238 genome was sequenced using Single Molecule Real Time (SMRT) technology. This resulted in four contiguous sequences: two large circular chromosomes and two smaller putative plasmids. Analysis of the chromosome sequences identified 49 putative specialized metabolite biosynthetic gene clusters. One such gene cluster, located on the smaller of the two chromosomes, encodes a trans-acyltransferase (trans-AT) polyketide synthase (PKS) multienzyme that was hypothesized to assemble gladiolin. Insertional inactivation of a gene in this cluster encoding one of the PKS subunits abrogated gladiolin production, confirming that the gene cluster is responsible for biosynthesis of the antibiotic. Comparison of the PKSs responsible for the assembly of gladiolin and etnangien showed that they possess a remarkably similar architecture, obfuscating the biosynthetic mechanisms responsible for most of the structural differences between the two metabolites.


July 19, 2019

A case study into microbial genome assembly gap sequences and finishing strategies.

This study characterized regions of DNA which remained unassembled by either PacBio and Illumina sequencing technologies for seven bacterial genomes. Two genomes were manually finished using bioinformatics and PCR/Sanger sequencing approaches and regions not assembled by automated software were analyzed. Gaps present within Illumina assemblies mostly correspond to repetitive DNA regions such as multiple rRNA operon sequences. PacBio gap sequences were evaluated for several properties such as GC content, read coverage, gap length, ability to form strong secondary structures, and corresponding annotations. Our hypothesis that strong secondary DNA structures blocked DNA polymerases and contributed to gap sequences was not accepted. PacBio assemblies had few limitations overall and gaps were explained as cumulative effect of lower than average sequence coverage and repetitive sequences at contig termini. An important aspect of the present study is the compilation of biological features that interfered with assembly and included active transposons, multiple plasmid sequences, phage DNA integration, and large sequence duplication. Our targeted genome finishing approach and systematic evaluation of the unassembled DNA will be useful for others looking to close, finish, and polish microbial genome sequences.


Talk with an expert

If you have a question, need to check the status of an order, or are interested in purchasing an instrument, we're here to help.