Menu
July 19, 2019

The complete methylome of Helicobacter pylori UM032.

The genome of the human gastric pathogen Helicobacter pylori encodes a large number of DNA methyltransferases (MTases), some of which are shared among many strains, and others of which are unique to a given strain. The MTases have potential roles in the survival of the bacterium. In this study, we sequenced a Malaysian H. pylori clinical strain, designated UM032, by using a combination of PacBio Single Molecule, Real-Time (SMRT) and Illumina MiSeq next generation sequencing platforms, and used the SMRT data to characterize the set of methylated bases (the methylome).The N4-methylcytosine and N6-methyladenine modifications detected at single-base resolution using SMRT technology revealed 17 methylated sequence motifs corresponding to one Type I and 16 Type II restriction-modification (R-M) systems. Previously unassigned methylation motifs were now assigned to their respective MTases-coding genes. Furthermore, one gene that appears to be inactive in the H. pylori UM032 genome during normal growth was characterized by cloning.Consistent with previously-studied H. pylori strains, we show that strain UM032 contains a relatively large number of R-M systems, including some MTase activities with novel specificities. Additional studies are underway to further elucidating the biological significance of the R-M systems in the physiology and pathogenesis of H. pylori.


July 19, 2019

Chaos of rearrangements in the mating-type chromosomes of the anther-smut fungus Microbotryum lychnidis-dioicae.

Sex chromosomes in plants and animals and fungal mating-type chromosomes often show exceptional genome features, with extensive suppression of homologous recombination and cytological differentiation between members of the diploid chromosome pair. Despite strong interest in the genetics of these chromosomes, their large regions of suppressed recombination often are enriched in transposable elements and therefore can be challenging to assemble. Here we show that the latest improvements of the PacBio sequencing yield assembly of the whole genome of the anther-smut fungus, Microbotryum lychnidis-dioicae (the pathogenic fungus causing anther-smut disease of Silene latifolia), into finished chromosomes or chromosome arms, even for the repeat-rich mating-type chromosomes and centromeres. Suppressed recombination of the mating-type chromosomes is revealed to span nearly 90% of their lengths, with extreme levels of rearrangements, transposable element accumulation, and differentiation between the two mating types. We observed no correlation between allelic divergence and physical position in the nonrecombining regions of the mating-type chromosomes. This may result from gene conversion or from rearrangements of ancient evolutionary strata, i.e., successive steps of suppressed recombination. Centromeres were found to be composed mainly of copia-like transposable elements and to possess specific minisatellite repeats identical between the different chromosomes. We also identified subtelomeric motifs. In addition, extensive signs of degeneration were detected in the nonrecombining regions in the form of transposable element accumulation and of hundreds of gene losses on each mating-type chromosome. Furthermore, our study highlights the potential of the latest breakthrough PacBio chemistry to resolve complex genome architectures. Copyright © 2015 by the Genetics Society of America.


July 19, 2019

Insertion sequence IS26 reorganizes plasmids in clinically isolated multidrug-resistant bacteria by replicative transposition.

Carbapenemase-producing Enterobacteriaceae (CPE), which are resistant to most or all known antibiotics, constitute a global threat to public health. Transposable elements are often associated with antibiotic resistance determinants, suggesting a role in the emergence of resistance. One insertion sequence, IS26, is frequently associated with resistance determinants, but its role remains unclear. We have analyzed the genomic contexts of 70 IS26 copies in several clinical and surveillance CPE isolates from the National Institutes of Health Clinical Center. We used target site duplications and their patterns as guides and found that a large fraction of plasmid reorganizations result from IS26 replicative transpositions, including replicon fusions, DNA inversions, and deletions. Replicative transposition could also be inferred for transposon Tn4401, which harbors the carbapenemase blaKPC gene. Thus, replicative transposition is important in the ongoing reorganization of plasmids carrying multidrug-resistant determinants, an observation that carries substantial clinical and epidemiological implications for understanding how such extreme drug resistance phenotypes evolve.Although IS26 is frequently reported to reside in resistance plasmids of clinical isolates, the characteristic hallmark of transposition, target site duplication (TSD), is generally not observed, raising questions about the mode of transposition for IS26. The previous observation of cointegrate formation during transposition implies that IS26 transposes via a replicative mechanism. The other possible outcome of replicative transposition is DNA inversion or deletion, when transposition occurs intramolecularly, and this would also generate a specific TSD pattern that might also serve as supporting evidence for the transposition mechanism. The numerous examples we present here demonstrate that replicative transposition, used by many mobile elements (including IS26 and Tn4401), is prevalent in the plasmids of clinical isolates and results in significant plasmid reorganization. This study also provides a method to trace the evolution of resistance plasmids based on TSD patterns. Copyright © 2015 He et al.


July 19, 2019

Population structure of mitochondrial genomes in Saccharomyces cerevisiae.

Rigorous study of mitochondrial functions and cell biology in the budding yeast, Saccharomyces cerevisiae has advanced our understanding of mitochondrial genetics. This yeast is now a powerful model for population genetics, owing to large genetic diversity and highly structured populations among wild isolates. Comparative mitochondrial genomic analyses between yeast species have revealed broad evolutionary changes in genome organization and architecture. A fine-scale view of recent evolutionary changes within S. cerevisiae has not been possible due to low numbers of complete mitochondrial sequences.To address challenges of sequencing AT-rich and repetitive mitochondrial DNAs (mtDNAs), we sequenced two divergent S. cerevisiae mtDNAs using a single-molecule sequencing platform (PacBio RS). Using de novo assemblies, we generated highly accurate complete mtDNA sequences. These mtDNA sequences were compared with 98 additional mtDNA sequences gathered from various published collections. Phylogenies based on mitochondrial coding sequences and intron profiles revealed that intraspecific diversity in mitochondrial genomes generally recapitulated the population structure of nuclear genomes. Analysis of intergenic sequence indicated a recent expansion of mobile elements in certain populations. Additionally, our analyses revealed that certain populations lacked introns previously believed conserved throughout the species, as well as the presence of introns never before reported in S. cerevisiae.Our results revealed that the extensive variation in S. cerevisiae mtDNAs is often population specific, thus offering a window into the recent evolutionary processes shaping these genomes. In addition, we offer an effective strategy for sequencing these challenging AT-rich mitochondrial genomes for small scale projects.


July 19, 2019

Complete genome sequence of Sporisorium scitamineum and biotrophic interaction transcriptome with sugarcane.

Sporisorium scitamineum is a biotrophic fungus responsible for the sugarcane smut, a worldwide spread disease. This study provides the complete sequence of individual chromosomes of S. scitamineum from telomere to telomere achieved by a combination of PacBio long reads and Illumina short reads sequence data, as well as a draft sequence of a second fungal strain. Comparative analysis to previous available sequences of another strain detected few polymorphisms among the three genomes. The novel complete sequence described herein allowed us to identify and annotate extended subtelomeric regions, repetitive elements and the mitochondrial DNA sequence. The genome comprises 19,979,571 bases, 6,677 genes encoding proteins, 111 tRNAs and 3 assembled copies of rDNA, out of our estimated number of copies as 130. Chromosomal reorganizations were detected when comparing to sequences of S. reilianum, the closest smut relative, potentially influenced by repeats of transposable elements. Repetitive elements may have also directed the linkage of the two mating-type loci. The fungal transcriptome profiling from in vitro and from interaction with sugarcane at two time points (early infection and whip emergence) revealed that 13.5% of the genes were differentially expressed in planta and particular to each developmental stage. Among them are plant cell wall degrading enzymes, proteases, lipases, chitin modification and lignin degradation enzymes, sugar transporters and transcriptional factors. The fungus also modulates transcription of genes related to surviving against reactive oxygen species and other toxic metabolites produced by the plant. Previously described effectors in smut/plant interactions were detected but some new candidates are proposed. Ten genomic islands harboring some of the candidate genes unique to S. scitamineum were expressed only in planta. RNAseq data was also used to reassure gene predictions.


July 19, 2019

Identification of a common risk haplotype for canine idiopathic epilepsy in the ADAM23 gene.

Idiopathic epilepsy is a common neurological disease in human and domestic dogs but relatively few risk genes have been identified to date. The seizure characteristics, including focal and generalised seizures, are similar between the two species, with gene discovery facilitated by the reduced genetic heterogeneity of purebred dogs. We have recently identified a risk locus for idiopathic epilepsy in the Belgian Shepherd breed on a 4.4 megabase region on CFA37.We have expanded a previous study replicating the association with a combined analysis of 157 cases and 179 controls in three additional breeds: Schipperke, Finnish Spitz and Beagle (pc?=?2.9e-07, pGWAS?=?1.74E-02). A targeted resequencing of the 4.4 megabase region in twelve Belgian Shepherd cases and twelve controls with opposite haplotypes identified 37 case-specific variants within the ADAM23 gene. Twenty-seven variants were validated in 285 cases and 355 controls from four breeds, resulting in a strong replication of the ADAM23 locus (praw?=?2.76e-15) and the identification of a common 28 kb-risk haplotype in all four breeds. Risk haplotype was present in frequencies of 0.49-0.7 in the breeds, suggesting that ADAM23 is a low penetrance risk gene for canine epilepsy.These results implicate ADAM23 in common canine idiopathic epilepsy, although the causative variant remains yet to be identified. ADAM23 plays a role in synaptic transmission and interacts with known epilepsy genes, LGI1 and LGI2, and should be considered as a candidate gene for human epilepsies.


July 19, 2019

Genetic stabilization of the drug-resistant PMEN1 Pneumococcus lineage by its distinctive DpnIII restriction-modification system.

The human pathogen Streptococcus pneumoniae (pneumococcus) exhibits a high degree of genomic diversity and plasticity. Isolates with high genomic similarity are grouped into lineages that undergo homologous recombination at variable rates. PMEN1 is a pandemic, multidrug-resistant lineage. Heterologous gene exchange between PMEN1 and non-PMEN1 isolates is directional, with extensive gene transfer from PMEN1 strains and only modest transfer into PMEN1 strains. Restriction-modification (R-M) systems can restrict horizontal gene transfer, yet most pneumococcal strains code for either the DpnI or DpnII R-M system and neither limits homologous recombination. Our comparative genomic analysis revealed that PMEN1 isolates code for DpnIII, a third R-M system syntenic to the other Dpn systems. Characterization of DpnIII demonstrated that the endonuclease cleaves unmethylated double-stranded DNA at the tetramer sequence 5′ GATC 3′, and the cognate methylase is a C5 cytosine-specific DNA methylase. We show that DpnIII decreases the frequency of recombination under in vitro conditions, such that the number of transformants is lower for strains transformed with unmethylated DNA than in those transformed with cognately methylated DNA. Furthermore, we have identified two PMEN1 isolates where the DpnIII endonuclease is disrupted, and phylogenetic work by Croucher and colleagues suggests that these strains have accumulated genomic differences at a higher rate than other PMEN1 strains. We propose that the R-M locus is a major determinant of genetic acquisition; the resident R-M system governs the extent of genome plasticity.Pneumococcus is one of the most important community-acquired bacterial pathogens. Pneumococcal strains can develop resistance to antibiotics and to serotype vaccines by acquiring genes from other strains or species. Thus, genomic plasticity is associated with strain adaptability and pneumococcal success. PMEN1 is a widespread and multidrug-resistant highly pathogenic pneumococcal lineage, which has evolved over the past century and displays a relatively stable genome. In this study, we characterize DpnIII, a restriction-modification (R-M) system that limits recombination. DpnIII is encountered in the PMEN1 lineage, where it replaces other R-M systems that do not decrease plasticity. Our hypothesis is that this genomic region, where different pneumococcal lineages code for variable R-M systems, plays a role in the fine-tuning of the extent of genomic plasticity. It is possible that well-adapted lineages such as PMEN1 have a mechanism to increase genomic stability, rather than foster genomic plasticity. Copyright © 2015 Eutsey et al.


July 19, 2019

Assembly and diploid architecture of an individual human genome via single-molecule technologies.

We present the first comprehensive analysis of a diploid human genome that combines single-molecule sequencing with single-molecule genome maps. Our hybrid assembly markedly improves upon the contiguity observed from traditional shotgun sequencing approaches, with scaffold N50 values approaching 30 Mb, and we identified complex structural variants (SVs) missed by other high-throughput approaches. Furthermore, by combining Illumina short-read data with long reads, we phased both single-nucleotide variants and SVs, generating haplotypes with over 99% consistency with previous trio-based studies. Our work shows that it is now possible to integrate single-molecule and high-throughput sequence data to generate de novo assembled genomes that approach reference quality.


July 19, 2019

Parallel epidemics of community-associated methicillin-resistant Staphylococcus aureus USA300 infection in North and South America.

The community-associated methicillin-resistant Staphylococcus aureus (CA-MRSA) epidemic in the United States is attributed to the spread of the USA300 clone. An epidemic of CA-MRSA closely related to USA300 has occurred in northern South America (USA300 Latin-American variant, USA300-LV). Using phylogenomic analysis, we aimed to understand the relationships between these 2 epidemics.We sequenced the genomes of 51 MRSA clinical isolates collected between 1999 and 2012 from the United States, Colombia, Venezuela, and Ecuador. Phylogenetic analysis was used to infer the relationships and times since the divergence of the major clades.Phylogenetic analyses revealed 2 dominant clades that segregated by geographical region, had a putative common ancestor in 1975, and originated in 1989, in North America, and in 1985, in South America. Emergence of these parallel epidemics coincides with the independent acquisition of the arginine catabolic mobile element (ACME) in North American isolates and a novel copper and mercury resistance (COMER) mobile element in South American isolates.Our results reveal the existence of 2 parallel USA300 epidemics that shared a recent common ancestor. The simultaneous rapid dissemination of these 2 epidemic clades suggests the presence of shared, potentially convergent adaptations that enhance fitness and ability to spread.© The Author 2015. Published by Oxford University Press on behalf of the Infectious Diseases Society of America. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.


July 19, 2019

Microplitis demolitor bracovirus proviral loci and clustered replication genes exhibit distinct DNA amplification patterns during replication.

Polydnaviruses are large, double-stranded DNA viruses that are beneficial symbionts of parasitoid wasps. Polydnaviruses in the genus Bracovirus (BVs) persist in wasps as proviruses, and their genomes consist of two functional components referred to as proviral segments and nudivirus-like genes. Prior studies established that the DNA domains where proviral segments reside are amplified during replication and that segments within amplified loci are circularized before packaging into nucleocapsids. One DNA domain where nudivirus-like genes are located is also amplified but never packaged into virions. We recently sequenced the genome of the braconid Microplitis demolitor, which carries M. demolitor bracovirus (MdBV). Here, we took advantage of this resource to characterize the DNAs that are amplified during MdBV replication using a combination of Illumina and Pacific Biosciences sequencing approaches. The results showed that specific nucleotide sites identify the boundaries of amplification for proviral loci. Surprisingly, however, amplification of loci 3, 4, 6, and 8 produced head-to-tail concatemeric intermediates; loci 1, 2, and 5 produced head-to-head/tail-to-tail concatemers; and locus 7 yielded no identified concatemers. Sequence differences at amplification junctions correlated with the types of amplification intermediates the loci produced, while concatemer processing gave rise to the circularized DNAs that are packaged into nucleocapsids. The MdBV nudivirus-like gene cluster was also amplified, albeit more weakly than most proviral loci and with nondiscrete boundaries. Overall, the MdBV genome exhibited three patterns of DNA amplification during replication. Our data also suggest that PacBio sequencing could be useful in studying the replication intermediates produced by other DNA viruses. Polydnaviruses are of fundamental interest because they provide a novel example of viruses evolving into beneficial symbionts. All polydnaviruses are associated with insects called parasitoid wasps, which are of additional applied interest because many are biological control agents of pest insects. Polydnaviruses in the genus Bracovirus (BVs) evolved ~100 million years ago from an ancestor related to the baculovirus-nudivirus lineage but have also established many novelties due to their symbiotic lifestyle. These include the fact that BVs are transmitted only vertically as proviruses and produce replication-defective virions that package only a portion of the viral genome. Here, we studied Microplitis demolitor bracovirus (MdBV) and report that its genome exhibits three distinct patterns of DNA amplification during replication. We also identify several previously unknown features of BV genomes that correlate with these different amplification patterns. Copyright © 2015, American Society for Microbiology. All Rights Reserved.


July 19, 2019

Combining mass spectrometric metabolic profiling with genomic analysis: a powerful approach for discovering natural products from cyanobacteria.

An innovative approach was developed for the discovery of new natural products by combining mass spectrometric metabolic profiling with genomic analysis and resulted in the discovery of the columbamides, a new class of di- and trichlorinated acyl amides with cannabinomimetic activity. Three species of cultured marine cyanobacteria, Moorea producens 3L, Moorea producens JHB, and Moorea bouillonii PNG, were subjected to genome sequencing and analysis for their recognizable biosynthetic pathways, and this information was then compared with their respective metabolomes as detected by MS profiling. By genome analysis, a presumed regulatory domain was identified upstream of several previously described biosynthetic gene clusters in two of these cyanobacteria, M. producens 3L and M. producens JHB. A similar regulatory domain was identified in the M. bouillonii PNG genome, and a corresponding downstream biosynthetic gene cluster was located and carefully analyzed. Subsequently, MS-based molecular networking identified a series of candidate products, and these were isolated and their structures rigorously established. On the basis of their distinctive acyl amide structure, the most prevalent metabolite was evaluated for cannabinomimetic properties and found to be moderate affinity ligands for CB1.


July 19, 2019

Novel katG mutations causing isoniazid resistance in clinical M. tuberculosis isolates.

We report the discovery and confirmation of 23 novel mutations with previously undocumented role in isoniazid (INH) drug resistance, in catalase-peroxidase (katG) gene of Mycobacterium tuberculosis (Mtb) isolates. With these mutations, a synonymous mutation in fabG1 (g609a), and two canonical mutations, we were able to explain 98% of the phenotypic resistance observed in 366 clinical Mtb isolates collected from four high tuberculosis (TB)-burden countries: India, Moldova, Philippines, and South Africa. We conducted overlapping targeted and whole-genome sequencing for variant discovery in all clinical isolates with a variety of INH-resistant phenotypes. Our analysis showed that just two canonical mutations (katG 315AGC-ACC and inhA promoter-15C-T) identified 89.5% of resistance phenotypes in our collection. Inclusion of the 23 novel mutations reported here, and the previously documented point mutation in fabG1, increased the sensitivity of these mutations as markers of INH resistance to 98%. Only six (2%) of the 332 resistant isolates in our collection did not harbor one or more of these mutations. The third most prevalent substitution, at inhA promoter position -8, present in 39 resistant isolates, was of no diagnostic significance since it always co-occurred with katG 315. 79% of our isolates harboring novel mutations belong to genetic group 1 indicating a higher tendency for this group to go down an uncommon evolutionary path and evade molecular diagnostics. The results of this study contribute to our understanding of the mechanisms of INH resistance in Mtb isolates that lack the canonical mutations and could improve the sensitivity of next generation molecular diagnostics.


July 19, 2019

TAL effectors and activation of predicted host targets distinguish Asian from African strains of the rice pathogen Xanthomonas oryzae pv. oryzicola while strict conservation suggests universal importance of five TAL effectors.

Xanthomonas oryzae pv. oryzicola (Xoc) causes the increasingly important disease bacterial leaf streak of rice (BLS) in part by type III delivery of repeat-rich transcription activator-like (TAL) effectors to upregulate host susceptibility genes. By pathogen whole genome, single molecule, real-time sequencing and host RNA sequencing, we compared TAL effector content and rice transcriptional responses across 10 geographically diverse Xoc strains. TAL effector content is surprisingly conserved overall, yet distinguishes Asian from African isolates. Five TAL effectors are conserved across all strains. In a prior laboratory assay in rice cv. Nipponbare, only two contributed to virulence in strain BLS256 but the strict conservation indicates all five may be important, in different rice genotypes or in the field. Concatenated and aligned, TAL effector content across strains largely reflects relationships based on housekeeping genes, suggesting predominantly vertical transmission. Rice transcriptional responses did not reflect these relationships, and on average, only 28% of genes upregulated and 22% of genes downregulated by a strain are up- and down- regulated (respectively) by all strains. However, when only known TAL effector targets were considered, the relationships resembled those of the TAL effectors. Toward identifying new targets, we used the TAL effector-DNA recognition code to predict effector binding elements in promoters of genes upregulated by each strain, but found that for every strain, all upregulated genes had at least one. Filtering with a classifier we developed previously decreases the number of predicted binding elements across the genome, suggesting that it may reduce false positives among upregulated genes. Applying this filter and eliminating genes for which upregulation did not strictly correlate with presence of the corresponding TAL effector, we generated testable numbers of candidate targets for four of the five strictly conserved TAL effectors.


July 19, 2019

A biphasic epigenetic switch controls immunoevasion, virulence and niche adaptation in non-typeable Haemophilus influenzae.

Non-typeable Haemophilus influenzae contains an N(6)-adenine DNA-methyltransferase (ModA) that is subject to phase-variable expression (random ON/OFF switching). Five modA alleles, modA2, modA4, modA5, modA9 and modA10, account for over two-thirds of clinical otitis media isolates surveyed. Here, we use single molecule, real-time (SMRT) methylome analysis to identify the DNA-recognition motifs for all five of these modA alleles. Phase variation of these alleles regulates multiple proteins including vaccine candidates, and key virulence phenotypes such as antibiotic resistance (modA2, modA5, modA10), biofilm formation (modA2) and immunoevasion (modA4). Analyses of a modA2 strain in the chinchilla model of otitis media show a clear selection for ON switching of modA2 in the middle ear. Our results indicate that a biphasic epigenetic switch can control bacterial virulence, immunoevasion and niche adaptation in an animal model system.


July 19, 2019

Single-Molecule Real-Time Sequencing combined with optical mapping yields completely finished fungal genome.

Next-generation sequencing (NGS) technologies have increased the scalability, speed, and resolution of genomic sequencing and, thus, have revolutionized genomic studies. However, eukaryotic genome sequencing initiatives typically yield considerably fragmented genome assemblies. Here, we assessed various state-of-the-art sequencing and assembly strategies in order to produce a contiguous and complete eukaryotic genome assembly, focusing on the filamentous fungus Verticillium dahliae. Compared with Illumina-based assemblies of the V. dahliae genome, hybrid assemblies that also include PacBio-generated long reads establish superior contiguity. Intriguingly, provided that sufficient sequence depth is reached, assemblies solely based on PacBio reads outperform hybrid assemblies and even result in fully assembled chromosomes. Furthermore, the addition of optical map data allowed us to produce a gapless and complete V. dahliae genome assembly of the expected eight chromosomes from telomere to telomere. Consequently, we can now study genomic regions that were previously not assembled or poorly assembled, including regions that are populated by repetitive sequences, such as transposons, allowing us to fully appreciate an organism’s biological complexity. Our data show that a combination of PacBio-generated long reads and optical mapping can be used to generate complete and gapless assemblies of fungal genomes.Studying whole-genome sequences has become an important aspect of biological research. The advent of next-generation sequencing (NGS) technologies has nowadays brought genomic science within reach of most research laboratories, including those that study nonmodel organisms. However, most genome sequencing initiatives typically yield (highly) fragmented genome assemblies. Nevertheless, considerable relevant information related to genome structure and evolution is likely hidden in those nonassembled regions. Here, we investigated a diverse set of strategies to obtain gapless genome assemblies, using the genome of a typical ascomycete fungus as the template. Eventually, we were able to show that a combination of PacBio-generated long reads and optical mapping yields a gapless telomere-to-telomere genome assembly, allowing in-depth genome analyses to facilitate functional studies into an organism’s biology. Copyright © 2015 Faino et al.


Talk with an expert

If you have a question, need to check the status of an order, or are interested in purchasing an instrument, we're here to help.