Menu
September 22, 2019

Intraspecific comparative genomics of isolates of the Norway spruce pathogen (Heterobasidion parviporum) and identification of its potential virulence factors.

Heterobasidion parviporum is an economically most important fungal forest pathogen in northern Europe, causing root and butt rot disease of Norway spruce (Picea abies (L.) Karst.). The mechanisms underlying the pathogenesis and virulence of this species remain elusive. No reference genome to facilitate functional analysis is available for this species.To better understand the virulence factor at both phenotypic and genomic level, we characterized 15 H. parviporum isolates originating from different locations across Finland for virulence, vegetative growth, sporulation and saprotrophic wood decay. Wood decay capability and latitude of fungal origins exerted interactive effects on their virulence and appeared important for H. parviporum virulence. We sequenced the most virulent isolate, the first full genome sequences of H. parviporum as a reference genome, and re-sequenced the remaining 14 H. parviporum isolates. Genome-wide alignments and intrinsic polymorphism analysis showed that these isolates exhibited overall high genomic similarity with an average of at least 96% nucleotide identity when compared to the reference, yet had remarkable intra-specific level of polymorphism with a bias for CpG to TpG mutations. Reads mapping coverage analysis enabled the classification of all predicted genes into five groups and uncovered two genomic regions exclusively present in the reference with putative contribution to its higher virulence. Genes enriched for copy number variations (deletions and duplications) and nucleotide polymorphism were involved in oxidation-reduction processes and encoding domains relevant to transcription factors. Some secreted protein coding genes based on the genome-wide selection pressure, or the presence of variants were proposed as potential virulence candidates.Our study reported on the first reference genome sequence for this Norway spruce pathogen (H. parviporum). Comparative genomics analysis gave insight into the overall genomic variation among this fungal species and also facilitated the identification of several secreted protein coding genes as putative virulence factors for the further functional analysis. We also analyzed and identified phenotypic traits potentially linked to its virulence.


September 22, 2019

Primordial origin and diversification of plasmids in Lyme disease agent bacteria.

With approximately one-third of their genomes consisting of linear and circular plasmids, the Lyme disease agent cluster of species has the most complex genomes among known bacteria. We report here a comparative analysis of plasmids in eleven Borreliella (also known as Borrelia burgdorferi sensu lato) species.We sequenced the complete genomes of two B. afzelii, two B. garinii, and individual B. spielmanii, B. bissettiae, B. valaisiana and B. finlandensis isolates. These individual isolates carry between seven and sixteen plasmids, and together harbor 99 plasmids. We report here a comparative analysis of these plasmids, along with 70 additional Borreliella plasmids available in the public sequence databases. We identify only one new putative plasmid compatibility type (the 30th) among these 169 plasmid sequences, suggesting that all or nearly all such types have now been discovered. We find that the linear plasmids in the non-B. burgdorferi species have undergone the same kinds of apparently random, chaotic rearrangements mediated by non-homologous recombination that we previously discovered in B. burgdorferi. These rearrangements occurred independently in the different species lineages, and they, along with an expanded chromosomal phylogeny reported here, allow the identification of several whole plasmid transfer events among these species. Phylogenetic analyses of the plasmid partition genes show that a majority of the plasmid compatibility types arose early, most likely before separation of the Lyme agent Borreliella and relapsing fever Borrelia clades, and this, with occasional cross species plasmid transfers, has resulted in few if any species-specific or geographic region-specific Borreliella plasmid types.The primordial origin and persistent maintenance of the Borreliella plasmid types support their functional indispensability as well as evolutionary roles in facilitating genome diversity. The improved resolution of Borreliella plasmid phylogeny based on conserved partition-gene clusters will lead to better determination of gene orthology which is essential for prediction of biological function, and it will provide a basis for inferring detailed evolutionary mechanisms of Borreliella genomic variability including homologous gene and plasmid exchanges as well as non-homologous rearrangements.


September 22, 2019

A sub-population of group A Streptococcus elicits a population-wide production of bacteriocins to establish dominance in the host.

Bacteria use quorum sensing (QS) to regulate gene expression. We identified a group A Streptococcus (GAS) strain possessing the QS system sil, which produces functional bacteriocins, through a sequential signaling pathway integrating host and bacterial signals. Host cells infected by GAS release asparagine (ASN), which is sensed by the bacteria to alter its gene expression and rate of proliferation. We show that upon ASN sensing, GAS upregulates expression of the QS autoinducer peptide SilCR. Initial SilCR expression activates the autoinduction cycle for further SilCR production. The autoinduction process propagates throughout the GAS population, resulting in bacteriocin production. Subcutaneous co-injection of mice with a bacteriocin-producing strain and the globally disseminated M1T1 GAS clone results in M1T1 killing within soft tissue. Thus, by sensing host signals, a fraction of a bacterial population can trigger an autoinduction mechanism mediated by QS, which acts on the entire bacterial community to outcompete other bacteria within the infection. Copyright © 2018 Elsevier Inc. All rights reserved.


September 22, 2019

DNA strand-exchange patterns associated with double-strand break-induced and spontaneous mitotic crossovers in Saccharomyces cerevisiae.

Mitotic recombination can result in loss of heterozygosity and chromosomal rearrangements that shape genome structure and initiate human disease. Engineered double-strand breaks (DSBs) are a potent initiator of recombination, but whether spontaneous events initiate with the breakage of one or both DNA strands remains unclear. In the current study, a crossover (CO)-specific assay was used to compare heteroduplex DNA (hetDNA) profiles, which reflect strand exchange intermediates, associated with DSB-induced versus spontaneous events in yeast. Most DSB-induced CO products had the two-sided hetDNA predicted by the canonical DSB repair model, with a switch in hetDNA position from one product to the other at the position of the break. Approximately 40% of COs, however, had hetDNA on only one side of the initiating break. This anomaly can be explained by a modified model in which there is frequent processing of an early invasion (D-loop) intermediate prior to extension of the invading end. Finally, hetDNA tracts exhibited complexities consistent with frequent expansion of the DSB into a gap, migration of strand-exchange junctions, and template switching during gap-filling reactions. hetDNA patterns in spontaneous COs isolated in either a wild-type background or in a background with elevated levels of reactive oxygen species (tsa1? mutant) were similar to those associated with the DSB-induced events, suggesting that DSBs are the major instigator of spontaneous mitotic recombination in yeast.


September 22, 2019

Whole genome sequencing of greater amberjack (Seriola dumerili) for SNP identification on aligned scaffolds and genome structural variation analysis using parallel resequencing

Greater amberjack (Seriola dumerili) is distributed in tropical and temperate waters worldwide and is an important aquaculture fish. We carried out de novo sequencing of the greater amberjack genome to construct a reference genome sequence to identify single nucleotide polymorphisms (SNPs) for breeding amberjack by marker-assisted or gene-assisted selection as well as to identify functional genes for biological traits. We obtained 200 times coverage and constructed a high-quality genome assembly using next generation sequencing technology. The assembled sequences were aligned onto a yellowtail (Seriola quinqueradiata) radiation hybrid (RH) physical map by sequence homology. A total of 215 of the longest amberjack sequences, with a total length of 622.8?Mbp (92% of the total length of the genome scaffolds), were lined up on the yellowtail RH map. We resequenced the whole genomes of 20 greater amberjacks and mapped the resulting sequences onto the reference genome sequence. About 186,000 nonredundant SNPs were successfully ordered on the reference genome. Further, we found differences in the genome structural variations between two greater amberjack populations using BreakDancer. We also analyzed the greater amberjack transcriptome and mapped the annotated sequences onto the reference genome sequence.


September 22, 2019

Unique genetic cassettes in a Thermoanaerobacterium contribute to simultaneous conversion of cellulose and monosugars into butanol.

The demand for cellulosic biofuels is on the rise because of the anticipation for sustainable energy and less greenhouse gas emissions in the future. However, production of cellulosic biofuels, especially cellulosic butanol, has been hampered by the lack of potent microbes that are capable of converting cellulosic biomass into biofuels. We report a wild-type Thermoanaerobacterium thermosaccharolyticum strain TG57, which is capable of using microcrystalline cellulose directly to produce butanol (1.93 g/liter) as the only final product (without any acetone or ethanol produced), comparable to that of engineered microbes thus far. Strain TG57 exhibits significant advances including unique genes responsible for a new butyrate synthesis pathway, no carbon catabolite repression, and the absence of genes responsible for acetone synthesis (which is observed as the main by-product in most Clostridium strains known today). Furthermore, the use of glucose analog 2-deoxyglucose posed a selection pressure to facilitate isolation of strain TG57 with deletion/silencing of carbon catabolite repressor genes-the ccr and xylR genes-and thus is able to simultaneously ferment glucose, xylose, and arabinose to produce butanol (7.33 g/liter) as the sole solvent. Combined analysis of genomic and transcriptomic data revealed unusual aspects of genome organization, numerous determinants for unique bioconversions, regulation of central metabolic pathways, and distinct transcriptomic profiles. This study provides a genome-level understanding of how cellulose is metabolized by T. thermosaccharolyticum and sheds light on the potential of competitive and sustainable biofuel production.


September 22, 2019

Engineering of Halomonas bluephagenesis for low cost production of poly(3-hydroxybutyrate-co-4-hydroxybutyrate) from glucose.

Poly(3-hydroxybutyrate-co-4-hydroxybutyrate) [P(3HB-co-4HB)] is one of the most promising biomaterials expected to be used in a wide range of scenarios. However, its large-scale production is still hindered by the high cost. Here we report the engineering of Halomonas bluephagenesis as a low-cost platform for non-sterile and continuous fermentative production of P(3HB-co-4HB) from glucose. Two interrelated 4-hydroxybutyrate (4HB) biosynthesis pathways were constructed to guarantee 4HB monomer supply for P(3HB-co-4HB) synthesis by working in concert with 3-hydroxybutyrate (3HB) pathway. Interestingly, only 0.17?mol% 4HB in the copolymer was obtained during shake flask studies. Pathway debugging using structurally related carbon source located the failure as insufficient 4HB accumulation. Further whole genome sequencing and comparative genomic analysis identified multiple orthologs of succinate semialdehyde dehydrogenase (gabD) that may compete with 4HB synthesis flux in H. bluephagenesis. Accordingly, combinatory gene-knockout strains were constructed and characterized, through which the molar fraction of 4HB was increased by 24-fold in shake flask studies. The best-performing strain was grown on glucose as the single carbon source for 60?h under non-sterile conditions in a 7-L bioreactor, reaching 26.3?g/L of dry cell mass containing 60.5% P(3HB-co-17.04?mol%4HB). Besides, 4HB molar fraction in the copolymer can be tuned from 13?mol% to 25?mol% by controlling the residual glucose concentration in the cultures. This is the first study to achieve the production of P(3HB-co-4HB) from only glucose using Halomonas. Copyright © 2018 International Metabolic Engineering Society. Published by Elsevier Inc. All rights reserved.


September 22, 2019

Occurrence, evolution, and functions of DNA phosphorothioate epigenetics in bacteria.

The chemical diversity of physiological DNA modifications has expanded with the identification of phosphorothioate (PT) modification in which the nonbridging oxygen in the sugar-phosphate backbone of DNA is replaced by sulfur. Together with DndFGH as cognate restriction enzymes, DNA PT modification, which is catalyzed by the DndABCDE proteins, functions as a bacterial restriction-modification (R-M) system that protects cells against invading foreign DNA. However, the occurrence of dnd systems across a large number of bacterial genomes and their functions other than R-M are poorly understood. Here, a genomic survey revealed the prevalence of bacterial dnd systems: 1,349 bacterial dnd systems were observed to occur sporadically across diverse phylogenetic groups, and nearly half of these occur in the form of a solitary dndBCDE gene cluster that lacks the dndFGH restriction counterparts. A phylogenetic analysis of 734 complete PT R-M pairs revealed the coevolution of M and R components, despite the observation that several PT R-M pairs appeared to be assembled from M and R parts acquired from distantly related organisms. Concurrent epigenomic analysis, transcriptome analysis, and metabolome characterization showed that a solitary PT modification contributed to the overall cellular redox state, the loss of which perturbed the cellular redox balance and induced Pseudomonas fluorescens to reconfigure its metabolism to fend off oxidative stress. An in vitro transcriptional assay revealed altered transcriptional efficiency in the presence of PT DNA modification, implicating its function in epigenetic regulation. These data suggest the versatility of PT in addition to its involvement in R-M protection.


September 22, 2019

Reproducible integration of multiple sequencing datasets to form high-confidence SNP, indel, and reference calls for five human genome reference materials

Benchmark small variant calls from the Genome in a Bottle Consortium (GIAB) for the CEPH/HapMap genome NA12878 (HG001) have been used extensively for developing, optimizing, and demonstrating performance of sequencing and bioinformatics methods. Here, we develop a reproducible, cloud-based pipeline to integrate multiple sequencing datasets and form benchmark calls, enabling application to arbitrary human genomes. We use these reproducible methods to form high-confidence calls with respect to GRCh37 and GRCh38 for HG001 and 4 additional broadly-consented genomes from the Personal Genome Project that are available as NIST Reference Materials. These new genomes’ broad, open consent with few restrictions on availability of samples and data is enabling a uniquely diverse array of applications. Our new methods produce 17% more high-confidence SNPs, 176% more indels, and 12% larger regions than our previously published calls. To demonstrate that these calls can be used for accurate benchmarking, we compare other high-quality callsets to ours (e.g., Illumina Platinum Genomes), and we demonstrate that the majority of discordant calls are errors in the other callsets, We also highlight challenges in interpreting performance metrics when benchmarking against imperfect high-confidence calls. We show that benchmarking tools from the Global Alliance for Genomics and Health can be used with our calls to stratify performance metrics by variant type and genome context and elucidate strengths and weaknesses of a method.


September 22, 2019

Repeat-driven generation of antigenic diversity in a major human pathogen, Trypanosoma cruzi

Trypanosoma cruzi, a zoonotic kinetoplastid protozoan with a complex genome, is the causative agent of American trypanosomiasis (Chagas disease). The parasite uses a highly diverse repertoire of surface molecules, with roles in cell invasion, immune evasion and pathogenesis. Thus far, the genomic regions containing these genes have been impossible to resolve and it has been impossible to study the structure and function of the several thousand repetitive genes encoding the surface molecules of the parasite. We here present an improved genome assembly of a T. cruzi clade I (TcI) strain using high coverage PacBio single molecule sequencing, together with Illumina sequencing of 34 T. cruzi TcI isolates and clones from different geographic locations, sample sources and clinical outcomes. Resolution of the surface molecule gene structure reveals an unusual duality in the organisation of the parasite genome, a core genomic region syntenous with related protozoa flanked by unique and highly plastic subtelomeric regions encoding surface antigens. The presence of abundant interspersed retrotransposons in the subtelomeres suggests that these elements are involved in a recombination mechanism for the generation of antigenic variation and evasion of the host immune response. The comparative genomic analysis of the cohort of TcI strains revealed multiple cases of such recombination events involving surface molecule genes and has provided new insights into T. cruzi population structure.


September 22, 2019

Dynamic evolution of a-gliadin prolamin gene family in homeologous genomes of hexaploid wheat.

Wheat Gli-2 loci encode complex groups of a-gliadin prolamins that are important for breadmaking, but also major triggers of celiac disease (CD). Elucidation of a-gliadin evolution provides knowledge to produce wheat with better end-use properties and reduced immunogenic potential. The Gli-2 loci contain a large number of tandemly duplicated genes and highly repetitive DNA, making sequence assembly of their genomic regions challenging. Here, we constructed high-quality sequences spanning the three wheat homeologous a-gliadin loci by aligning PacBio-based sequence contigs with BioNano genome maps. A total of 47 a-gliadin genes were identified with only 26 encoding intact full-length protein products. Analyses of a-gliadin loci and phylogenetic tree reconstruction indicate significant duplications of a-gliadin genes in the last ~2.5 million years after the divergence of the A, B and D genomes, supporting its rapid lineage-independent expansion in different Triticeae genomes. We showed that dramatic divergence in expression of a-gliadin genes could not be attributed to sequence variations in the promoter regions. The study also provided insights into the evolution of CD epitopes and identified a single indel event in the hexaploid wheat D genome that likely resulted in the generation of the highly toxic 33-mer CD epitope.


September 22, 2019

New Delhi metallo-beta-lactamase-producing Enterobacteriaceae in South Korea between 2010 and 2015.

This study was carried out to investigate the epidemiological time-course of New Delhi metallo-beta-lactamase- (NDM-) mediated carbapenem resistance in Enterobacteriaceae in South Korea. A total of 146 non-duplicate NDM-producing Enterobacteriaceae recovered between 2010 and 2015 were voluntarily collected from 33 general hospitals and confirmed by PCR. The species were identified by sequences of the 16S rDNA. Antimicrobial susceptibility was determined either by the disk diffusion method or by broth microdilution, and the carbapenem MICs were determined by agar dilution. Then, multilocus sequence typing and PCR-based replicon typing was carried out. Co-carried genes for drug resistance were identified by PCR and sequencing. The entire genomes of eight random selected NDM producers were sequenced. A total of 69 Klebsiella pneumoniae of 12 sequence types (STs), 34 Escherichia coli of 15 STs, 28 Enterobacter spp. (including one Enterobacter aerogenes), nine Citrobacter freundii, four Raoultella spp., and two Klebsiella oxytoca isolates produced either NDM-1 (n = 126), NDM-5 (n = 18), or NDM-7 (n = 2). The isolates co-produced CTX-M-type ESBL (52.1%), AmpCs (27.4%), additional carbapenemases (7.1%), and/or 16S rRNA methyltransferases (4.8%), resulting in multidrug-resistance (47.9%) or extensively drug-resistance (52.1%). Among plasmids harboring blaNDM, IncX3 was predominant (77.4%), followed by the IncFII type (5.8%). Genome analysis revealed inter-species and inter-strain horizontal gene transfer of the plasmid. Both clonal dissemination and plasmid transfer contributed to the wide dissemination of NDM producers in South Korea.


September 22, 2019

Comparative genomic insights into endofungal lifestyles of two bacterial endosymbionts, Mycoavidus cysteinexigens and Burkholderia rhizoxinica.

Endohyphal bacteria (EHB), dwelling within fungal hyphae, markedly affect the growth and metabolic potential of their hosts. To date, two EHB belonging to the family Burkholderiaceae have been isolated and characterized as new taxa, Burkholderia rhizoxinica (HKI 454T) and Mycoavidus cysteinexigens (B1-EBT), in Japan. Metagenome sequencing was recently reported for Mortierella elongata AG77 together with its endosymbiont M. cysteinexigens (Mc-AG77) from a soil/litter sample in the USA. In the present study, we elucidated the complete genome sequence of B1-EBT and compared it with those of Mc-AG77 and HKI 454T. The genomes of B1-EBT and Mc-AG77 contained a higher level of prophage sequences and were markedly smaller than that of HKI 454T. Although the B1-EBT and Mc-AG77 genomes lacked the chitinolytic enzyme genes responsible for invasion into fungal cells, they contained several predicted toxin-antitoxin systems including an insecticidal toxin complex and PIN domain imposing an addiction-like mechanism essential for endohyphal growth control during host colonization. Despite the different host fungi, the alignment of amino acid sequences showed that the HKI 454T genome consisted of 1,265 (32.6%) and 1,221 (31.5%) orthologous coding sequences (CDSs) with those of B1-EBT and Mc-AG77, respectively. This comparative study of three phylogenetically associated endosymbionts has provided insights into their origin and evolution, and suggests the later bacterial invasion and adaptation of B1-EBT to its host metabolism.


September 22, 2019

Expansions of intronic TTTCA and TTTTA repeats in benign adult familial myoclonic epilepsy.

Epilepsy is a common neurological disorder, and mutations in genes encoding ion channels or neurotransmitter receptors are frequent causes of monogenic forms of epilepsy. Here we show that abnormal expansions of TTTCA and TTTTA repeats in intron 4 of SAMD12 cause benign adult familial myoclonic epilepsy (BAFME). Single-molecule, real-time sequencing of BAC clones and nanopore sequencing of genomic DNA identified two repeat configurations in SAMD12. Intriguingly, in two families with a clinical diagnosis of BAFME in which no repeat expansions in SAMD12 were observed, we identified similar expansions of TTTCA and TTTTA repeats in introns of TNRC6A and RAPGEF2, indicating that expansions of the same repeat motifs are involved in the pathogenesis of BAFME regardless of the genes in which the expanded repeats are located. This discovery that expansions of noncoding repeats lead to neuronal dysfunction responsible for myoclonic tremor and epilepsy extends the understanding of diseases with such repeat expansion.


September 22, 2019

Comparative genome analysis reveals a complex population structure of Legionella pneumophila subspecies.

The majority of Legionnaires’ disease (LD) cases are caused by Legionella pneumophila, a genetically heterogeneous species composed of at least 17 serogroups. Previously, it was demonstrated that L. pneumophila consists of three subspecies: pneumophila, fraseri and pascullei. During an LD outbreak investigation in 2012, we detected that representatives of both subspecies fraseri and pascullei colonized the same water system and that the outbreak-causing strain was a new member of the least represented subspecies pascullei. We used partial sequence based typing consensus patterns to mine an international database for additional representatives of fraseri and pascullei subspecies. As a result, we identified 46 sequence types (STs) belonging to subspecies fraseri and two STs belonging to subspecies pascullei. Moreover, a recent retrospective whole genome sequencing analysis of isolates from New York State LD clusters revealed the presence of a fourth L. pneumophila subspecies that we have termed raphaeli. This subspecies consists of 15 STs. Comparative analysis was conducted using the genomes of multiple members of all four L. pneumophila subspecies. Whereas each subspecies forms a distinct phylogenetic clade within the L. pneumophila species, they share more average nucleotide identity with each other than with other Legionella species. Unique genes for each subspecies were identified and could be used for rapid subspecies detection. Improved taxonomic classification of L. pneumophila strains may help identify environmental niches and virulence attributes associated with these genetically distinct subspecies. Published by Elsevier B.V.


Talk with an expert

If you have a question, need to check the status of an order, or are interested in purchasing an instrument, we're here to help.