Menu
September 22, 2019  |  

Genomic assemblies of newly sequenced Trypanosoma cruzi strains reveal new genomic expansion and greater complexity.

Chagas disease is a complex illness caused by the protozoan Trypanosoma cruzi displaying highly diverse clinical outcomes. In this sense, the genome sequence elucidation and comparison between strains may lead to disease understanding. Here, two new T. cruzi strains, have been sequenced, Y using Illumina and Bug2148 using PacBio, assembled, analyzed and compared with the T. cruzi annotated genomes available to date. The assembly stats from the new sequences show effective improvement of T. cruzi genome over the actual ones. Such as, the largest contig assembled (1.3?Mb in Bug2148) in de novo attempts and the highest mean assembly coverage (71X for Y). Our analysis reveals a new genomic expansion and greater complexity for those multi-copy gene families related to infection process and disease development, such as Trans-sialidases, Mucins and Mucin Associated Surface Proteins, among others. On one side, we demonstrate that multi-copy gene families are located near telomeric regions of the “chromosome-like” 1.3?Mb contig assembled of Bug2148, where they likely suffer high evolutive pressure. On the other hand, we identified several strain-specific single copy genes that might help to understand the differences in infectivity and physiology among strains. In summary, our results indicate that T. cruzi has a complex genomic architecture that may have promoted its evolution.


September 22, 2019  |  

Phosphagen kinase function in flagellated spores of the oomycete Phytophthora infestans integrates transcriptional regulation, metabolic dynamics and protein retargeting.

Flagellated spores play important roles in the infection of plants and animals by many eukaryotic microbes. The oomycete Phytophthora infestans, which causes potato blight, expresses two phosphagen kinases (PKs). These enzymes store energy in taurocyamine, and are hypothesized to resolve spatial and temporal imbalances between rates of ATP creation and use in zoospores. A dimeric PK is found at low levels in vegetative mycelia, but high levels in ungerminated sporangia and zoospores. In contrast, a monomeric PK protein is at similar levels in all tissues, although is transcribed primarily in mycelia. Subcellular localization studies indicate that the monomeric PK is mitochondrial. In contrast, the dimeric PK is cytoplasmic in mycelia and sporangia but is retargeted to flagellar axonemes during zoosporogenesis. This supports a model in which PKs shuttle energy from mitochondria to and through flagella. Metabolite analysis indicates that deployment of the flagellar PK is coordinated with a large increase in taurocyamine, synthesized by sporulation-induced enzymes that were lost during the evolution of zoospore-lacking oomycetes. Thus, PK function is enabled by coordination of the transcriptional, metabolic and protein targeting machinery during the life cycle. Since plants lack PKs, the enzymes may be useful targets for inhibitors of oomycete plant pathogens.© 2018 John Wiley & Sons Ltd.


September 22, 2019  |  

Characterization of the antimonite- and arsenite-oxidizing bacterium Bosea sp. AS-1 and its potential application in arsenic removal.

Arsenic (As) and antinomy (Sb) usually coexist in natural environments where both of them pollute soils and water. Microorganisms that oxidize arsenite [As(III)] and tolerate Sb have great potential in As and Sb bioremediation, In this study, a Gram-negative bacterial strain, Bosea sp. AS-1, was isolated from a mine slag sample collected in Xikuangshan Sb mine in China. AS-1 could tolerate 120?mM of As(III) and 50?mM of antimonite [Sb(III)]. It could also oxidize 2?mM of As(III) or Sb(III) completely under heterotrophic and aerobic conditions. Interestingly, strain AS-1 preferred to oxidize As(III) with yeast extract as the carbon source, whereas Sb(III) oxidation was favored with lactate in the medium. Genomic analysis of AS-1 confirmed the presence of several gene islands for As resistance and oxidation. Notably, a system of AS-1 and goethite was found to be able to remove 99% of the As with the initial concentration of 500?µg/L As(III) and 500?µg/L Sb(III), which suggests the potential of this approach for As removal in environments especially with the presence of high Sb. Copyright © 2018 Elsevier B.V. All rights reserved.


September 22, 2019  |  

A homeobox gene, BarH-1, underlies a female alternative life-history strategy

Colias butterflies (the “clouded sulphurs”) often occur in mixed populations where females exhibit two color morphs, yellow/orange or white. White females, known as the Alba morph, reallocate resources from the synthesis of costly colored pigments to reproductive and somatic development 1. Due to this tradeoff Alba females develop faster and have higher fecundity than orange females 2. However orange females, that have instead invested in pigments, are preferred by males who in turn provide a nutrient rich spermatophore during mating 2,3,4. Thus the wing color morphs represent alternative life history strategies (ALHS) that are female-limited, wherein tradeoffs, due to divergent resource investment, result in distinct phenotypes with associated fitness consequences. Here we map the genetic basis of Alba in Colias crocea to a transposable element insertion downstream of the Colias homolog of BarH-1. To investigate the phenotypic effects of this insertion we use CRISPR/Cas9 to validate BarH-1’s functional role in the wing color switch and antibody staining to confirm expression differences in the scale building cells of pupal wings. We then use scanning electron microscopy to determine that BarH-1 expression in the wings causes a reduction in pigment granules within wing scales, and thereby gives rise to the white color. Finally, lipid and transcriptome analyses reveal additional physiological differences that arise due to Alba, suggesting pleiotropic effects beyond wing color. Together these findings provide the first well documented mechanism for a female ALHS and support an alternative view of color polymorphism as indicative of pleiotropic effects with life history consequences.


September 22, 2019  |  

Comparative genomics of Czech vaccine strains of Bordetella pertussis.

Bordetella pertussis is a strictly human pathogen causing the respiratory infectious disease called whooping cough or pertussis. B. pertussis adaptation to acellular pertussis vaccine pressure has been repeatedly highlighted, but recent data indicate that adaptation of circulating strains started already in the era of the whole cell pertussis vaccine (wP) use. We sequenced the genomes of five B. pertussis wP vaccine strains isolated in the former Czechoslovakia in the pre-wP (1954-1957) and early wP (1958-1965) eras, when only limited population travel into and out of the country was possible. Four isolates exhibit a similar genome organization and form a distinct phylogenetic cluster with a geographic signature. The fifth strain is rather distinct, both in genome organization and SNP-based phylogeny. Surprisingly, despite isolation of this strain before 1966, its closest sequenced relative appears to be a recent isolate from the US. On the genome content level, the five vaccine strains contained both new and already described regions of difference. One of the new regions contains duplicated genes potentially associated with transport across the membrane. The prevalence of this region in recent isolates indicates that its spread might be associated with selective advantage leading to increased strain fitness.


September 22, 2019  |  

Assembling the genome of the African wild rice Oryza longistaminata by exploiting synteny in closely related Oryza species.

The African wild rice species Oryza longistaminata has several beneficial traits compared to cultivated rice species, such as resistance to biotic stresses, clonal propagation via rhizomes, and increased biomass production. To facilitate breeding efforts and functional genomics studies, we de-novo assembled a high-quality, haploid-phased genome. Here, we present our assembly, with a total length of 351?Mb, of which 92.2% was anchored onto 12 chromosomes. We detected 34,389 genes and 38.1% of the genome consisted of repetitive content. We validated our assembly by a comparative linkage analysis and by examining well-characterized gene families. This genome assembly will be a useful resource to exploit beneficial alleles found in O. longistaminata. Our results also show that it is possible to generate a high-quality, functionally complete rice genome assembly from moderate SMRT read coverage by exploiting synteny in a closely related Oryza species.


September 22, 2019  |  

Recurrent loss of HMGCS2 shows that ketogenesis is not essential for the evolution of large mammalian brains.

Apart from glucose, fatty acid-derived ketone bodies provide metabolic energy for the brain during fasting and neonatal development. We investigated the evolution of HMGCS2, the key enzyme required for ketone body biosynthesis (ketogenesis). Unexpectedly, we found that three mammalian lineages, comprising cetaceans (dolphins and whales), elephants and mastodons, and Old World fruit bats have lost this gene. Remarkably, many of these species have exceptionally large brains and signs of intelligent behavior. While fruit bats are sensitive to starvation, cetaceans and elephants can still withstand periods of fasting. This suggests that alternative strategies to fuel large brains during fasting evolved repeatedly and reveals flexibility in mammalian energy metabolism. Furthermore, we show that HMGCS2 loss preceded brain size expansion in toothed whales and elephants. Thus, while ketogenesis was likely important for brain size expansion in modern humans, ketogenesis is not a universal precondition for the evolution of large mammalian brains.© 2018, Jebb et al.


September 22, 2019  |  

The landscape of repetitive elements in the refined genome of chilli anthracnose fungus Colletotrichum truncatum.

The ascomycete fungus Colletotrichum truncatum is a major phytopathogen with a broad host range which causes anthracnose disease of chilli. The genome sequencing of this fungus led to the discovery of functional categories of genes that may play important roles in fungal pathogenicity. However, the presence of gaps in C. truncatum draft assembly prevented the accurate prediction of repetitive elements, which are the key players to determine the genome architecture and drive evolution and host adaptation. We re-sequenced its genome using single-molecule real-time (SMRT) sequencing technology to obtain a refined assembly with lesser and smaller gaps and ambiguities. This enabled us to study its genome architecture by characterising the repetitive sequences like transposable elements (TEs) and simple sequence repeats (SSRs), which constituted 4.9 and 0.38% of the assembled genome, respectively. The comparative analysis among different Colletotrichum species revealed the extensive repeat rich regions, dominated by Gypsy superfamily of long terminal repeats (LTRs), and the differential composition of SSRs in their genomes. Our study revealed a recent burst of LTR amplification in C. truncatum, C. higginsianum, and C. scovillei. TEs in C. truncatum were significantly associated with secretome, effectors and genes in secondary metabolism clusters. Some of the TE families in C. truncatum showed cytosine to thymine transitions indicative of repeat-induced point mutation (RIP). C. orbiculare and C. graminicola showed strong signatures of RIP across their genomes and “two-speed” genomes with extensive AT-rich and gene-sparse regions. Comparative genomic analyses of Colletotrichum species provided an insight into the species-specific SSR profiles. The SSRs in the coding and non-coding regions of the genome revealed the composition of trinucleotide repeat motifs in exons with potential to alter the translated protein structure through amino acid repeats. This is the first genome-wide study of TEs and SSRs in C. truncatum and their comparative analysis with six other Colletotrichum species, which would serve as a useful resource for future research to get insights into the potential role of TEs in genome expansion and evolution of Colletotrichum fungi and for development of SSR-based molecular markers for population genomic studies.


September 22, 2019  |  

Bacillus wiedmannii biovar thuringiensis: A specialized mosquitocidal pathogen with plasmids from diverse origins.

Bacillus cereus sensu lato also known as B. cereus group is composed of an ecologically diverse bacterial group with an increasing number of related species, some of which are medically or agriculturally important. Numerous e?orts have been undertaken to allow presumptive di?erentiation of B. cereus group species from one another. FCC41 is a Bacillus sp. strain toxic against mosquito species like Aedes aegypti, Aedes (Ochlerotatus) albifasciatus, Culex pipiens, Culex quinquefasciatus, and Culex apicinus, some of them responsible for the transmission of vector-borne diseases. Here, we report the complete genome sequence of FCC41 strain, which consists of one circular chromosome and eight circular plasmids ranging in size from 8 to 490?kb. This strain harbors six crystal protein genes, including cry24Ca, two cry4-like and two cry52-like, a cry41-like parasporin gene and multiple virulence factors. The phylogenetic analysis of the whole-genome sequence of this strain with molecular approaches places this strain into the Bacillus wiedmannii cluster. However, according with phenotypical characteristics such as the mosquitocidal activity due to the presence of Cry proteins found in the parasporal body and cry genes encoded in plasmids of different sizes, indicate that this strain could be renamed as B. wiedmannii biovar thuringiensis strain FCC41.


September 22, 2019  |  

Discovery of mcr-1-mediated colistin resistance in a highly virulent Escherichia coli lineage.

Resistance to last-line polymyxins mediated by the plasmid-borne mobile colistin resistance gene (mcr-1) represents a new threat to global human health. Here we present the complete genome sequence of an mcr-1-positive multidrug-resistant Escherichia coli strain (MS8345). We show that MS8345 belongs to serotype O2:K1:H4, has a large 241,164-bp IncHI2 plasmid that carries 15 other antibiotic resistance genes (including the extended-spectrum ß-lactamase blaCTX-M-1) and 3 putative multidrug efflux systems, and contains 14 chromosomally encoded antibiotic resistance genes. MS8345 also carries a large ColV-like virulence plasmid that has been associated with E. coli bacteremia. Whole-genome phylogeny revealed that MS8345 clusters within a discrete clade in the sequence type 95 (ST95) lineage, and MS8345 is very closely related to the highly virulent O45:K1:H4 clone associated with neonatal meningitis. Overall, the acquisition of a plasmid carrying resistance to colistin and multiple other antibiotics in this virulent E. coli lineage is concerning and might herald an era where the empirical treatment of ST95 infections becomes increasingly more difficult.IMPORTANCEEscherichia coli ST95 is a globally disseminated clone frequently associated with bloodstream infections and neonatal meningitis. However, the ST95 lineage is defined by low levels of drug resistance amongst clinical isolates, which normally provides for uncomplicated treatment options. Here, we provide the first detailed genomic analysis of an E. coli ST95 isolate that has both high virulence potential and resistance to multiple antibiotics. Using the genome, we predicted its virulence and antibiotic resistance mechanisms, which include resistance to last-line antibiotics mediated by the plasmid-borne mcr-1 gene. Finding an ST95 isolate resistant to nearly all antibiotics that also has a high virulence potential is of major clinical importance and underscores the need to monitor new and emerging trends in antibiotic resistance development in this important global lineage. Copyright © 2018 Forde et al.


September 22, 2019  |  

An introduced crop plant is driving diversification of the virulent bacterial pathogen Erwinia tracheiphila.

Erwinia tracheiphila is the causal agent of bacterial wilt of cucurbits, an economically important phytopathogen affecting an economically important phytopathogen affecting few cultivated Cucurbitaceae few cultivated Cucurbitaceae host plant species in temperate eastern North America. However, essentially nothing is known about E. tracheiphila population structure or genetic diversity. To address this shortcoming, a representative collection of 88 E. tracheiphila isolates was gathered from throughout its geographic range, and their genomes were sequenced. Phylogenomic analysis revealed three genetic clusters with distinct hrpT3SS virulence gene repertoires, host plant association patterns, and geographic distributions. Low genetic heterogeneity within each cluster suggests a recent population bottleneck followed by population expansion. We showed that in the field and greenhouse, cucumber (Cucumis sativus), which was introduced to North America by early Spanish conquistadors, is the most susceptible host plant species and the only species susceptible to isolates from all three lineages. The establishment of large agricultural populations of highly susceptible C. sativus in temperate eastern North America may have facilitated the original emergence of E. tracheiphila into cucurbit agroecosystems, and this introduced plant species may now be acting as a highly susceptible reservoir host. Our findings have broad implications for agricultural sustainability by drawing attention to how worldwide crop plant movement, agricultural intensification, and locally unique environments may affect the emergence, evolution, and epidemic persistence of virulent microbial pathogens.IMPORTANCEErwinia tracheiphila is a virulent phytopathogen that infects two genera of cucurbit crop plants, Cucurbita spp. (pumpkin and squash) and Cucumis spp. (muskmelon and cucumber). One of the unusual ecological traits of this pathogen is that it is limited to temperate eastern North America. Here, we complete the first large-scale sequencing of an E. tracheiphila isolate collection. From phylogenomic, comparative genomic, and empirical analyses, we find that introduced Cucumis spp. crop plants are driving the diversification of E. tracheiphila into multiple lineages. Together, the results from this study show that locally unique biotic (plant population) and abiotic (climate) conditions can drive the evolutionary trajectories of locally endemic pathogens in unexpected ways. Copyright © 2018 Shapiro et al.


September 22, 2019  |  

Comparative analysis of blaKPC-2- and rmtB-carrying IncFII-family pKPC-LK30/pHN7A8 hybrid plasmids from Klebsiella pneumoniae CG258 strains disseminated among multiple Chinese hospitals.

We recently reported the complete sequence of a blaKPC-2- and rmtB-carrying IncFII-family plasmid p675920-1 with the pKPC-LK30/pHN7A8 hybrid structure. Comparative genomics of additional sequenced plasmids with similar hybrid structures and their prevalence in blaKPC-carrying Klebsiella pneumoniae strains from China were investigated in this follow-up study.A total of 51 blaKPC-carrying K. pneumoniae strains were isolated from 2012 to 2016 from five Chinese hospitals and genotyped by multilocus sequence typing. The blaKPC-carrying plasmids from four representative strains were sequenced and compared with p675920-1 and pCT-KPC. Plasmid transfer, carbapenemase activity determination, and bacterial antimicrobial susceptibility test were performed to characterize resistance phenotypes mediated by these plasmids. The prevalence of pCT-KPC-like plasmids in these blaKPC-carrying K. pneumoniae strains was screened by PCR.The six KPC-encoding plasmids p1068-KPC, p20049-KPC, p12139-KPC and p64917-KPC (sequenced in this study) and p675920-1 and pCT-KPC slightly differed from one another due to deletion and acquisition of various backbone and accessory regions. Two major accessory resistance regions, which included the blaKPC-2 region harboring blaKPC-2 (carbapenem resistance) and blaSHV-12 (ß-lactam resistance), and the MDR region carrying rmtB (aminoglycoside resistance), fosA3 (fosfomycin resistance), blaTEM-1B (ß-lactam resistance) and blaCTX-M-65 (ß-lactam resistance), were found in each of these six plasmids and exhibited several parallel evolution routes. The pCT-KPC-like plasmids were present in all the 51 K. pneumoniae isolates, all of which belonged to CG258.There was clonal dissemination of K. pneumoniae CG258 strains, harboring blaKPC-2- and rmtB-carrying IncFII-family pKPC-LK30/pHN7A8 hybrid plasmids, among multiple Chinese hospitals.


September 22, 2019  |  

The genomic basis of color pattern polymorphism in the Harlequin ladybird.

Many animal species comprise discrete phenotypic forms. A common example in natural populations of insects is the occurrence of different color patterns, which has motivated a rich body of ecological and genetic research [1-6]. The occurrence of dark, i.e., melanic, forms displaying discrete color patterns is found across multiple taxa, but the underlying genomic basis remains poorly characterized. In numerous ladybird species (Coccinellidae), the spatial arrangement of black and red patches on adult elytra varies wildly within species, forming strikingly different complex color patterns [7, 8]. In the harlequin ladybird, Harmonia axyridis, more than 200 distinct color forms have been described, which classic genetic studies suggest result from allelic variation at a single, unknown, locus [9, 10]. Here, we combined whole-genome sequencing, population-based genome-wide association studies, gene expression, and functional analyses to establish that the transcription factor Pannier controls melanic pattern polymorphism in H. axyridis. We show that pannier is necessary for the formation of melanic elements on the elytra. Allelic variation in pannier leads to protein expression in distinct domains on the elytra and thus determines the distinct color patterns in H. axyridis. Recombination between pannier alleles may be reduced by a highly divergent sequence of ~170 kb in the cis-regulatory regions of pannier, with a 50 kb inversion between color forms. This most likely helps maintain the distinct alleles found in natural populations. Thus, we propose that highly variable discrete color forms can arise in natural populations through cis-regulatory allelic variation of a single gene. Copyright © 2018 The Authors. Published by Elsevier Ltd.. All rights reserved.


September 22, 2019  |  

Repeat elements organise 3D genome structure and mediate transcription in the filamentous fungus Epichloë festucae.

Structural features of genomes, including the three-dimensional arrangement of DNA in the nucleus, are increasingly seen as key contributors to the regulation of gene expression. However, studies on how genome structure and nuclear organisation influence transcription have so far been limited to a handful of model species. This narrow focus limits our ability to draw general conclusions about the ways in which three-dimensional structures are encoded, and to integrate information from three-dimensional data to address a broader gamut of biological questions. Here, we generate a complete and gapless genome sequence for the filamentous fungus, Epichloë festucae. We use Hi-C data to examine the three-dimensional organisation of the genome, and RNA-seq data to investigate how Epichloë genome structure contributes to the suite of transcriptional changes needed to maintain symbiotic relationships with the grass host. Our results reveal a genome in which very repeat-rich blocks of DNA with discrete boundaries are interspersed by gene-rich sequences that are almost repeat-free. In contrast to other species reported to date, the three-dimensional structure of the genome is anchored by these repeat blocks, which act to isolate transcription in neighbouring gene-rich regions. Genes that are differentially expressed in planta are enriched near the boundaries of these repeat-rich blocks, suggesting that their three-dimensional orientation partly encodes and regulates the symbiotic relationship formed by this organism.


September 22, 2019  |  

SKA: Split Kmer Analysis Toolkit for Bacterial Genomic Epidemiology

Genome sequencing is revolutionising infectious disease epidemiology, providing a huge step forward in sensitivity and specificity over more traditional molecular typing techniques. However, the complexity of genome data often means that its analysis and interpretation requires high-performance compute infrastructure and dedicated bioinformatics support. Furthermore, current methods have limitations that can differ between analyses and are often opaque to the user, and their reliance on multiple external dependencies makes reproducibility difficult. Here I introduce SKA, a toolkit for analysis of genome sequence data from closely-related, small, haploid genomes. SKA uses split kmers to rapidly identify variation between genome sequences, making it possible to analyse hundreds of genomes on a standard home computer. Tests on publicly available simulated and real-life data show that SKA is both faster and more efficient than the gold standard methods used today while retaining similar levels of accuracy for epidemiological purposes. SKA can take raw read data or genome assemblies as input and calculate pairwise distances, create single linkage clusters and align genomes to a reference genome or using a reference-free approach. SKA requires few decisions to be made by the user, which, along with its computational efficiency, allows genome analysis to become accessible to those with only basic bioinformatics training. The limitations of SKA are also far more transparent than for current approaches, and future improvements to mitigate these limitations are possible. Overall, SKA is a powerful addition to the armoury of the genomic epidemiologist. SKA source code is available from Github (https://github.com/simonrharris/SKA).


Talk with an expert

If you have a question, need to check the status of an order, or are interested in purchasing an instrument, we're here to help.