September 22, 2019  |  

Whole-genome resequencing and pan-transcriptome reconstruction highlight the impact of genomic structural variation on secondary metabolite gene clusters in the grapevine Esca pathogen Phaeoacremonium minimum.

The Ascomycete fungus Phaeoacremonium minimum is one of the primary causal agents of Esca, a widespread and damaging grapevine trunk disease. Variation in virulence among Pm. minimum isolates has been reported, but the underlying genetic basis of the phenotypic variability remains unknown. The goal of this study was to characterize intraspecific genetic diversity and explore its potential impact on virulence functions associated with secondary metabolism, cellular transport, and cell wall decomposition. We generated a chromosome-scale genome assembly, using single molecule real-time sequencing, and resequenced the genomes and transcriptomes of multiple isolates to identify sequence and structural polymorphisms. Numerous insertion and deletion events were found, totaling about 1 Mbp in each isolate. Structural variation in this extremely gene-dense genome frequently caused presence/absence polymorphisms of multiple adjacent genes, mostly belonging to biosynthetic clusters associated with secondary metabolism. Because of the observed intraspecific diversity in gene content due to structural variation, we concluded that a transcriptome reference developed from a single isolate is insufficient to represent the virulence factor repertoire of the species. We therefore compiled a pan-transcriptome reference of Pm. minimum comprising a non-redundant set of 15,245 protein-coding sequences. Using naturally infected field samples expressing Esca symptoms, we demonstrated that mapping of meta-transcriptomics data on a multi-species reference that included the Pm. minimum pan-transcriptome allows the profiling of an expanded set of virulence factors, including variable genes associated with secondary metabolism and cellular transport.


September 22, 2019  |  

The hpRNA/RNAi pathway is essential to resolve intragenomic conflict in the Drosophila male germline.

Intragenomic conflicts are fueled by rapidly evolving selfish genetic elements, which induce selective pressures to innovate opposing repressive mechanisms. This is patently manifest in sex-ratio (SR) meiotic drive systems, in which distorter and suppressor factors bias and restore equal transmission of X and Y sperm. Here, we reveal that multiple SR suppressors in Drosophila simulans (Nmy and Tmy) encode related hairpin RNAs (hpRNAs), which generate endo-siRNAs that repress the paralogous distorters Dox and MDox. All components in this drive network are recently evolved and largely testis restricted. To connect SR hpRNA function to the RNAi pathway, we generated D. simulans null mutants of Dcr-2 and AGO2. Strikingly, these core RNAi knockouts massively derepress Dox and MDox and are in fact completely male sterile and exhibit highly defective spermatogenesis. Altogether, our data reveal how the adaptive capacity of hpRNAs is critically deployed to restrict selfish gonadal genetic systems that can exterminate a species. Copyright © 2018 Elsevier Inc. All rights reserved.


September 22, 2019  |  

Opposite polarity monospore genome de novo sequencing and comparative analysis reveal the possible heterothallic life cycle of Morchella importuna.

Morchella is a popular edible fungus worldwide due to its rich nutrition and unique flavor. Many research efforts have been made on the domestication and cultivation of Morchella all over the world. In recent years, the cultivation of Morchella was successfully commercialized in China. However, the biology is not well understood, which restricts the further development of the morel fungus cultivation industry. In this paper, we performed de novo sequencing and assembly of the genomes of two monospores with a different mating type (M04M24 and M04M26) isolated from the commercially cultivated strain M04. Gene annotation and comparative genome analysis were performed to study differences in carbohydrate-active enzyme (CAZyme) content, transcription factors, duplicated sequences, structure of mating type sites, and differences at the gene and functional levels between the two monospore strains of M. importuna. Results showed that the de novo assembled haploid M04M24 and M04M26 genomes were 48.98 and 51.07 Mb, respectively. A complete fine physical map of M. importuna was obtained from genome coverage and gene completeness evaluation. A total of 10,852 and 10,902 common genes and 667 and 868 endemic genes were identified from the two monospore strains, respectively. The Gene Ontology (GO) and KAAS (KEGG Automatic Annotation Server) enrichment analyses showed that the endemic genes performed different functions. The two monospore strains had 99.22% collinearity with each other, accompanied by certain position and rearrangement events. Analysis of complete mating-type loci revealed that the two monospore M. importuna strains contained an independent mating-type structure and remained conserved in sequence and location. The phylogeny and divergence time of M. importuna were analyzed at the whole-genome level for the first time. The bifurcation time of morel and tuber was estimated to be 201.14 million years ago (Mya); the two monospore strains with a different mating type represented the evolution of different nuclei, and the single copy homologous genes between them were also different due to a genetic differentiation distance of about 0.65 Mya. Compared with truffles, M. importuna had an expansion of 28 clusters of orthologous genes (COGs) and a contraction of two COGs. The two different polar nuclei with different degrees of contraction and expansion suggested that they might have undergone different evolutionary processes. The different mating-type structures, together with the functional clustering and enrichment analysis results of the endemic genes of the two different polar nuclei, imply that M. importuna might be a heterothallic fungus and the interaction between the endemic genes may be necessary for its complete life history. Studies on the genome of M. importuna facilitate a better understanding of morel biology and evolution.


September 22, 2019  |  

Genome analyses of the microalga Picochlorum provide insights into the evolution of thermotolerance in the green lineage.

While the molecular events involved in cell responses to heat stress have been extensively studied, our understanding of the genetic basis of basal thermotolerance, and particularly its evolution within the green lineage, remains limited. Here, we present the 13.3-Mb haploid genome and transcriptomes of a halotolerant and thermotolerant unicellular green alga, Picochlorum costavermella (Trebouxiophyceae) to investigate the evolution of the genomic basis of thermotolerance. Differential gene expression at high and standard temperatures revealed that more of the gene families containing up-regulated genes at high temperature were recently evolved, and fewer originated at the ancestor of green plants. Inversely, there was an excess of ancient gene families containing transcriptionally repressed genes. Interestingly, there is a striking overlap between the thermotolerance and halotolerance transcriptional rewiring, as more than one-third of the gene families up-regulated at 35 °C were also up-regulated under variable salt concentrations in Picochlorum SE3. Moreover, phylogenetic analysis of the 9,304 protein coding genes revealed 26 genes of horizontally transferred origin in P. costavermella, of which five were differentially expressed at higher temperature. Altogether, these results provide new insights about how the genomic basis of adaptation to halo- and thermotolerance evolved in the green lineage.


September 22, 2019  |  

The structure of a conserved telomeric region associated with variant antigen loci in the blood parasite Trypanosoma congolense

African trypanosomiasis is a vector-borne disease of humans and livestock caused by African trypanosomes (Trypanosoma spp.). Survival in the vertebrate bloodstream depends on antigenic variation of Variant Surface Glycoproteins (VSGs) coating the parasite surface. In T. brucei, a model for antigenic variation, monoallelic VSG expression originates from dedicated VSG expression sites (VES). Trypanosoma brucei VES have a conserved structure consisting of a telomeric VSG locus downstream of unique, repeat sequences, and an independent promoter. Additional protein-coding sequences, known as “Expression Site Associated Genes (ESAGs)”, are also often present and are implicated in diverse, bloodstream-stage functions. Trypanosoma congolense is a related veterinary pathogen, also displaying VSG-mediated antigenic variation. A T. congolense VES has not been described, making it unclear if regulation of VSG expression is conserved between species. Here, we describe a conserved telomeric region associated with VSG loci from long-read DNA sequencing of two T. congolense strains, which consists of a distal repeat, conserved noncoding elements and other genes besides the VSG; although these are not orthologous to T. brucei ESAGs. Most conserved telomeric regions are associated with accessory minichromosomes, but the same structure may also be associated with megabase chromosomes. We propose that this region represents the T. congolense VES, and through comparison with T. brucei, we discuss the parallel evolution of antigenic switching mechanisms, and unique adaptation of the T. brucei VES for developmental regulation of bloodstream-stage genes. Hence, we provide a basis for understanding antigenic switching in T. congolense and the origins of the African trypanosome VES.


September 22, 2019  |  

Genomic insights into host adaptation between the wheat stripe rust pathogen (Puccinia striiformis f. sp. tritici) and the barley stripe rust pathogen (Puccinia striiformis f. sp. hordei).

Plant fungal pathogens can rapidly evolve and adapt to new environmental conditions in response to sudden changes of host populations in agro-ecosystems. However, the genomic basis of their host adaptation, especially at the forma specialis level, remains unclear. We sequenced two isolates each representing Puccinia striiformis f. sp. tritici (Pst) and P. striiformis f. sp. hordei (Psh), different formae speciales of the stripe rust fungus P. striiformis highly adapted to wheat and barley, respectively. The divergence of Pst and Psh, estimated to have started 8.12 million years ago, has been driven by high nucleotide mutation rates. The high genomic variation within dikaryotic urediniospores of P. striiformis has provided raw genetic material for genome evolution. No specific gene families were enriched in either isolate, but extensive gene loss events have occurred in both Pst and Psh after the divergence from their most recent common ancestor. A large number of isolate-specific genes were identified, with unique genomic features compared to the conserved genes: they were 1) significantly shorter in length; 2) significantly less expressed; 3) significantly closer to transposable elements; and 4) redundant in pathways. The presence of specific genes in one isolate (or forma specialis) resulted from the loss of the homologues in the other isolate (or forma specialis) through replacement by transposable elements or loss of genomic fragments. In addition, different patterns and numbers of telomeric repeats were observed between the isolates. Host adaptation of P. striiformis at the forma specialis level is a complex pathogenic trait, involving not only virulence-related genes but also other genes. Gene loss, which might be adaptive and driven by transposable element activities, provides a genomic basis for host adaptation of different formae speciales of P. striiformis.


September 22, 2019  |  

Asymmetric processing of DNA ends at a double-strand break leads to unconstrained dynamics and ectopic translocation.

Multiple pathways regulate the repair of double-strand breaks (DSBs) to suppress potentially dangerous ectopic recombination. Both sequence and chromatin context are thought to influence pathway choice between non-homologous end-joining (NHEJ) and homology-driven recombination. To test the effect of repetitive sequences on break processing, we have inserted TG-rich repeats on one side of an inducible DSB at the budding yeast MAT locus on chromosome III. Five clustered Rap1 sites within a break-proximal TG repeat are sufficient to block Mre11-Rad50-Xrs2 recruitment, impair resection, and favor elongation by telomerase. The two sides of the break lose end-to-end tethering and show enhanced, uncoordinated movement. Only the TG-free side is resected and shifts to the nuclear periphery. In contrast to persistent DSBs without TG repeats that are repaired by imprecise NHEJ, nearly all survivors of repeat-proximal DSBs repair the break by a homology-driven, non-reciprocal translocation from ChrIII-R to ChrVII-L. This suppression of imprecise NHEJ at TG-repeat-flanked DSBs requires the Uls1 translocase activity. Copyright © 2018 The Authors. Published by Elsevier Inc. All rights reserved.


September 22, 2019  |  

Genome-wide analysis of Borrelia turcica and ‘Candidatus Borrelia tachyglossi’ shows relapsing fever-like genomes with unique genomic links to Lyme disease Borrelia.

Borrelia are tick-borne bacteria that in humans are the aetiological agents of Lyme disease and relapsing fever. Here we present the first genomes of B. turcica and B. tachyglossi, members of a recently described and rapidly expanding Borrelia clade associated with reptile (B. turcica) or echidna (B. tachyglossi) hosts, transmitted by hard ticks, and of unknown pathogenicity. Borrelia tachyglossi and B. turcica genomes are similar to those of relapsing fever Borrelia species, containing a linear ~900 kb chromosome, a single long (>70 kb) linear plasmid, and numerous short (<40 kb) linear and circular plasmids, as well as a suite of housekeeping and macronutrient biosynthesis genes which are not found in Lyme disease Borrelia. Additionally, both B. tachyglossi and B. turcica contain paralogous vsp and vlp proteins homologous to those used in the multiphasic antigen-switching system used by relapsing fever Borrelia to evade vertebrate immune responses, although their number was greatly reduced compared to human-infectious species. However, B. tachyglossi and B. turcica chromosomes also contain numerous genes orthologous to Lyme disease Borrelia-specific genes, demonstrating a unique evolutionary, and potentially phenotypic link between these groups. Borrelia tachyglossi and B. turcica genomes also have unique genetic features, including degraded and deleted tRNA modification genes, and an expanded range of macronutrient salvage and biosynthesis genes compared to relapsing fever and Lyme disease Borrelia. These genomes and genomic comparisons provide an insight into the biology and evolutionary origin of these Borrelia, and provide a valuable resource for future work. Copyright © 2018 Elsevier B.V. All rights reserved.


September 22, 2019  |  

Extraordinary genome instability and widespread chromosome rearrangements during vegetative growth

The haploid genome of the pathogenic fungus Zymoseptoria tritici is contained on “core” and “accessory” chromosomes. While 13 core chromosomes are found in all strains, as many as eight accessory chromosomes show presence/absence variation and rearrangements among field isolates. The factors influencing these presence/absence polymorphisms are so far unknown. We investigated chromosome stability using experimental evolution, karyotyping, and genome sequencing. We report extremely high and variable rates of accessory chromosome loss during mitotic propagation in vitro and in planta. Spontaneous chromosome loss was observed in 2 to >50% of cells during 4 weeks of incubation. Similar rates of chromosome loss in the closely related Zymoseptoria ardabiliae suggest that this extreme chromosome dynamic is a conserved phenomenon in the genus. Elevating the incubation temperature greatly increases instability of accessory and even core chromosomes, causing severe rearrangements involving telomere fusion and chromosome breakage. Chromosome losses do not affect the fitness of Zymoseptoria tritici in vitro, but some lead to increased virulence, suggesting an adaptive role of this extraordinary chromosome instability. Copyright © 2018 by the Genetics Society of America.


September 22, 2019  |  

The genome of tapeworm Taenia multiceps sheds light on understanding parasitic mechanism and control of coenurosis disease.

Coenurosis, caused by the larval coenurus of the tapeworm Taenia multiceps, is a fatal central nervous system disease in both sheep and humans. Though treatment and prevention options are available, the control of coenurosis still presents great challenges. Here, we present a high-quality genome sequence of T. multiceps in which 240 Mb (96%) of the genome has been successfully assembled using PacBio single-molecule real-time (SMRT) and Hi-C data with an N50 length of 44.8 Mb. In total, 49.5 Mb (20.6%) repeat sequences and 13,013 gene models were identified. We found that Taenia spp. have an expansion of transposable elements and recent small-scale gene duplications following the divergence of Taenia from Echinococcus, but not in Echinococcus genomes, and the genes underlying environmental adaptability and dosage effect tend to be over-retained in the T. multiceps genome. Moreover, we identified several genes encoding proteins involved in proglottid formation and interactions with the host central nervous system, which may contribute to the adaptation of T. multiceps to its parasitic lifestyle. Our study not only provides insights into the biology and evolution of T. multiceps, but also identifies a set of species-specific gene targets for developing novel treatment and control tools for coenurosis.


September 22, 2019  |  

Genomic assemblies of newly sequenced Trypanosoma cruzi strains reveal new genomic expansion and greater complexity.

Chagas disease is a complex illness caused by the protozoan Trypanosoma cruzi displaying highly diverse clinical outcomes. In this sense, the genome sequence elucidation and comparison between strains may lead to disease understanding. Here, two new T. cruzi strains have been sequenced (Y using Illumina and Bug2148 using PacBio), assembled, analyzed and compared with the T. cruzi annotated genomes available to date. The assembly statistics from the new sequences show an effective improvement of the T. cruzi genome over the existing ones, such as the largest contig assembled in de novo attempts (1.3 Mb in Bug2148) and the highest mean assembly coverage (71X for Y). Our analysis reveals a new genomic expansion and greater complexity for those multi-copy gene families related to infection process and disease development, such as Trans-sialidases, Mucins and Mucin Associated Surface Proteins, among others. On the one hand, we demonstrate that multi-copy gene families are located near telomeric regions of the “chromosome-like” 1.3 Mb contig assembled of Bug2148, where they are likely subject to high evolutionary pressure. On the other hand, we identified several strain-specific single copy genes that might help to understand the differences in infectivity and physiology among strains. In summary, our results indicate that T. cruzi has a complex genomic architecture that may have promoted its evolution.


September 22, 2019  |  

Nondestructive, base-resolution sequencing of 5-hydroxymethylcytosine using a DNA deaminase.

Here we present APOBEC-coupled epigenetic sequencing (ACE-seq), a bisulfite-free method for localizing 5-hydroxymethylcytosine (5hmC) at single-base resolution with low DNA input. The method builds on the observation that AID/APOBEC family DNA deaminase enzymes can potently discriminate between cytosine modification states and exploits the non-destructive nature of enzymatic, rather than chemical, deamination. ACE-seq yielded high-confidence 5hmC profiles with at least 1,000-fold less DNA input than conventional methods. Applying ACE-seq to generate a base-resolution map of 5hmC in tissue-derived cortical excitatory neurons, we found that 5hmC was almost entirely confined to CG dinucleotides. The whole-genome map permitted cytosine, 5-methylcytosine (5mC) and 5hmC to be parsed and revealed genomic features that diverged from global patterns, including enhancers and imprinting control regions with high and low 5hmC/5mC ratios, respectively. Enzymatic deamination overcomes many challenges posed by bisulfite-based methods, thus expanding the scope of epigenome profiling to include scarce samples and opening new lines of inquiry regarding the role of cytosine modifications in genome biology.


September 22, 2019  |  

Whole genome sequencing and microsatellite analysis of the Plasmodium falciparum E5 NF54 strain show that the var, rifin and stevor gene families follow Mendelian inheritance.

Plasmodium falciparum exhibits a high degree of inter-isolate genetic diversity in its variant surface antigen (VSA) families: P. falciparum erythrocyte membrane protein 1, repetitive interspersed family (RIFIN) and subtelomeric variable open reading frame (STEVOR). The role of recombination for the generation of this diversity is a subject of ongoing research. Here the genome of E5, a sibling of the 3D7 genome strain, is presented. Short and long read whole genome sequencing (WGS) techniques (Illumina, Pacific Biosciences) and a set of 84 microsatellites (MS) were employed to characterize the 3D7 and non-3D7 parts of the E5 genome. This is the first time that VSA genes in sibling parasites were analysed with long read sequencing technology. Of the 5733 E5 genes only 278 genes, mostly var and rifin/stevor genes, had no orthologues in the 3D7 genome. WGS and MS analysis revealed that chromosomal crossovers occurred at a rate of 0-3 per chromosome. var, stevor and rifin genes were inherited within the respective non-3D7 or 3D7 chromosomal context. 54 of the 84 MS PCR fragments correctly identified the respective MS as 3D7- or non-3D7 and this correlated with var and rifin/stevor gene inheritance in the adjacent chromosomal regions. E5 had 61 var and 189 rifin/stevor genes. One large non-chromosomal recombination event resulted in a new var gene on chromosome 14. The remainder of the E5 3D7-type subtelomeric and central regions were identical to 3D7. The data show that the rifin/stevor and var gene families represent the most diverse compartments of the P. falciparum genome but that the majority of var genes are inherited without alterations within their respective parental chromosomal context. Furthermore, MS genotyping with 54 MS can successfully distinguish between two sibling progeny of a natural P. falciparum cross and thus can be used to investigate identity by descent in field isolates.


September 22, 2019  |  

The landscape of repetitive elements in the refined genome of chilli anthracnose fungus Colletotrichum truncatum.

The ascomycete fungus Colletotrichum truncatum is a major phytopathogen with a broad host range which causes anthracnose disease of chilli. The genome sequencing of this fungus led to the discovery of functional categories of genes that may play important roles in fungal pathogenicity. However, the presence of gaps in the C. truncatum draft assembly prevented the accurate prediction of repetitive elements, which are key players in determining genome architecture and driving evolution and host adaptation. We re-sequenced its genome using single-molecule real-time (SMRT) sequencing technology to obtain a refined assembly with fewer and smaller gaps and ambiguities. This enabled us to study its genome architecture by characterising repetitive sequences such as transposable elements (TEs) and simple sequence repeats (SSRs), which constituted 4.9 and 0.38% of the assembled genome, respectively. The comparative analysis among different Colletotrichum species revealed extensive repeat-rich regions, dominated by the Gypsy superfamily of long terminal repeats (LTRs), and the differential composition of SSRs in their genomes. Our study revealed a recent burst of LTR amplification in C. truncatum, C. higginsianum, and C. scovillei. TEs in C. truncatum were significantly associated with secretome, effectors and genes in secondary metabolism clusters. Some of the TE families in C. truncatum showed cytosine to thymine transitions indicative of repeat-induced point mutation (RIP). C. orbiculare and C. graminicola showed strong signatures of RIP across their genomes and “two-speed” genomes with extensive AT-rich and gene-sparse regions. Comparative genomic analyses of Colletotrichum species provided an insight into the species-specific SSR profiles. The SSRs in the coding and non-coding regions of the genome revealed the composition of trinucleotide repeat motifs in exons with potential to alter the translated protein structure through amino acid repeats. This is the first genome-wide study of TEs and SSRs in C. truncatum and their comparative analysis with six other Colletotrichum species, which would serve as a useful resource for future research to get insights into the potential role of TEs in genome expansion and evolution of Colletotrichum fungi and for development of SSR-based molecular markers for population genomic studies.


September 22, 2019  |  

How long are long tandem repeats? A challenge for current methods of whole-genome sequence assembly: The case of satellites in Caenorhabditis elegans.

Repetitive genome regions have been difficult to sequence, mainly because of the comparatively small size of the fragments used in assembly. Satellites or tandem repeats are very abundant in nematodes and offer an excellent playground to evaluate different assembly methods. Here, we compare the structure of satellites found in three different assemblies of the Caenorhabditis elegans genome: the original sequence obtained by Sanger sequencing, an assembly based on PacBio technology, and an assembly using Nanopore sequencing reads. In general, satellites were found in equivalent genomic regions, but the new long-read methods (PacBio and Nanopore) tended to result in longer assembled satellites. Important differences exist between the assemblies resulting from the two long-read technologies, such as the sizes of long satellites. Our results also suggest that the lengths of some annotated genes with internal repeats which were assembled using Sanger sequencing are likely to be incorrect.

