Menu
September 22, 2019

Genomic assemblies of newly sequenced Trypanosoma cruzi strains reveal new genomic expansion and greater complexity.

Chagas disease is a complex illness caused by the protozoan Trypanosoma cruzi displaying highly diverse clinical outcomes. In this sense, the genome sequence elucidation and comparison between strains may lead to disease understanding. Here, two new T. cruzi strains, have been sequenced, Y using Illumina and Bug2148 using PacBio, assembled, analyzed and compared with the T. cruzi annotated genomes available to date. The assembly stats from the new sequences show effective improvement of T. cruzi genome over the actual ones. Such as, the largest contig assembled (1.3?Mb in Bug2148) in de novo attempts and the highest mean assembly coverage (71X for Y). Our analysis reveals a new genomic expansion and greater complexity for those multi-copy gene families related to infection process and disease development, such as Trans-sialidases, Mucins and Mucin Associated Surface Proteins, among others. On one side, we demonstrate that multi-copy gene families are located near telomeric regions of the “chromosome-like” 1.3?Mb contig assembled of Bug2148, where they likely suffer high evolutive pressure. On the other hand, we identified several strain-specific single copy genes that might help to understand the differences in infectivity and physiology among strains. In summary, our results indicate that T. cruzi has a complex genomic architecture that may have promoted its evolution.


September 22, 2019

Antiviral adaptive immunity and tolerance in the mosquito Aedes aegyti

Mosquitoes spread pathogenic arboviruses while themselves tolerate infection. We here characterize an immunity pathway providing long-term antiviral protection and define how this pathway discriminates between self and non-self. Mosquitoes use viral RNAs to create viral derived cDNAs (vDNAs) central to the antiviral response. vDNA molecules are acquired through a process of reverse-transcription and recombination directed by endogenous retrotransposons. These vDNAs are thought to integrate in the host genome as endogenous viral elements (EVEs). Sequencing of pre-integrated vDNA revealed that the acquisition process exquisitely distinguishes viral from host RNA, providing one layer of self-nonself discrimination. Importantly, we show EVE-derived piRNAs have antiviral activity and are loaded onto Piwi4 to inhibit virus replication. In a second layer of self-non-self discrimination, Piwi4 preferentially loads EVE-derived piRNAs, discriminating against transposon-targeting piRNAs. Our findings define a fundamental virus-specific immunity pathway in mosquitoes that uses EVEs as a potent and specific antiviral transgenerational mechanism.


September 22, 2019

Nondestructive, base-resolution sequencing of 5-hydroxymethylcytosine using a DNA deaminase.

Here we present APOBEC-coupled epigenetic sequencing (ACE-seq), a bisulfite-free method for localizing 5-hydroxymethylcytosine (5hmC) at single-base resolution with low DNA input. The method builds on the observation that AID/APOBEC family DNA deaminase enzymes can potently discriminate between cytosine modification states and exploits the non-destructive nature of enzymatic, rather than chemical, deamination. ACE-seq yielded high-confidence 5hmC profiles with at least 1,000-fold less DNA input than conventional methods. Applying ACE-seq to generate a base-resolution map of 5hmC in tissue-derived cortical excitatory neurons, we found that 5hmC was almost entirely confined to CG dinucleotides. The whole-genome map permitted cytosine, 5-methylcytosine (5mC) and 5hmC to be parsed and revealed genomic features that diverged from global patterns, including enhancers and imprinting control regions with high and low 5hmC/5mC ratios, respectively. Enzymatic deamination overcomes many challenges posed by bisulfite-based methods, thus expanding the scope of epigenome profiling to include scarce samples and opening new lines of inquiry regarding the role of cytosine modifications in genome biology.


September 22, 2019

Whole genome sequencing and microsatellite analysis of the Plasmodium falciparum E5 NF54 strain show that the var, rifin and stevor gene families follow Mendelian inheritance.

Plasmodium falciparum exhibits a high degree of inter-isolate genetic diversity in its variant surface antigen (VSA) families: P. falciparum erythrocyte membrane protein 1, repetitive interspersed family (RIFIN) and subtelomeric variable open reading frame (STEVOR). The role of recombination for the generation of this diversity is a subject of ongoing research. Here the genome of E5, a sibling of the 3D7 genome strain is presented. Short and long read whole genome sequencing (WGS) techniques (Ilumina, Pacific Bioscience) and a set of 84 microsatellites (MS) were employed to characterize the 3D7 and non-3D7 parts of the E5 genome. This is the first time that VSA genes in sibling parasites were analysed with long read sequencing technology.Of the 5733 E5 genes only 278 genes, mostly var and rifin/stevor genes, had no orthologues in the 3D7 genome. WGS and MS analysis revealed that chromosomal crossovers occurred at a rate of 0-3 per chromosome. var, stevor and rifin genes were inherited within the respective non-3D7 or 3D7 chromosomal context. 54 of the 84 MS PCR fragments correctly identified the respective MS as 3D7- or non-3D7 and this correlated with var and rifin/stevor gene inheritance in the adjacent chromosomal regions. E5 had 61 var and 189 rifin/stevor genes. One large non-chromosomal recombination event resulted in a new var gene on chromosome 14. The remainder of the E5 3D7-type subtelomeric and central regions were identical to 3D7.The data show that the rifin/stevor and var gene families represent the most diverse compartments of the P. falciparum genome but that the majority of var genes are inherited without alterations within their respective parental chromosomal context. Furthermore, MS genotyping with 54 MS can successfully distinguish between two sibling progeny of a natural P. falciparum cross and thus can be used to investigate identity by descent in field isolates.


September 22, 2019

The landscape of repetitive elements in the refined genome of chilli anthracnose fungus Colletotrichum truncatum.

The ascomycete fungus Colletotrichum truncatum is a major phytopathogen with a broad host range which causes anthracnose disease of chilli. The genome sequencing of this fungus led to the discovery of functional categories of genes that may play important roles in fungal pathogenicity. However, the presence of gaps in C. truncatum draft assembly prevented the accurate prediction of repetitive elements, which are the key players to determine the genome architecture and drive evolution and host adaptation. We re-sequenced its genome using single-molecule real-time (SMRT) sequencing technology to obtain a refined assembly with lesser and smaller gaps and ambiguities. This enabled us to study its genome architecture by characterising the repetitive sequences like transposable elements (TEs) and simple sequence repeats (SSRs), which constituted 4.9 and 0.38% of the assembled genome, respectively. The comparative analysis among different Colletotrichum species revealed the extensive repeat rich regions, dominated by Gypsy superfamily of long terminal repeats (LTRs), and the differential composition of SSRs in their genomes. Our study revealed a recent burst of LTR amplification in C. truncatum, C. higginsianum, and C. scovillei. TEs in C. truncatum were significantly associated with secretome, effectors and genes in secondary metabolism clusters. Some of the TE families in C. truncatum showed cytosine to thymine transitions indicative of repeat-induced point mutation (RIP). C. orbiculare and C. graminicola showed strong signatures of RIP across their genomes and “two-speed” genomes with extensive AT-rich and gene-sparse regions. Comparative genomic analyses of Colletotrichum species provided an insight into the species-specific SSR profiles. The SSRs in the coding and non-coding regions of the genome revealed the composition of trinucleotide repeat motifs in exons with potential to alter the translated protein structure through amino acid repeats. This is the first genome-wide study of TEs and SSRs in C. truncatum and their comparative analysis with six other Colletotrichum species, which would serve as a useful resource for future research to get insights into the potential role of TEs in genome expansion and evolution of Colletotrichum fungi and for development of SSR-based molecular markers for population genomic studies.


September 22, 2019

Type II restriction modification system in Ureaplasma parvum OMC-P162 strain.

Ureaplasma parvum serovar 3 strain, OMC-P162, was isolated from the human placenta of a preterm delivery at 26 weeks’ gestation. In this study, we sequenced the complete genome of OMC-P162 and compared it with other serovar 3 strains isolated from patients with different clinical conditions. Ten unique genes in OMC-P162, five of which encoded for hypothetical proteins, were identified. Of these, genes UPV_229 and UPV_230 formed an operon whose open reading frames were predicted to code for a DNA methyltransferase and a hypothetical protein, respectively. DNA modification analysis of the OMC-P162 genome identified N4-methylcytosine (m4C) and N6-methyladenine (m6A), but not 5-methylocytosine (m5C). UPV230 recombinant protein displayed endonuclease activity and recognized the CATG sequence, resulting in a blunt cut between A and T. This restriction enzyme activity was identical to that of the cultivated OMC-P162 strain, suggesting that this restriction enzyme was naturally expressed in OMC-P162. We designated this enzyme as UpaP162. Treatment of pT7Blue plasmid with recombinant protein UPV229 completely blocked UpaP162 restriction enzyme activity. These results suggest that the UPV_229 and UPV_230 genes act as a type II restriction-modification system in Ureaplasma OMC-P162.


September 22, 2019

Repeat elements organise 3D genome structure and mediate transcription in the filamentous fungus Epichloë festucae.

Structural features of genomes, including the three-dimensional arrangement of DNA in the nucleus, are increasingly seen as key contributors to the regulation of gene expression. However, studies on how genome structure and nuclear organisation influence transcription have so far been limited to a handful of model species. This narrow focus limits our ability to draw general conclusions about the ways in which three-dimensional structures are encoded, and to integrate information from three-dimensional data to address a broader gamut of biological questions. Here, we generate a complete and gapless genome sequence for the filamentous fungus, Epichloë festucae. We use Hi-C data to examine the three-dimensional organisation of the genome, and RNA-seq data to investigate how Epichloë genome structure contributes to the suite of transcriptional changes needed to maintain symbiotic relationships with the grass host. Our results reveal a genome in which very repeat-rich blocks of DNA with discrete boundaries are interspersed by gene-rich sequences that are almost repeat-free. In contrast to other species reported to date, the three-dimensional structure of the genome is anchored by these repeat blocks, which act to isolate transcription in neighbouring gene-rich regions. Genes that are differentially expressed in planta are enriched near the boundaries of these repeat-rich blocks, suggesting that their three-dimensional orientation partly encodes and regulates the symbiotic relationship formed by this organism.


September 22, 2019

Constant conflict between Gypsy LTR retrotransposons and CHH methylation within a stress-adapted mangrove genome.

The evolutionary dynamics of the conflict between transposable elements (TEs) and their host genome remain elusive. This conflict will be intense in stress-adapted plants as stress can often reactivate TEs. Mangroves reduce TE load convergently in their adaptation to intertidal environments and thus provide a unique opportunity to address the host-TE conflict and its interaction with stress adaptation. Using the mangrove Rhizophora apiculata as a model, we investigated methylation and short interfering RNA (siRNA) targeting patterns in relation to the abundance and age of long terminal repeat (LTR) retrotransposons. We also examined the distance of LTR retrotransposons to genes, the impact on neighboring gene expression and population frequencies. We found differential accumulation amongst classes of LTR retrotransposons despite high overall methylation levels. This can be attributed to 24-nucleotide siRNA-mediated CHH methylation preferentially targeting Gypsy elements, particularly in their LTR regions. Old Gypsy elements possess unusually abundant siRNAs which show cross-mapping to young copies. Gypsy elements appear to be closer to genes and under stronger purifying selection than other classes. Our results suggest a continuous host-TE battle masked by the TE load reduction in R. apiculata. This conflict may enable mangroves, such as R. apiculata, to maintain genetic diversity and thus evolutionary potential during stress adaptation.© 2018 The Authors. New Phytologist © 2018 New Phytologist Trust.


September 22, 2019

Computational tools to unmask transposable elements.

A substantial proportion of the genome of many species is derived from transposable elements (TEs). Moreover, through various self-copying mechanisms, TEs continue to proliferate in the genomes of most species. TEs have contributed numerous regulatory, transcript and protein innovations and have also been linked to disease. However, notwithstanding their demonstrated impact, many genomic studies still exclude them because their repetitive nature results in various analytical complexities. Fortunately, a growing array of methods and software tools are being developed to cater for them. This Review presents a summary of computational resources for TEs and highlights some of the challenges and remaining gaps to perform comprehensive genomic analyses that do not simply ‘mask’ repeats.


September 22, 2019

Genomic characterization reveals significant divergence within Chlorella sorokiniana (Chlorellales, Trebouxiophyceae)

Selection of highly productive algal strains is crucial for establishing economically viable biomass and biopro- duct cultivation systems. Characterization of algal genomes, including understanding strain-specific differences in genome content and architecture is a critical step in this process. Using genomic analyses, we demonstrate significant differences between three strains of Chlorella sorokiniana (strain 1228, UTEX 1230, and DOE1412). We found that unique, strain-specific genes comprise a substantial proportion of each genome, and genomic regions with> 80% local nucleotide identity constitute <15% of each genome among the strains, indicating substantial strain specific evolution. Furthermore, cataloging of meiosis and other sex-related genes in C. sor- okiniana strains suggests strategic breeding could be utilized to improve biomass and bioproduct yields if a sexual cycle can be characterized. Finally, preliminary investigation of epigenetic machinery suggests the pre- sence of potentially unique transcriptional regulation in each strain. Our data demonstrate that these three C. sorokiniana strains represent significantly different genomic content. Based on these findings, we propose in- dividualized assessment of each strain for potential performance in cultivation systems.


September 22, 2019

Full-length extension of HLA allele sequences by HLA allele-specific hemizygous Sanger sequencing (SSBT).

The gold standard for typing at the allele level of the highly polymorphic Human Leucocyte Antigen (HLA) gene system is sequence based typing. Since sequencing strategies have mainly focused on identification of the peptide binding groove, full-length sequence information is lacking for >90% of the HLA alleles. One of the goals of the 17th IHIWS workshop is to establish full-length sequences for as many HLA alleles as possible. In our component “Extension of HLA sequences by full-length HLA allele-specific hemizygous Sanger sequencing” we have used full-length hemizygous Sanger Sequence Based Typing to achieve this goal. We selected samples of which full length sequences were not available in the IPD-IMGT/HLA database. In total we have generated the full-length sequences of 48 HLA-A, 45 -B and 31 -C alleles. For HLA-A extended alleles, 39/48 showed no intron differences compared to the first allele of the corresponding allele group, for HLA-B this was 26/45 and for HLA-C 20/31. Comparing the intron sequences to other alleles of the same allele group revealed that in 5/48 HLA-A, 16/45 HLA-B and 8/31 HLA-C alleles the intron sequence was identical to another allele of the same allele group. In the remaining 10 cases, the sequence either showed polymorphism at a conserved nucleotide or was the result of a gene conversion event. Elucidation of the full-length sequence gives insight in the polymorphic content of the alleles and facilitates the identification of its evolutionary origin. Copyright © 2018 American Society for Histocompatibility and Immunogenetics. All rights reserved.


September 22, 2019

Reassessment of the evolution of wheat chromosomes 4A, 5A, and 7B.

Comparison of genome sequences of wild emmer wheat and Aegilops tauschii suggests a novel scenario of the evolution of rearranged wheat chromosomes 4A, 5A, and 7B. Past research suggested that wheat chromosome 4A was subjected to a reciprocal translocation T(4AL;5AL)1 that occurred in the diploid progenitor of the wheat A subgenome and to three major rearrangements that occurred in polyploid wheat: pericentric inversion Inv(4AS;4AL)1, paracentric inversion Inv(4AL;4AL)1, and reciprocal translocation T(4AL;7BS)1. Gene collinearity along the pseudomolecules of tetraploid wild emmer wheat (Triticum turgidum ssp. dicoccoides, subgenomes AABB) and diploid Aegilops tauschii (genomes DD) was employed to confirm these rearrangements and to analyze the breakpoints. The exchange of distal regions of chromosome arms 4AS and 4AL due to pericentric inversion Inv(4AS;4AL)1 was detected, and breakpoints were validated with an optical Bionano genome map. Both breakpoints contained satellite DNA. The breakpoints of reciprocal translocation T(4AL;7BS)1 were also found. However, the breakpoints that generated paracentric inversion Inv(4AL;4AL)1 appeared to be collocated with the 4AL breakpoints that had produced Inv(4AS;4AL)1 and T(4AL;7BS)1. Inv(4AS;4AL)1, Inv(4AL;4AL)1, and T(4AL;7BS)1 either originated sequentially, and Inv(4AL;4AL)1 was produced by recurrent chromosome breaks at the same breakpoints that generated Inv(4AS;4AL)1 and T(4AL;7BS)1, or Inv(4AS;4AL)1, Inv(4AL;4AL)1, and T(4AL;7BS)1 originated simultaneously. We prefer the latter hypothesis since it makes fewer assumptions about the sequence of events that produced these chromosome rearrangements.


September 22, 2019

Comparative genomic and methylome analysis of non-virulent D74 and virulent Nagasaki Haemophilus parasuis isolates.

Haemophilus parasuis is a respiratory pathogen of swine and the etiological agent of Glässer’s disease. H. parasuis isolates can exhibit different virulence capabilities ranging from lethal systemic disease to subclinical carriage. To identify genomic differences between phenotypically distinct strains, we obtained the closed whole-genome sequence annotation and genome-wide methylation patterns for the highly virulent Nagasaki strain and for the non-virulent D74 strain. Evaluation of the virulence-associated genes contained within the genomes of D74 and Nagasaki led to the discovery of a large number of toxin-antitoxin (TA) systems within both genomes. Five predicted hemolysins were identified as unique to Nagasaki and seven putative contact-dependent growth inhibition toxin proteins were identified only in strain D74. Assessment of all potential vtaA genes revealed thirteen present in the Nagasaki genome and three in the D74 genome. Subsequent evaluation of the predicted protein structure revealed that none of the D74 VtaA proteins contain a collagen triple helix repeat domain. Additionally, the predicted protein sequence for two D74 VtaA proteins is substantially longer than any predicted Nagasaki VtaA proteins. Fifteen methylation sequence motifs were identified in D74 and fourteen methylation sequence motifs were identified in Nagasaki using SMRT sequencing analysis. Only one of the methylation sequence motifs was observed in both strains indicative of the diversity between D74 and Nagasaki. Subsequent analysis also revealed diversity in the restriction-modification systems harbored by D74 and Nagasaki. The collective information reported in this study will aid in the development of vaccines and intervention strategies to decrease the prevalence and disease burden caused by H. parasuis.


September 22, 2019

Streptococcus suis contains multiple phase-variable methyltransferases that show a discrete lineage distribution.

Streptococcus suis is a major pathogen of swine, responsible for a number of chronic and acute infections, and is also emerging as a major zoonotic pathogen, particularly in South-East Asia. Our study of a diverse population of S. suis shows that this organism contains both Type I and Type III phase-variable methyltransferases. In all previous examples, phase-variation of methyltransferases results in genome wide methylation differences, and results in differential regulation of multiple genes, a system known as the phasevarion (phase-variable regulon). We hypothesized that each variant in the Type I and Type III systems encoded a methyltransferase with a unique specificity, and could therefore control a distinct phasevarion, either by recombination-driven shuffling between different specificities (Type I) or by biphasic on-off switching via simple sequence repeats (Type III). Here, we present the identification of the target specificities for each Type III allelic variant from S. suis using single-molecule, real-time methylome analysis. We demonstrate phase-variation is occurring in both Type I and Type III methyltransferases, and show a distinct association between methyltransferase type and presence, and population clades. In addition, we show that the phase-variable Type I methyltransferase was likely acquired at the origin of a highly virulent zoonotic sub-population.


September 22, 2019

Excision-reintegration at a pneumococcal phase-variable restriction-modification locus drives within- and between-strain epigenetic differentiation and inhibits gene acquisition.

Phase-variation of Type I restriction-modification systems can rapidly alter the sequence motifs they target, diversifying both the epigenetic patterns and endonuclease activity within clonally descended populations. Here, we characterize the Streptococcus pneumoniae SpnIV phase-variable Type I RMS, encoded by the translocating variable restriction (tvr) locus, to identify its target motifs, mechanism and regulation of phase variation, and effects on exchange of sequence through transformation. The specificity-determining hsdS genes were shuffled through a recombinase-mediated excision-reintegration mechanism involving circular intermediate molecules, guided by two types of direct repeat. The rate of rearrangements was limited by an attenuator and toxin-antitoxin system homologs that inhibited recombinase gene transcription. Target motifs for both the SpnIV, and multiple Type II, MTases were identified through methylation-sensitive sequencing of a panel of recombinase-null mutants. This demonstrated the species-wide diversity observed at the tvr locus can likely specify nine different methylation patterns. This will reduce sequence exchange in this diverse species, as the native form of the SpnIV RMS was demonstrated to inhibit the acquisition of genomic islands by transformation. Hence the tvr locus can drive variation in genome methylation both within and between strains, and limits the genomic plasticity of S. pneumoniae.


Talk with an expert

If you have a question, need to check the status of an order, or are interested in purchasing an instrument, we're here to help.