Menu
September 22, 2019

Repeat-driven generation of antigenic diversity in a major human pathogen, Trypanosoma cruzi

Trypanosoma cruzi, a zoonotic kinetoplastid protozoan with a complex genome, is the causative agent of American trypanosomiasis (Chagas disease). The parasite uses a highly diverse repertoire of surface molecules, with roles in cell invasion, immune evasion and pathogenesis. Thus far, the genomic regions containing these genes have been impossible to resolve and it has been impossible to study the structure and function of the several thousand repetitive genes encoding the surface molecules of the parasite. We here present an improved genome assembly of a T. cruzi clade I (TcI) strain using high coverage PacBio single molecule sequencing, together with Illumina sequencing of 34 T. cruzi TcI isolates and clones from different geographic locations, sample sources and clinical outcomes. Resolution of the surface molecule gene structure reveals an unusual duality in the organisation of the parasite genome, a core genomic region syntenous with related protozoa flanked by unique and highly plastic subtelomeric regions encoding surface antigens. The presence of abundant interspersed retrotransposons in the subtelomeres suggests that these elements are involved in a recombination mechanism for the generation of antigenic variation and evasion of the host immune response. The comparative genomic analysis of the cohort of TcI strains revealed multiple cases of such recombination events involving surface molecule genes and has provided new insights into T. cruzi population structure.


September 22, 2019

The global distribution and spread of the mobilized colistin resistance gene mcr-1.

Colistin represents one of the few available drugs for treating infections caused by carbapenem-resistant Enterobacteriaceae. As such, the recent plasmid-mediated spread of the colistin resistance gene mcr-1 poses a significant public health threat, requiring global monitoring and surveillance. Here, we characterize the global distribution of mcr-1 using a data set of 457 mcr-1-positive sequenced isolates. We find mcr-1 in various plasmid types but identify an immediate background common to all mcr-1 sequences. Our analyses establish that all mcr-1 elements in circulation descend from the same initial mobilization of mcr-1 by an ISApl1 transposon in the mid 2000s (2002-2008; 95% highest posterior density), followed by a marked demographic expansion, which led to its current global distribution. Our results provide the first systematic phylogenetic analysis of the origin and spread of mcr-1, and emphasize the importance of understanding the movement of antibiotic resistance genes across multiple levels of genomic organization.


September 22, 2019

CliqueSNV: Scalable reconstruction of intra-host viral populations from NGS reads

Highly mutable RNA viruses such as influenza A virus, human immunodeficiency virus and hepatitis C virus exist in infected hosts as highly heterogeneous populations of closely related genomic variants. The presence of low-frequency variants with few mutations with respect to major strains may result in an immune escape, emergence of drug resistance, and an increase of virulence and infectivity. Next-generation sequencing technologies permit detection of sample intra-host viral population at extremely great depth, thus providing an opportunity to access low-frequency variants. Long read lengths offered by single-molecule sequencing technologies allow all viral variants to be sequenced in a single pass. However, high sequencing error rates limit the ability to study heterogeneous viral populations composed of rare, closely related variants. In this article, we present CliqueSNV, a novel reference-based method for reconstruction of viral variants from NGS data. It efficiently constructs an allele graph based on linkage between single nucleotide variations and identifies true viral variants by merging cliques of that graph using combinatorial optimization techniques. The new method outperforms existing methods in both accuracy and running time on experimental and simulated NGS data for titrated levels of known viral variants. For PacBio reads, it accurately reconstructs variants with frequency as low as 0.1%. For Illumina reads, it fully reconstructs main variants. The open source implementation of CliqueSNV is freely available for download at https://github.com/vyacheslav-tsivina/CliqueSNV


September 22, 2019

Dynamic evolution of a-gliadin prolamin gene family in homeologous genomes of hexaploid wheat.

Wheat Gli-2 loci encode complex groups of a-gliadin prolamins that are important for breadmaking, but also major triggers of celiac disease (CD). Elucidation of a-gliadin evolution provides knowledge to produce wheat with better end-use properties and reduced immunogenic potential. The Gli-2 loci contain a large number of tandemly duplicated genes and highly repetitive DNA, making sequence assembly of their genomic regions challenging. Here, we constructed high-quality sequences spanning the three wheat homeologous a-gliadin loci by aligning PacBio-based sequence contigs with BioNano genome maps. A total of 47 a-gliadin genes were identified with only 26 encoding intact full-length protein products. Analyses of a-gliadin loci and phylogenetic tree reconstruction indicate significant duplications of a-gliadin genes in the last ~2.5 million years after the divergence of the A, B and D genomes, supporting its rapid lineage-independent expansion in different Triticeae genomes. We showed that dramatic divergence in expression of a-gliadin genes could not be attributed to sequence variations in the promoter regions. The study also provided insights into the evolution of CD epitopes and identified a single indel event in the hexaploid wheat D genome that likely resulted in the generation of the highly toxic 33-mer CD epitope.


September 22, 2019

Genomic diversity of Taylorella equigenitalis introduced into the United States from 1978 to 2012.

Contagious equine metritis is a disease of worldwide concern in equids. The United States is considered to be free of the disease although sporadic outbreaks have occurred over the last few decades that were thought to be associated with the importation of horses. The objective of this study was to create finished, reference quality genomes that characterize the diversity of Taylorella equigenitalis isolates introduced into the USA, and identify their differences. Five isolates of T. equigenitalis associated with introductions into the USA from unique sources were sequenced using both short and long read chemistries allowing for complete assembly and annotation. These sequences were compared to previously published genomes as well as the short read sequences of the 200 isolates in the National Veterinary Services Laboratories’ diagnostic repository to identify unique regions and genes, potential virulence factors, and characterize diversity. The 5 genomes varied in size by up to 100,000 base pairs, but averaged 1.68 megabases. The majority of that diversity in size can be explained by repeat regions and 4 main regions of difference, which ranged in size from 15,000 to 45,000 base pairs. The first region of difference contained mostly hypothetical proteins, the second contained the CRISPR, the third contained primarily hemagglutinin proteins, and the fourth contained primarily segments of a type IV secretion system. As expected and previously reported, little evidence of recombination was found within these genomes. Several additional areas of interest were also observed including a mechanism for streptomycin resistance and other virulence factors. A SNP distance comparison of the T. equigenitalis isolates and Mycobacterium tuberculosis complex (MTBC) showed that relatively, T. equigenitalis was a more diverse species than the entirety of MTBC.


September 22, 2019

Comparative genomic insights into endofungal lifestyles of two bacterial endosymbionts, Mycoavidus cysteinexigens and Burkholderia rhizoxinica.

Endohyphal bacteria (EHB), dwelling within fungal hyphae, markedly affect the growth and metabolic potential of their hosts. To date, two EHB belonging to the family Burkholderiaceae have been isolated and characterized as new taxa, Burkholderia rhizoxinica (HKI 454T) and Mycoavidus cysteinexigens (B1-EBT), in Japan. Metagenome sequencing was recently reported for Mortierella elongata AG77 together with its endosymbiont M. cysteinexigens (Mc-AG77) from a soil/litter sample in the USA. In the present study, we elucidated the complete genome sequence of B1-EBT and compared it with those of Mc-AG77 and HKI 454T. The genomes of B1-EBT and Mc-AG77 contained a higher level of prophage sequences and were markedly smaller than that of HKI 454T. Although the B1-EBT and Mc-AG77 genomes lacked the chitinolytic enzyme genes responsible for invasion into fungal cells, they contained several predicted toxin-antitoxin systems including an insecticidal toxin complex and PIN domain imposing an addiction-like mechanism essential for endohyphal growth control during host colonization. Despite the different host fungi, the alignment of amino acid sequences showed that the HKI 454T genome consisted of 1,265 (32.6%) and 1,221 (31.5%) orthologous coding sequences (CDSs) with those of B1-EBT and Mc-AG77, respectively. This comparative study of three phylogenetically associated endosymbionts has provided insights into their origin and evolution, and suggests the later bacterial invasion and adaptation of B1-EBT to its host metabolism.


September 22, 2019

Expansions of intronic TTTCA and TTTTA repeats in benign adult familial myoclonic epilepsy.

Epilepsy is a common neurological disorder, and mutations in genes encoding ion channels or neurotransmitter receptors are frequent causes of monogenic forms of epilepsy. Here we show that abnormal expansions of TTTCA and TTTTA repeats in intron 4 of SAMD12 cause benign adult familial myoclonic epilepsy (BAFME). Single-molecule, real-time sequencing of BAC clones and nanopore sequencing of genomic DNA identified two repeat configurations in SAMD12. Intriguingly, in two families with a clinical diagnosis of BAFME in which no repeat expansions in SAMD12 were observed, we identified similar expansions of TTTCA and TTTTA repeats in introns of TNRC6A and RAPGEF2, indicating that expansions of the same repeat motifs are involved in the pathogenesis of BAFME regardless of the genes in which the expanded repeats are located. This discovery that expansions of noncoding repeats lead to neuronal dysfunction responsible for myoclonic tremor and epilepsy extends the understanding of diseases with such repeat expansion.


September 22, 2019

Comparative genome analysis reveals a complex population structure of Legionella pneumophila subspecies.

The majority of Legionnaires’ disease (LD) cases are caused by Legionella pneumophila, a genetically heterogeneous species composed of at least 17 serogroups. Previously, it was demonstrated that L. pneumophila consists of three subspecies: pneumophila, fraseri and pascullei. During an LD outbreak investigation in 2012, we detected that representatives of both subspecies fraseri and pascullei colonized the same water system and that the outbreak-causing strain was a new member of the least represented subspecies pascullei. We used partial sequence based typing consensus patterns to mine an international database for additional representatives of fraseri and pascullei subspecies. As a result, we identified 46 sequence types (STs) belonging to subspecies fraseri and two STs belonging to subspecies pascullei. Moreover, a recent retrospective whole genome sequencing analysis of isolates from New York State LD clusters revealed the presence of a fourth L. pneumophila subspecies that we have termed raphaeli. This subspecies consists of 15 STs. Comparative analysis was conducted using the genomes of multiple members of all four L. pneumophila subspecies. Whereas each subspecies forms a distinct phylogenetic clade within the L. pneumophila species, they share more average nucleotide identity with each other than with other Legionella species. Unique genes for each subspecies were identified and could be used for rapid subspecies detection. Improved taxonomic classification of L. pneumophila strains may help identify environmental niches and virulence attributes associated with these genetically distinct subspecies. Published by Elsevier B.V.


September 22, 2019

Benefit from decline: the primary transcriptome of Alteromonas macleodii str. Te101 during Trichodesmium demise.

Interactions between co-existing microorganisms deeply affect the physiology of the involved organisms and, ultimately, the function of the ecosystem as a whole. Copiotrophic Alteromonas are marine gammaproteobacteria that thrive during the late stages of phytoplankton blooms in the marine environment and in laboratory co-cultures with cyanobacteria such as Trichodesmium. The response of this heterotroph to the sometimes rapid and transient changes in nutrient supply when the phototroph crashes is not well understood. Here, we isolated and sequenced the strain Alteromonas macleodii str. Te101 from a laboratory culture of Trichodesmium erythraeum IMS101, yielding a chromosome of 4.63?Mb and a single plasmid of 237?kb. Increasing salinities to =43 ppt inhibited the growth of Trichodesmium but stimulated growth of the associated Alteromonas. We characterized the transcriptomic responses of both microorganisms and identified the complement of active transcriptional start sites in Alteromonas at single-nucleotide resolution. In replicate cultures, a similar set of genes became activated in Alteromonas when growth rates of Trichodesmium declined and mortality was high. The parallel activation of fliA, rpoS and of flagellar assembly and growth-related genes indicated that Alteromonas might have increased cell motility, growth, and multiple biosynthetic activities. Genes with the highest expression in the data set were three small RNAs (Aln1a-c) that were identified as analogs of the small RNAs CsrB-C in E. coli or RsmX-Z in pathogenic bacteria. Together with the carbon storage protein A (CsrA) homolog Te101_05290, these RNAs likely control the expression of numerous genes in responding to changes in the environment.


September 22, 2019

Autologous cell therapy approach for Duchenne muscular dystrophy using PiggyBac transposons and mesoangioblasts.

Duchenne muscular dystrophy (DMD) is a lethal muscle-wasting disease currently without cure. We investigated the use of the PiggyBac transposon for full-length dystrophin expression in murine mesoangioblast (MABs) progenitor cells. DMD murine MABs were transfected with transposable expression vectors for full-length dystrophin and transplanted intramuscularly or intra-arterially into mdx/SCID mice. Intra-arterial delivery indicated that the MABs could migrate to regenerating muscles to mediate dystrophin expression. Intramuscular transplantation yielded dystrophin expression in 11%-44% of myofibers in murine muscles, which remained stable for the assessed period of 5 months. The satellite cells isolated from transplanted muscles comprised a fraction of MAB-derived cells, indicating that the transfected MABs may colonize the satellite stem cell niche. Transposon integration site mapping by whole-genome sequencing indicated that 70% of the integrations were intergenic, while none was observed in an exon. Muscle resistance assessment by atomic force microscopy indicated that 80% of fibers showed elasticity properties restored to those of wild-type muscles. As measured in vivo, transplanted muscles became more resistant to fatigue. This study thus provides a proof-of-principle that PiggyBac transposon vectors may mediate full-length dystrophin expression as well as functional amelioration of the dystrophic muscles within a potential autologous cell-based therapeutic approach of DMD. Copyright © 2018 The American Society of Gene and Cell Therapy. Published by Elsevier Inc. All rights reserved.


September 22, 2019

Pathogen-specific binding soluble Down syndrome cell adhesion molecule (Dscam) regulates phagocytosis via membrane-bound Dscam in crab.

The Down syndrome cell adhesion molecule (Dscam) gene is an extraordinary example of diversity that can produce thousands of isoforms and has so far been found only in insects and crustaceans. Cumulative evidence indicates that Dscam may contribute to the mechanistic foundations of specific immune responses in insects. However, the mechanism and functions of Dscam in relation to pathogens and immunity remain largely unknown. In this study, we identified the genome organization and alternative Dscam exons from Chinese mitten crab, Eriocheir sinensis. These variants, designated EsDscam, potentially produce 30,600 isoforms due to three alternatively spliced immunoglobulin (Ig) domains and a transmembrane domain. EsDscam was significantly upregulated after bacterial challenge at both mRNA and protein levels. Moreover, bacterial specific EsDscam isoforms were found to bind specifically with the original bacteria to facilitate efficient clearance. Furthermore, bacteria-specific binding of soluble EsDscam via the complete Ig1-Ig4 domain significantly enhanced elimination of the original bacteria via phagocytosis by hemocytes; this function was abolished by partial Ig1-Ig4 domain truncation. Further studies showed that knockdown of membrane-bound EsDscam inhibited the ability of EsDscam with the same extracellular region to promote bacterial phagocytosis. Immunocytochemistry indicated colocalization of the soluble and membrane-bound forms of EsDscam at the hemocyte surface. Far-Western and coimmunoprecipitation assays demonstrated homotypic interactions between EsDscam isoforms. This study provides insights into a mechanism by which soluble Dscam regulates hemocyte phagocytosis via bacteria-specific binding and specific interactions with membrane-bound Dscam as a phagocytic receptor.


September 22, 2019

Repeated evolution of self-compatibility for reproductive assurance.

Sexual reproduction in eukaryotes requires the fusion of two compatible gametes of opposite sexes or mating types. To meet the challenge of finding a mating partner with compatible gametes, evolutionary mechanisms such as hermaphroditism and self-fertilization have repeatedly evolved. Here, by combining the insights from comparative genomics, computer simulations and experimental evolution in fission yeast, we shed light on the conditions promoting separate mating types or self-compatibility by mating-type switching. Analogous to multiple independent transitions between switchers and non-switchers in natural populations mediated by structural genomic changes, novel switching genotypes readily evolved under selection in the experimental populations. Detailed fitness measurements accompanied by computer simulations show the benefits and costs of switching during sexual and asexual reproduction, governing the occurrence of both strategies in nature. Our findings illuminate the trade-off between the benefits of reproductive assurance and its fitness costs under benign conditions facilitating the evolution of self-compatibility.


September 22, 2019

Comparative genomics of smut pathogens: Insights from orphans and positively selected genes into host specialization.

Host specialization is a key evolutionary process for the diversification and emergence of new pathogens. However, the molecular determinants of host range are poorly understood. Smut fungi are biotrophic pathogens that have distinct and narrow host ranges based on largely unknown genetic determinants. Hence, we aimed to expand comparative genomics analyses of smut fungi by including more species infecting different hosts and to define orphans and positively selected genes to gain further insights into the genetics basis of host specialization. We analyzed nine lineages of smut fungi isolated from eight crop and non-crop hosts: maize, barley, sugarcane, wheat, oats, Zizania latifolia (Manchurian rice), Echinochloa colona (a wild grass), and Persicaria sp. (a wild dicot plant). We assembled two new genomes: Ustilago hordei (strain Uhor01) isolated from oats and U. tritici (strain CBS 119.19) isolated from wheat. The smut genomes were of small sizes, ranging from 18.38 to 24.63 Mb. U. hordei species experienced genome expansions due to the proliferation of transposable elements and the amount of these elements varied among the two strains. Phylogenetic analysis confirmed that Ustilago is not a monophyletic genus and, furthermore, detected misclassification of the U. tritici specimen. The comparison between smut pathogens of crop and non-crop hosts did not reveal distinct signatures, suggesting that host domestication did not play a dominant role in shaping the evolution of smuts. We found that host specialization in smut fungi likely has a complex genetic basis: different functional categories were enriched in orphans and lineage-specific selected genes. The diversification and gain/loss of effector genes are probably the most important determinants of host specificity.


September 22, 2019

A manually annotated Actinidia chinensis var. chinensis (kiwifruit) genome highlights the challenges associated with draft genomes and gene prediction in plants.

Most published genome sequences are drafts, and most are dominated by computational gene prediction. Draft genomes typically incorporate considerable sequence data that are not assigned to chromosomes, and predicted genes without quality confidence measures. The current Actinidia chinensis (kiwifruit) ‘Hongyang’ draft genome has 164 Mb of sequences unassigned to pseudo-chromosomes, and omissions have been identified in the gene models.A second genome of an A. chinensis (genotype Red5) was fully sequenced. This new sequence resulted in a 554.0 Mb assembly with all but 6 Mb assigned to pseudo-chromosomes. Pseudo-chromosomal comparisons showed a considerable number of translocation events have occurred following a whole genome duplication (WGD) event some consistent with centromeric Robertsonian-like translocations. RNA sequencing data from 12 tissues and ab initio analysis informed a genome-wide manual annotation, using the WebApollo tool. In total, 33,044 gene loci represented by 33,123 isoforms were identified, named and tagged for quality of evidential support. Of these 3114 (9.4%) were identical to a protein within ‘Hongyang’ The Kiwifruit Information Resource (KIR v2). Some proportion of the differences will be varietal polymorphisms. However, as most computationally predicted Red5 models required manual re-annotation this proportion is expected to be small. The quality of the new gene models was tested by fully sequencing 550 cloned ‘Hort16A’ cDNAs and comparing with the predicted protein models for Red5 and both the original ‘Hongyang’ assembly and the revised annotation from KIR v2. Only 48.9% and 63.5% of the cDNAs had a match with 90% identity or better to the original and revised ‘Hongyang’ annotation, respectively, compared with 90.9% to the Red5 models.Our study highlights the need to take a cautious approach to draft genomes and computationally predicted genes. Our use of the manual annotation tool WebApollo facilitated manual checking and correction of gene models enabling improvement of computational prediction. This utility was especially relevant for certain types of gene families such as the EXPANSIN like genes. Finally, this high quality gene set will supply the kiwifruit and general plant community with a new tool for genomics and other comparative analysis.


September 22, 2019

Comparative genomics of the wheat fungal pathogen Pyrenophora tritici-repentis reveals chromosomal variations and genome plasticity.

Pyrenophora tritici-repentis (Ptr) is a necrotrophic fungal pathogen that causes the major wheat disease, tan spot. We set out to provide essential genomics-based resources in order to better understand the pathogenicity mechanisms of this important pathogen.Here, we present eight new Ptr isolate genomes, assembled and annotated; representing races 1, 2 and 5, and a new race. We report a high quality Ptr reference genome, sequenced by PacBio technology with Illumina paired-end data support and optical mapping. An estimated 98% of the genome coverage was mapped to 10 chromosomal groups, using a two-enzyme hybrid approach. The final reference genome was 40.9 Mb and contained a total of 13,797 annotated genes, supported by transcriptomic and proteogenomics data sets.Whole genome comparative analysis revealed major chromosomal segmental rearrangements and fusions, highlighting intraspecific genome plasticity in this species. Furthermore, the Ptr race classification was not supported at the whole genome level, as phylogenetic analysis did not cluster the ToxA producing isolates. This expansion of available Ptr genomics resources will directly facilitate research aimed at controlling tan spot disease.


Talk with an expert

If you have a question, need to check the status of an order, or are interested in purchasing an instrument, we're here to help.