Menu
September 22, 2019

A transposable element annotation pipeline and expression analysis reveal potentially active elements in the microalga Tisochrysis lutea.

Transposable elements (TEs) are mobile DNA sequences known as drivers of genome evolution. Their impacts have been widely studied in animals, plants and insects, but little is known about them in microalgae. In a previous study, we compared the genetic polymorphisms between strains of the haptophyte microalga Tisochrysis lutea and suggested the involvement of active autonomous TEs in their genome evolution.To identify potentially autonomous TEs, we designed a pipeline named PiRATE (Pipeline to Retrieve and Annotate Transposable Elements, download: https://doi.org/10.17882/51795 ), and conducted an accurate TE annotation on a new genome assembly of T. lutea. PiRATE is composed of detection, classification and annotation steps. Its detection step combines multiple, existing analysis packages representing all major approaches for TE detection and its classification step was optimized for microalgal genomes. The efficiency of the detection and classification steps was evaluated with data on the model species Arabidopsis thaliana. PiRATE detected 81% of the TE families of A. thaliana and correctly classified 75% of them. We applied PiRATE to T. lutea genomic data and established that its genome contains 15.89% Class I and 4.95% Class II TEs. In these, 3.79 and 17.05% correspond to potentially autonomous and non-autonomous TEs, respectively. Annotation data was combined with transcriptomic and proteomic data to identify potentially active autonomous TEs. We identified 17 expressed TE families and, among these, a TIR/Mariner and a TIR/hAT family were able to synthesize their transposase. Both these TE families were among the three highest expressed genes in a previous transcriptomic study and are composed of highly similar copies throughout the genome of T. lutea. This sum of evidence reveals that both these TE families could be capable of transposing or triggering the transposition of potential related MITE elements.This manuscript provides an example of a de novo transposable element annotation of a non-model organism characterized by a fragmented genome assembly and belonging to a poorly studied phylum at genomic level. Integration of multi-omics data enabled the discovery of potential mobile TEs and opens the way for new discoveries on the role of these repeated elements in genomic evolution of microalgae.


September 22, 2019

Nucleotide-binding resistance gene signatures in sugar beet, insights from a new reference genome.

Nucleotide-binding (NB-ARC), leucine-rich-repeat genes (NLRs) account for 60.8% of resistance (R) genes molecularly characterized from plants. NLRs exist as large gene families prone to tandem duplication and transposition, with high sequence diversity among crops and their wild relatives. This diversity can be a source of new disease resistance, but difficulty in distinguishing specific sequences from homologous gene family members hinders characterization of resistance for improving crop varieties. Current genome sequencing and assembly technologies, especially those using long-read sequencing, are improving resolution of repeat-rich genomic regions and clarifying locations of duplicated genes, such as NLRs. Using the conserved NB-ARC domain as a model, 231 tentative NB-ARC loci were identified in a highly contiguous genome assembly of sugar beet, revealing diverged and truncated NB-ARC signatures as well as full-length sequences. The NB-ARC-associated proteins contained NLR resistance gene domains, including TIR, CC, and LRR, as well as other integrated domains. Phylogenetic relationships of partial and complete domains were determined, and patterns of physical clustering in the genome were evaluated. Comparison of sugar beet NB-ARC domains to validated R genes from monocots and eudicots suggested extensive B. vulgaris-specific subfamily expansions. The NLR landscape in the rhizomania resistance conferring Rz region of Chromosome 3 was characterized, identifying 26 NLR-like sequences spanning 20 MB. This work presents the first detailed view of NLR family composition in a member of the Caryophyllales, builds a foundation for additional disease resistance work in B. vulgaris, and demonstrates an additional nucleic-acid-based method for NLR prediction in non-model plant species. This article is protected by copyright. All rights reserved.This article is protected by copyright. All rights reserved.


September 22, 2019

Enrichment of low-density symbiont DNA from minute insects.

Symbioses between bacteria and insects are often associated with changes in important biological traits that can significantly affect host fitness. To a large extent, studies of these interactions have been based on physiological changes or induced phenotypes in the host, and the genetic mechanisms by which symbionts interact with their hosts have only recently become better understood. Learning about symbionts has been challenging in part due to difficulties such as obtaining enough high quality genomic material for high throughput sequencing technology, especially for symbionts present in low titers, and in small or difficult to rear non-model hosts. Here we introduce a new method that substantially increases the yield of bacterial DNA in minute arthropod hosts, and requires less starting material relative to previous published methods. Copyright © 2018 Elsevier B.V. All rights reserved.


September 22, 2019

Signatures of host specialization and a recent transposable element burst in the dynamic one-speed genome of the fungal barley powdery mildew pathogen.

Powdery mildews are biotrophic pathogenic fungi infecting a number of economically important plants. The grass powdery mildew, Blumeria graminis, has become a model organism to study host specialization of obligate biotrophic fungal pathogens. We resolved the large-scale genomic architecture of B. graminis forma specialis hordei (Bgh) to explore the potential influence of its genome organization on the co-evolutionary process with its host plant, barley (Hordeum vulgare).The near-chromosome level assemblies of the Bgh reference isolate DH14 and one of the most diversified isolates, RACE1, enabled a comparative analysis of these haploid genomes, which are highly enriched with transposable elements (TEs). We found largely retained genome synteny and gene repertoires, yet detected copy number variation (CNV) of secretion signal peptide-containing protein-coding genes (SPs) and locally disrupted synteny blocks. Genes coding for sequence-related SPs are often locally clustered, but neither the SPs nor the TEs reside preferentially in genomic regions with unique features. Extended comparative analysis with different host-specific B. graminis formae speciales revealed the existence of a core suite of SPs, but also isolate-specific SP sets as well as congruence of SP CNV and phylogenetic relationship. We further detected evidence for a recent, lineage-specific expansion of TEs in the Bgh genome.The characteristics of the Bgh genome (largely retained synteny, CNV of SP genes, recently proliferated TEs and a lack of significant compartmentalization) are consistent with a “one-speed” genome that differs in its architecture and (co-)evolutionary pattern from the “two-speed” genomes reported for several other filamentous phytopathogens.


September 22, 2019

Identification of a leucine-rich repeat receptor-like serine/threonine-protein kinase as a candidate gene for Rvi12 (Vb)-based apple scab resistance

Apple scab caused by Venturia inaequalis is the most important fungal disease of apples (Malus × domestica). Currently, the disease is controlled by up to 15 fungicide applications to the crop per year. Resistant apple cultivars will help promote the sustainable control of scab in commercial orchards. The breakdown of the Rvi6 (Vf) major-gene based resistance, the most used resistance gene in apple breeding, prompted the identification and characterization of new scab resistance genes. By using a large segregating population, the Rvi12 scab resistance gene was previously mapped to a genetic location flanked by molecular markers SNP_23.599 and SNP_24.482. Starting from these markers, utilizing chromosome walking of a Hansen’s baccata #2 (HB2) BAC-library; a single BAC clone spanning the Rvi12 interval was identified. Following Pacific Biosciences (PacBio) RS II sequencing and the use of the hierarchical genome assembly process (HGAP) assembly of the BAC clone sequence, the Rvi12 resistance locus was localized to a 62.3-kb genomic region. Gene prediction and in silico characterization identified a single candidate resistance gene. The gene, named here as Rvi12_Cd5, belongs to the LRR receptor-like serine/threonine-protein kinase family. In silico comparison of the resistance allele from HB2 and the susceptible allele from Golden Delicious (GD) identified the presence of an additional intron in the HB2 allele. Conserved domain analysis identified the presence of four additional LRR motifs in the susceptible allele compared to the resistance allele. The constitutive expression of Rvi12_Cd5 in HB2, together with its structural similarity to known resistance genes, makes it the most likely candidate for Rvi12 scab resistance in apple.


September 22, 2019

The African Bullfrog (Pyxicephalus adspersus) genome unites the two ancestral ingredients for making vertebrate sex chromosomes

Heteromorphic sex chromosomes have evolved repeatedly among vertebrate lineages despite largely deleterious reductions in gene dose. Understanding how this gene dose problem is overcome is hampered by the lack of genomic information at the base of tetrapods and comparisons across the evolutionary history of vertebrates. To address this problem, we produced a chromosome-level genome assembly for the African Bullfrog (Pyxicephalus adspersus)–an amphibian with heteromorphic ZW sex chromosomes–and discovered that the Bullfrog Z is surprisingly homologous to substantial portions of the human X. Using this new reference genome, we identified ancestral synteny among the sex chromosomes of major vertebrate lineages, showing that non-mammalian sex chromosomes are strongly associated with a single vertebrate ancestral chromosome, while mammals are associated with another that displays increased haploinsufficiency. The sex chromosomes of the African Bullfrog however, share genomic blocks with both humans and non-mammalian vertebrates, connecting the two ancestral chromosome sequences that repeatedly characterize vertebrate sex chromosomes. Our results highlight the consistency of sex-linked sequences despite sex determination system lability and reveal the repeated use of two major genomic sequence blocks during vertebrate sex chromosome evolution.


September 22, 2019

Pseudomonas aeruginosa L10: A hydrocarbon-degrading, biosurfactant-producing, and plant-growth-promoting endophytic bacterium isolated from a reed (Phragmites australis).

Bacterial endophytes with the capacity to degrade petroleum hydrocarbons and promote plant growth may facilitate phytoremediation for the removal of petroleum hydrocarbons from contaminated soils. A hydrocarbon-degrading, biosurfactant-producing, and plant-growth-promoting endophytic bacterium, Pseudomonas aeruginosa L10, was isolated from the roots of a reed, Phragmites australis, in the Yellow River Delta, Shandong, China. P. aeruginosa L10 efficiently degraded C10-C26n-alkanes from diesel oil, as well as common polycyclic aromatic hydrocarbons (PAHs) such as naphthalene, phenanthrene, and pyrene. In addition, P. aeruginosa L10 could produce biosurfactant, which was confirmed by the oil spreading method, and surface tension determination of inocula. Moreover, P. aeruginosa L10 had plant growth-stimulating attributes, including siderophore and indole-3-acetic acid (IAA) release, along with 1-aminocyclopropane-1-carboxylic (ACC) deaminase activity. To explore the mechanisms underlying the phenotypic traits of endophytic P. aeruginosa L10, we sequenced its complete genome. From the genome, we identified genes related to petroleum hydrocarbon degradation, such as putative genes encoding monooxygenase, dioxygenase, alcohol dehydrogenase, and aldehyde dehydrogenase. Genome annotation revealed that P. aeruginosa L10 contained a gene cluster involved in the biosynthesis of rhamnolipids, rhlABRI, which should be responsible for the observed biosurfactant activity. We also identified two clusters of genes involved in the biosynthesis of siderophore (pvcABCD and pchABCDREFG). The genome also harbored tryptophan biosynthetic genes (trpAB, trpDC, trpE, trpF, and trpG) that are responsible for IAA synthesis. Moreover, the genome contained the ACC deaminase gene essential for ACC deaminase activity. This study will facilitate applications of endophytic P. aeruginosa L10 to phytoremediation by advancing the understanding of hydrocarbon degradation, biosurfactant synthesis, and mutualistic interactions between endophytes and host plants.


September 22, 2019

Progressive approach for SNP calling and haplotype assembly using single molecular sequencing data.

Haplotype information is essential to the complete description and interpretation of genomes, genetic diversity and genetic ancestry. The new technologies can provide Single Molecular Sequencing (SMS) data that cover about 90% of positions over chromosomes. However, the SMS data has a higher error rate comparing to 1% error rate for short reads. Thus, it becomes very difficult for SNP calling and haplotype assembly using SMS reads. Most existing technologies do not work properly for the SMS data.In this paper, we develop a progressive approach for SNP calling and haplotype assembly that works very well for the SMS data. Our method can handle more than 200 million non-N bases on Chromosome 1 with millions of reads, more than 100 blocks, each of which contains more than 2 million bases and more than 3K SNP sites on average. Experiment results show that the false discovery rate and false negative rate for our method are 15.7 and 11.0% on NA12878, and 16.5 and 11.0% on NA24385. Moreover, the overall switch errors for our method are 7.26 and 5.21 with average 3378 and 5736 SNP sites per block on NA12878 and NA24385, respectively. Here, we demonstrate that SMS reads alone can generate a high quality solution for both SNP calling and haplotype assembly.Source codes and results are available at https://github.com/guofeieileen/SMRT/wiki/Software.


September 22, 2019

Long-read sequencing data analysis for yeasts.

Long-read sequencing technologies have become increasingly popular due to their strengths in resolving complex genomic regions. As a leading model organism with small genome size and great biotechnological importance, the budding yeast Saccharomyces cerevisiae has many isolates currently being sequenced with long reads. However, analyzing long-read sequencing data to produce high-quality genome assembly and annotation remains challenging. Here, we present a modular computational framework named long-read sequencing data analysis for yeasts (LRSDAY), the first one-stop solution that streamlines this process. Starting from the raw sequencing reads, LRSDAY can produce chromosome-level genome assembly and comprehensive genome annotation in a highly automated manner with minimal manual intervention, which is not possible using any alternative tool available to date. The annotated genomic features include centromeres, protein-coding genes, tRNAs, transposable elements (TEs), and telomere-associated elements. Although tailored for S. cerevisiae, we designed LRSDAY to be highly modular and customizable, making it adaptable to virtually any eukaryotic organism. When applying LRSDAY to an S. cerevisiae strain, it takes ~41 h to generate a complete and well-annotated genome from ~100× Pacific Biosciences (PacBio) running the basic workflow with four threads. Basic experience working within the Linux command-line environment is recommended for carrying out the analysis using LRSDAY.


September 22, 2019

The genome of Artemisia annua provides insight into the evolution of Asteraceae family and artemisinin biosynthesis.

Artemisia annua, commonly known as sweet wormwood or Qinghao, is a shrub native to China and has long been used for medicinal purposes. A. annua is now cultivated globally as the only natural source of a potent anti-malarial compound, artemisinin. Here, we report a high-quality draft assembly of the 1.74-gigabase genome of A. annua, which is highly heterozygous, rich in repetitive sequences, and contains 63 226 protein-coding genes, one of the largest numbers among the sequenced plant species. We found that, as one of a few sequenced genomes in the Asteraceae, the A. annua genome contains a large number of genes specific to this large angiosperm clade. Notably, the expansion and functional diversification of genes encoding enzymes involved in terpene biosynthesis are consistent with the evolution of the artemisinin biosynthetic pathway. We further revealed by transcriptome profiling that A. annua has evolved the sophisticated transcriptional regulatory networks underlying artemisinin biosynthesis. Based on comprehensive genomic and transcriptomic analyses we generated transgenic A. annua lines producing high levels of artemisinin, which are now ready for large-scale production and thereby will help meet the challenge of increasing global demand of artemisinin. Copyright © 2018 The Author. Published by Elsevier Inc. All rights reserved.


September 22, 2019

Extensive exchange of transposable elements in the Drosophila pseudoobscura group.

As species diverge, so does their transposable element (TE) content. Within a genome, TE families may eventually become dormant due to host-silencing mechanisms, natural selection and the accumulation of inactive copies. The transmission of active copies from a TE families, both vertically and horizontally between species, can allow TEs to escape inactivation if it occurs often enough, as it may allow TEs to temporarily escape silencing in a new host. Thus, the contribution of horizontal exchange to TE persistence has been of increasing interest.Here, we annotated TEs in five species with sequenced genomes from the D. pseudoobscura species group, and curated a set of TE families found in these species. We found that, compared to host genes, many TE families showed lower neutral divergence between species, consistent with recent transmission of TEs between species. Despite these transfers, there are differences in the TE content between species in the group.The TE content is highly dynamic in the D. pseudoobscura species group, frequently transferring between species, keeping TEs active. This result highlights how frequently transposable elements are transmitted between sympatric species and, despite these transfers, how rapidly species TE content can diverge.


September 22, 2019

Genetic diversity of Cryptosporidium hominis in a Bangladeshi community as revealed by whole genome sequencing.

We studied the genetic diversity of Cryptosporidium hominis infections in slum-dwelling infants from Dhaka over a 2-year period. Cryptosporidium hominis infections were common during the monsoon, and were genetically diverse as measured by gp60 genotyping and whole-genome resequencing. Recombination in the parasite was evidenced by the decay of linkage disequilibrium in the genome over <300 bp. Regions of the genome with high levels of polymorphism were also identified. Yet to be determined is if genomic diversity is responsible in part for the high rate of reinfection, seasonality, and varied clinical presentations of cryptosporidiosis in this population.


September 22, 2019

Parallels between experimental and natural evolution of legume symbionts.

The emergence of symbiotic interactions has been studied using population genomics in nature and experimental evolution in the laboratory, but the parallels between these processes remain unknown. Here we compare the emergence of rhizobia after the horizontal transfer of a symbiotic plasmid in natural populations of Cupriavidus taiwanensis, over 10 MY ago, with the experimental evolution of symbiotic Ralstonia solanacearum for a few hundred generations. In spite of major differences in terms of time span, environment, genetic background, and phenotypic achievement, both processes resulted in rapid genetic diversification dominated by purifying selection. We observe no adaptation in the plasmid carrying the genes responsible for the ecological transition. Instead, adaptation was associated with positive selection in a set of genes that led to the co-option of the same quorum-sensing system in both processes. Our results provide evidence for similarities in experimental and natural evolutionary transitions and highlight the potential of comparisons between both processes to understand symbiogenesis.


September 22, 2019

The genome of Rhizophagus clarus HR1 reveals a common genetic basis for auxotrophy among arbuscular mycorrhizal fungi.

Mycorrhizal symbiosis is one of the most fundamental types of mutualistic plant-microbe interaction. Among the many classes of mycorrhizae, the arbuscular mycorrhizae have the most general symbiotic style and the longest history. However, the genomes of arbuscular mycorrhizal (AM) fungi are not well characterized due to difficulties in cultivation and genetic analysis. In this study, we sequenced the genome of the AM fungus Rhizophagus clarus HR1, compared the sequence with the genome sequence of the model species R. irregularis, and checked for missing genes that encode enzymes in metabolic pathways related to their obligate biotrophy.In the genome of R. clarus, we confirmed the absence of cytosolic fatty acid synthase (FAS), whereas all mitochondrial FAS components were present. A KEGG pathway map identified the absence of genes encoding enzymes for several other metabolic pathways in the two AM fungi, including thiamine biosynthesis and the conversion of vitamin B6 derivatives. We also found that a large proportion of the genes encoding glucose-producing polysaccharide hydrolases, that are present even in ectomycorrhizal fungi, also appear to be absent in AM fungi.In this study, we found several new genes that are absent from the genomes of AM fungi in addition to the genes previously identified as missing. Missing genes for enzymes in primary metabolic pathways imply that AM fungi may have a higher dependency on host plants than other biotrophic fungi. These missing metabolic pathways provide a genetic basis to explore the physiological characteristics and auxotrophy of AM fungi.


September 22, 2019

Horizontal transfer and proliferation of Tsu4 in Saccharomyces paradoxus.

Recent evidence suggests that horizontal transfer plays a significant role in the evolution of of transposable elements (TEs) in eukaryotes. Many cases of horizontal TE transfer (HTT) been reported in animals and plants, however surprisingly few examples of HTT have been reported in fungi.Here I report evidence for a novel HTT event in fungi involving Tsu4 in Saccharomyces paradoxus based on (i) unexpectedly high similarity between Tsu4 elements in S. paradoxus and S. uvarum, (ii) a patchy distribution of Tsu4 in S. paradoxus and general absence from its sister species S. cerevisiae, and (iii) discordance between the phylogenetic history of Tsu4 sequences and species in the Saccharomyces sensu stricto group. Available data suggests the HTT event likely occurred somewhere in the Nearctic, Neotropic or Indo-Australian part of the S. paradoxus species range, and that a lineage related to S. uvarum or S. eubayanus was the likely donor species. The HTT event has led to massive proliferation of Tsu4 in the South American lineage of S. paradoxus, which exhibits partial reproductive isolation with other strains of this species because of multiple reciprocal translocations. Full-length Tsu4 elements are associated with both breakpoints of one of these reciprocal translocations.This work shows that comprehensive analysis of TE sequences in essentially-complete genome assemblies derived from long-read sequencing provides new opportunities to detect HTT events in fungi and other organisms. This work also provides support for the hypothesis that HTT and subsequent TE proliferation can induce genome rearrangements that contribute to post-zygotic isolation in yeast.


Talk with an expert

If you have a question, need to check the status of an order, or are interested in purchasing an instrument, we're here to help.