Menu
September 22, 2019

The N6-adenine methylation in yeast genome profiled by single-molecule technology.

The most common and abundant DNA modification is 5-meth- ylcytosine (5mC), which has been well-established as an epigenetic mark regulating gene expression in eukaryotes (Jones, 2012). Another DNA modification N6-methyldeoxyadenosine (6mA), pre- viously reported as a widespread DNA methylation in prokaryotes, plays an important role in gene expression, DNA replication, DNA repair, cell cycle progression and host-pathogen interaction (Messer and Noyer-Weidner, 1988; Lu et al., 1994; Collier et al., 2007). The knowledge of 6mA in eukaryotes has been very limited until the recent development of high-throughput sequencing and high-sensitive mass spectrometry technologies, which have greatly contributed to the investigation of 6mA in fungi, animals and plants (Fu et al., 2015; Greer et al., 2015; Zhang et al., 2015; Koziol et al., 2016; Liu et al., 2016; Wu et al., 2016; Liang et al., 2017; Mondo et al., 2017). Recent studies revealed that 6mA abundance is vari- able, and it is relative higher in Chlamydomonas and early- diverging fungi species than other eukaryotes. The distribution pat- terns of 6mA and their functions are not quite conserved among or- ganisms. 6mA was found enriched near the transcription start sites (TSS) in Chlamydomonas (Fu et al., 2015) and at the repeats in Drosophila, Mus musculus and Danio rerio (Zhang et al., 2015; Liu et al., 2016; Wu et al., 2016), and commonly depleted from gene exons in Xenopus laevis and M. musculus (Koziol et al., 2016). In several species, 6mA was associated with transcriptionally active genes (Fu et al., 2015; Mondo et al., 2017), and it was also found correlated with gene silencing in mammalian embryonic stem cells (Wu et al., 2016).


September 22, 2019

The Phytophthora cactorum genome provides insights into the adaptation to host defense compounds and fungicides.

Phytophthora cactorum is a homothallic oomycete pathogen, which has a wide host range and high capability to adapt to host defense compounds and fungicides. Here we report the 121.5?Mb genome assembly of the P. cactorum using the third-generation single-molecule real-time (SMRT) sequencing technology. It is the second largest genome sequenced so far in the Phytophthora genera, which contains 27,981 protein-coding genes. Comparison with other Phytophthora genomes showed that P. cactorum had a closer relationship with P. parasitica, P. infestans and P. capsici. P. cactorum has similar gene families in the secondary metabolism and pathogenicity-related effector proteins compared with other oomycete species, but specific gene families associated with detoxification enzymes and carbohydrate-active enzymes (CAZymes) underwent expansion in P. cactorum. P. cactorum had a higher utilization and detoxification ability against ginsenosides-a group of defense compounds from Panax notoginseng-compared with the narrow host pathogen P. sojae. The elevated expression levels of detoxification enzymes and hydrolase activity-associated genes after exposure to ginsenosides further supported that the high detoxification and utilization ability of P. cactorum play a crucial role in the rapid adaptability of the pathogen to host plant defense compounds and fungicides.


September 22, 2019

Genome-wide identification of simple sequence repeats and development of polymorphic SSR markers for genetic studies in tea plant (Camellia sinensis)

The tea plant (Camellia sinensis (L.) O. Kuntze) is one of the most popular non-alcoholic beverage crops worldwide. The availability of complete genome sequences for the Camellia sinensis var. ‘Shuchazao’ has provided the opportunity to identify all types of simple sequence repeat (SSR) markers by genome-wide scan. In this study, a total of 667,980 SSRs were identified in the ~?3.08 Gb genome, with an overall density of 216.88 SSRs/Mb. Dinucleotide repeats were predominant among microsatellites (72.25%), followed by trinucleotide repeats (15.35%), while the remaining SSRs accounted for less than 13%. The motif AG/CT (49.96%) and AT/TA (40.14%) were the most and the second most abundant among all identified SSR motifs, respectively; meanwhile, AAT/ATT (41.29%) and AAAT/ATTT (67.47%) were the most common among trinucleotides and tetranucleotides, respectively. A total of 300 primer pairs were designed to screen six tea cultivars for polymorphisms of SSR markers using the five selected repeat types of microsatellite sequences. The resulting 96 SSR markers that yielded polymorphic and unambiguous bands were further deployed on 47 tea cultivars for genetic diversity assessment, demonstrating high polymorphism of these SSR markers. Remarkably, the dendrogram revealed that the phylogenetic relationships among these tea cultivars are highly consistent with their genetic backgrounds or places of origin. The identified genome-wide SSRs and newly developed SSR markers will provide a powerful means for genetic researches in tea plant, including genetic diversity and evolutionary origin analysis, fingerprinting, QTL mapping, and marker-assisted selection for breeding.


September 22, 2019

DNA N6-adenine methylation in Arabidopsis thaliana.

DNA methylation on N6-adenine (6mA) has recently been found to be a potentially epigenetic mark in several unicellular and multicellular eukaryotes. However, its distribution patterns and potential functions in land plants, which are primary producers for most ecosystems, remain largely unknown. Here we report global profiling of 6mA sites at single-nucleotide resolution in the genome of Arabidopsis thaliana at different developmental stages using single-molecule real-time sequencing. 6mA sites are widely distributed across the Arabidopsis genome and enriched over the pericentromeric heterochromatin regions. 6mA occurs more frequently in gene bodies than intergenic regions. Analysis of 6mA methylomes and RNA sequencing data demonstrates that 6mA frequency positively correlates with the gene expression level and the transition from vegetative to reproductive growth in Arabidopsis. Our results uncover 6mA as a DNA mark associated with actively expressed genes in Arabidopsis, suggesting that 6mA serves as a hitherto unknown epigenetic mark in land plants. Copyright © 2018 Elsevier Inc. All rights reserved.


September 22, 2019

Draft genome sequence of Camellia sinensis var. sinensis provides insights into the evolution of the tea genome and tea quality.

Tea, one of the world’s most important beverage crops, provides numerous secondary metabolites that account for its rich taste and health benefits. Here we present a high-quality sequence of the genome of tea, Camellia sinensis var. sinensis (CSS), using both Illumina and PacBio sequencing technologies. At least 64% of the 3.1-Gb genome assembly consists of repetitive sequences, and the rest yields 33,932 high-confidence predictions of encoded proteins. Divergence between two major lineages, CSS and Camellia sinensis var. assamica (CSA), is calculated to ~0.38 to 1.54 million years ago (Mya). Analysis of genic collinearity reveals that the tea genome is the product of two rounds of whole-genome duplications (WGDs) that occurred ~30 to 40 and ~90 to 100 Mya. We provide evidence that these WGD events, and subsequent paralogous duplications, had major impacts on the copy numbers of secondary metabolite genes, particularly genes critical to producing three key quality compounds: catechins, theanine, and caffeine. Analyses of transcriptome and phytochemistry data show that amplification and transcriptional divergence of genes encoding a large acyltransferase family and leucoanthocyanidin reductases are associated with the characteristic young leaf accumulation of monomeric galloylated catechins in tea, while functional divergence of a single member of the glutamine synthetase gene family yielded theanine synthetase. This genome sequence will facilitate understanding of tea genome evolution and tea metabolite pathways, and will promote germplasm utilization for breeding improved tea varieties. Copyright © 2018 the Author(s). Published by PNAS.


September 22, 2019

De novo genome assembly of the red silk cotton tree (Bombax ceiba).

Bombax ceiba L. (the red silk cotton tree) is a large deciduous tree that is distributed in tropical and sub-tropical Asia as well as northern Australia. It has great economic and ecological importance, with several applications in industry and traditional medicine in many Asian countries. To facilitate further utilization of this plant resource, we present here the draft genome sequence for B. ceiba.We assembled a relatively intact genome of B. ceiba by using PacBio single-molecule sequencing and BioNano optical mapping technologies. The final draft genome is approximately 895 Mb long, with contig and scaffold N50 sizes of 1.0 Mb and 2.06 Mb, respectively.The high-quality draft genome assembly of B. ceiba will be a valuable resource enabling further genetic improvement and more effective use of this tree species.


September 22, 2019

Phenotypic diversification by enhanced genome restructuring after induction of multiple DNA double-strand breaks.

DNA double-strand break (DSB)-mediated genome rearrangements are assumed to provide diverse raw genetic materials enabling accelerated adaptive evolution; however, it remains unclear about the consequences of massive simultaneous DSB formation in cells and their resulting phenotypic impact. Here, we establish an artificial genome-restructuring technology by conditionally introducing multiple genomic DSBs in vivo using a temperature-dependent endonuclease TaqI. Application in yeast and Arabidopsis thaliana generates strains with phenotypes, including improved ethanol production from xylose at higher temperature and increased plant biomass, that are stably inherited to offspring after multiple passages. High-throughput genome resequencing revealed that these strains harbor diverse rearrangements, including copy number variations, translocations in retrotransposons, and direct end-joinings at TaqI-cleavage sites. Furthermore, large-scale rearrangements occur frequently in diploid yeasts (28.1%) and tetraploid plants (46.3%), whereas haploid yeasts and diploid plants undergo minimal rearrangement. This genome-restructuring system (TAQing system) will enable rapid genome breeding and aid genome-evolution studies.


September 22, 2019

The evolution of genomic and epigenomic features in two Pleurotus fungi.

Pleurotus tuoliensis (Bailinggu, designated Pt) and P. eryngii var. eryngii (Xingbaogu, designated Pe) are highly valued edible mushrooms. We report de novo assemblies of high-quality genomes for both mushrooms based on PacBio RS II sequencing and annotation of all identified genes. A comparative genomics analysis between Pt and Pe with P. ostreatus as an outgroup taxon revealed extensive genomic divergence between the two mushroom genomes primarily due to the rapid gain of taxon-specific genes and disruption of synteny in either taxon. The re-appraised phylogenetic relationship between Pt and Pe at the genome-wide level validates earlier proposals to designate Pt as an independent species. Variation of the identified wood-decay-related gene content can largely explain the variable adaptation and host specificity of the two mushrooms. On the basis of the two assembled genome sequences, methylomes and the regulatory roles of DNA methylation in gene expression were characterized and compared. The genome, methylome and transcriptome data of these two important mushrooms will provide valuable information for advancing our understanding of the evolution of Pleurotus and related genera and for facilitating genome- and epigenome-based strategies for mushroom breeding.


September 22, 2019

Genomic analyses of unique carbohydrate and phytohormone metabolism in the macroalga Gracilariopsis lemaneiformis (Rhodophyta).

Red algae are economically valuable for food and in industry. However, their genomic information is limited, and the genomic data of only a few species of red algae have been sequenced and deposited recently. In this study, we annotated a draft genome of the macroalga Gracilariopsis lemaneiformis (Gracilariales, Rhodophyta).The entire 88.98 Mb genome of Gp. lemaneiformis 981 was generated from 13,825 scaffolds (=500 bp) with an N50 length of 30,590 bp, accounting for approximately 91% of this algal genome. A total of 38.73 Mb of scaffold sequences were repetitive, and 9281 protein-coding genes were predicted. A phylogenomic analysis of 20 genomes revealed the relationship among the Chromalveolata, Rhodophyta, Chlorophyta and higher plants. Homology analysis indicated phylogenetic proximity between Gp. lemaneiformis and Chondrus crispus. The number of enzymes related to the metabolism of carbohydrates, including agar, glycoside hydrolases, glycosyltransferases, was abundant. In addition, signaling pathways associated with phytohormones such as auxin, salicylic acid and jasmonates are reported for the first time for this alga.We sequenced and analyzed a draft genome of the red alga Gp. lemaneiformis, and revealed its carbohydrate metabolism and phytohormone signaling characteristics. This work will be helpful in research on the functional and comparative genomics of the order Gracilariales and will enrich the genomic information on marine algae.


September 22, 2019

Genome-wide analysis of the NAC transcription factor family and their expression during the development and ripening of the Fragaria × ananassa fruits.

NAC proteins are a family of transcription factors which have a variety of important regulatory roles in plants. They present a very well conserved group of NAC subdomains in the N-terminal region and a highly variable domain at the C-terminus. Currently, knowledge concerning NAC family in the strawberry plant remains very limited. In this work, we analyzed the NAC family of Fragaria vesca, and a total of 112 NAC proteins were identified after we curated the annotations from the version 4.0.a1 genome. They were placed into the ligation groups (pseudo-chromosomes) and described its physicochemical and genetic features. A microarray transcriptomic analysis showed six of them expressed during the development and ripening of the Fragaria x ananassa fruit. Their expression patterns were studied in fruit (receptacle and achenes) in different stages of development and in vegetative tissues. Also, the expression level under different hormonal treatments (auxins, ABA) and drought stress was investigated. In addition, they were clustered with other NAC transcription factor with known function related to growth and development, senescence, fruit ripening, stress response, and secondary cell wall and vascular development. Our results indicate that these six strawberry NAC proteins could play different important regulatory roles in the process of development and ripening of the fruit, providing the basis for further functional studies and the selection for NAC candidates suitable for biotechnological applications.


September 22, 2019

Inpactor, integrated and parallel analyzer and classifier of LTR retrotransposons and its application for pineapple LTR retrotransposons diversity and dynamics.

One particular class of Transposable Elements (TEs), called Long Terminal Repeats (LTRs), retrotransposons, comprises the most abundant mobile elements in plant genomes. Their copy number can vary from several hundreds to up to a few million copies per genome, deeply affecting genome organization and function. The detailed classification of LTR retrotransposons is an essential step to precisely understand their effect at the genome level, but remains challenging in large-sized genomes, requiring the use of optimized bioinformatics tools that can take advantage of supercomputers. Here, we propose a new tool: Inpactor, a parallel and scalable pipeline designed to classify LTR retrotransposons, to identify autonomous and non-autonomous elements, to perform RT-based phylogenetic trees and to analyze their insertion times using High Performance Computing (HPC) techniques. Inpactor was tested on the classification and annotation of LTR retrotransposons in pineapple, a recently-sequenced genome. The pineapple genome assembly comprises 44% of transposable elements, of which 23% were classified as LTR retrotransposons. Exceptionally, 16.4% of the pineapple genome assembly corresponded to only one lineage of the Gypsy superfamily: Del, suggesting that this particular lineage has undergone a significant increase in its copy numbers. As demonstrated for the pineapple genome, Inpactor provides comprehensive data of LTR retrotransposons’ classification and dynamics, allowing a fine understanding of their contribution to genome structure and evolution. Inpactor is available at https://github.com/simonorozcoarias/Inpactor.


September 22, 2019

A transposable element annotation pipeline and expression analysis reveal potentially active elements in the microalga Tisochrysis lutea.

Transposable elements (TEs) are mobile DNA sequences known as drivers of genome evolution. Their impacts have been widely studied in animals, plants and insects, but little is known about them in microalgae. In a previous study, we compared the genetic polymorphisms between strains of the haptophyte microalga Tisochrysis lutea and suggested the involvement of active autonomous TEs in their genome evolution.To identify potentially autonomous TEs, we designed a pipeline named PiRATE (Pipeline to Retrieve and Annotate Transposable Elements, download: https://doi.org/10.17882/51795 ), and conducted an accurate TE annotation on a new genome assembly of T. lutea. PiRATE is composed of detection, classification and annotation steps. Its detection step combines multiple, existing analysis packages representing all major approaches for TE detection and its classification step was optimized for microalgal genomes. The efficiency of the detection and classification steps was evaluated with data on the model species Arabidopsis thaliana. PiRATE detected 81% of the TE families of A. thaliana and correctly classified 75% of them. We applied PiRATE to T. lutea genomic data and established that its genome contains 15.89% Class I and 4.95% Class II TEs. In these, 3.79 and 17.05% correspond to potentially autonomous and non-autonomous TEs, respectively. Annotation data was combined with transcriptomic and proteomic data to identify potentially active autonomous TEs. We identified 17 expressed TE families and, among these, a TIR/Mariner and a TIR/hAT family were able to synthesize their transposase. Both these TE families were among the three highest expressed genes in a previous transcriptomic study and are composed of highly similar copies throughout the genome of T. lutea. This sum of evidence reveals that both these TE families could be capable of transposing or triggering the transposition of potential related MITE elements.This manuscript provides an example of a de novo transposable element annotation of a non-model organism characterized by a fragmented genome assembly and belonging to a poorly studied phylum at genomic level. Integration of multi-omics data enabled the discovery of potential mobile TEs and opens the way for new discoveries on the role of these repeated elements in genomic evolution of microalgae.


September 22, 2019

Nucleotide-binding resistance gene signatures in sugar beet, insights from a new reference genome.

Nucleotide-binding (NB-ARC), leucine-rich-repeat genes (NLRs) account for 60.8% of resistance (R) genes molecularly characterized from plants. NLRs exist as large gene families prone to tandem duplication and transposition, with high sequence diversity among crops and their wild relatives. This diversity can be a source of new disease resistance, but difficulty in distinguishing specific sequences from homologous gene family members hinders characterization of resistance for improving crop varieties. Current genome sequencing and assembly technologies, especially those using long-read sequencing, are improving resolution of repeat-rich genomic regions and clarifying locations of duplicated genes, such as NLRs. Using the conserved NB-ARC domain as a model, 231 tentative NB-ARC loci were identified in a highly contiguous genome assembly of sugar beet, revealing diverged and truncated NB-ARC signatures as well as full-length sequences. The NB-ARC-associated proteins contained NLR resistance gene domains, including TIR, CC, and LRR, as well as other integrated domains. Phylogenetic relationships of partial and complete domains were determined, and patterns of physical clustering in the genome were evaluated. Comparison of sugar beet NB-ARC domains to validated R genes from monocots and eudicots suggested extensive B. vulgaris-specific subfamily expansions. The NLR landscape in the rhizomania resistance conferring Rz region of Chromosome 3 was characterized, identifying 26 NLR-like sequences spanning 20 MB. This work presents the first detailed view of NLR family composition in a member of the Caryophyllales, builds a foundation for additional disease resistance work in B. vulgaris, and demonstrates an additional nucleic-acid-based method for NLR prediction in non-model plant species. This article is protected by copyright. All rights reserved.This article is protected by copyright. All rights reserved.


September 22, 2019

Identification of a leucine-rich repeat receptor-like serine/threonine-protein kinase as a candidate gene for Rvi12 (Vb)-based apple scab resistance

Apple scab caused by Venturia inaequalis is the most important fungal disease of apples (Malus × domestica). Currently, the disease is controlled by up to 15 fungicide applications to the crop per year. Resistant apple cultivars will help promote the sustainable control of scab in commercial orchards. The breakdown of the Rvi6 (Vf) major-gene based resistance, the most used resistance gene in apple breeding, prompted the identification and characterization of new scab resistance genes. By using a large segregating population, the Rvi12 scab resistance gene was previously mapped to a genetic location flanked by molecular markers SNP_23.599 and SNP_24.482. Starting from these markers, utilizing chromosome walking of a Hansen’s baccata #2 (HB2) BAC-library; a single BAC clone spanning the Rvi12 interval was identified. Following Pacific Biosciences (PacBio) RS II sequencing and the use of the hierarchical genome assembly process (HGAP) assembly of the BAC clone sequence, the Rvi12 resistance locus was localized to a 62.3-kb genomic region. Gene prediction and in silico characterization identified a single candidate resistance gene. The gene, named here as Rvi12_Cd5, belongs to the LRR receptor-like serine/threonine-protein kinase family. In silico comparison of the resistance allele from HB2 and the susceptible allele from Golden Delicious (GD) identified the presence of an additional intron in the HB2 allele. Conserved domain analysis identified the presence of four additional LRR motifs in the susceptible allele compared to the resistance allele. The constitutive expression of Rvi12_Cd5 in HB2, together with its structural similarity to known resistance genes, makes it the most likely candidate for Rvi12 scab resistance in apple.


September 22, 2019

Amplification and adaptation of centromeric repeats in polyploid switchgrass species.

Centromeres in most higher eukaryotes are composed of long arrays of satellite repeats from a single satellite repeat family. Why centromeres are dominated by a single satellite repeat and how the satellite repeats originate and evolve are among the most intriguing and long-standing questions in centromere biology. We identified eight satellite repeats in the centromeres of tetraploid switchgrass (Panicum virgatum). Seven repeats showed characteristics associated with classical centromeric repeats with monomeric lengths ranging from 166 to 187 bp. Interestingly, these repeats share an 80-bp DNA motif. We demonstrate that this 80-bp motif may dictate translational and rotational phasing of the centromeric repeats with the cenH3 nucleosomes. The sequence of the last centromeric repeat, Pv156, is identical to the 5S ribosomal RNA genes. We demonstrate that a 5S ribosomal RNA gene array was recruited to be the functional centromere for one of the switchgrass chromosomes. Our findings reveal that certain types of satellite repeats, which are associated with unique sequence features and are composed of monomers in mono-nucleosomal length, are favorable for centromeres. Centromeric repeats may undergo dynamic amplification and adaptation before the centromeres in the same species become dominated by the best adapted satellite repeat.© 2018 The Authors. New Phytologist © 2018 New Phytologist Trust.


Talk with an expert

If you have a question, need to check the status of an order, or are interested in purchasing an instrument, we're here to help.