Menu
September 22, 2019

A high-quality genome sequence of Rosa chinensis to elucidate ornamental traits.

Rose is the world’s most important ornamental plant, with economic, cultural and symbolic value. Roses are cultivated worldwide and sold as garden roses, cut flowers and potted plants. Roses are outbred and can have various ploidy levels. Our objectives were to develop a high-quality reference genome sequence for the genus Rosa by sequencing a doubled haploid, combining long and short reads, and anchoring to a high-density genetic map, and to study the genome structure and genetic basis of major ornamental traits. We produced a doubled haploid rose line (‘HapOB’) from Rosa chinensis ‘Old Blush’ and generated a rose genome assembly anchored to seven pseudo-chromosomes (512?Mb with N50 of 3.4?Mb and 564 contigs). The length of 512?Mb represents 90.1-96.1% of the estimated haploid genome size of rose. Of the assembly, 95% is contained in only 196 contigs. The anchoring was validated using high-density diploid and tetraploid genetic maps. We delineated hallmark chromosomal features, including the pericentromeric regions, through annotation of transposable element families and positioned centromeric repeats using fluorescent in situ hybridization. The rose genome displays extensive synteny with the Fragaria vesca genome, and we delineated only two major rearrangements. Genetic diversity was analysed using resequencing data of seven diploid and one tetraploid Rosa species selected from various sections of the genus. Combining genetic and genomic approaches, we identified potential genetic regulators of key ornamental traits, including prickle density and the number of flower petals. A rose APETALA2/TOE homologue is proposed to be the major regulator of petal number in rose. This reference sequence is an important resource for studying polyploidization, meiosis and developmental processes, as we demonstrated for flower and prickle development. It will also accelerate breeding through the development of molecular markers linked to traits, the identification of the genes underlying them and the exploitation of synteny across Rosaceae.


September 22, 2019

Stendomycin and pantomycin are identical natural products: Preparation of a functionalized bioactive analogue.

The natural products pantomycin and stendomycin were both reported as antimicrobial agents. We demonstrate by gene cluster analysis, LC-MS analysis, and isolation that these polypeptides are identical, and we identify previously unknown congeners. We show that stendomycin can be chemically modified at its electrophilic dehydrobutyrine moiety yielding the first bioactive analogue of this natural product which can undergo additional functionalization. This compound may be a valuable starting point for molecular probe development, and we invite its distribution to the scientific community.


September 22, 2019

High-quality assembly of the reference genome for scarlet sage, Salvia splendens, an economically important ornamental plant.

Salvia splendens Ker-Gawler, scarlet or tropical sage, is a tender herbaceous perennial widely introduced and seen in public gardens all over the world. With few molecular resources, breeding is still restricted to traditional phenotypic selection, and the genetic mechanisms underlying phenotypic variation remain unknown. Hence, a high-quality reference genome will be very valuable for marker-assisted breeding, genome editing, and molecular genetics.We generated 66 Gb and 37 Gb of raw DNA sequences, respectively, from whole-genome sequencing of a largely homozygous scarlet sage inbred line using Pacific Biosciences (PacBio) single-molecule real-time and Illumina HiSeq sequencing platforms. The PacBio de novo assembly yielded a final genome with a scaffold N50 size of 3.12 Mb and a total length of 808 Mb. The repetitive sequences identified accounted for 57.52% of the genome sequence, and ?54,008 protein-coding genes were predicted collectively with ab initio and homology-based gene prediction from the masked genome. The divergence time between S. splendens and Salvia miltiorrhiza was estimated at 28.21 million years ago (Mya). Moreover, 3,797 species-specific genes and 1,187 expanded gene families were identified for the scarlet sage genome.We provide the first genome sequence and gene annotation for the scarlet sage. The availability of these resources will be of great importance for further breeding strategies, genome editing, and comparative genomics among related species.


September 22, 2019

N6-methyladenine DNA modification in the human genome.

DNA N6-methyladenine (6mA) modification is the most prevalent DNA modification in prokaryotes, but whether it exists in human cells and whether it plays a role in human diseases remain enigmatic. Here, we showed that 6mA is extensively present in the human genome, and we cataloged 881,240 6mA sites accounting for ~0.051% of the total adenines. [G/C]AGG[C/T] was the most significantly associated motif with 6mA modification. 6mA sites were enriched in the coding regions and mark actively transcribed genes in human cells. DNA 6mA and N6-demethyladenine modification in the human genome were mediated by methyltransferase N6AMT1 and demethylase ALKBH1, respectively. The abundance of 6mA was significantly lower in cancers, accompanied by decreased N6AMT1 and increased ALKBH1 levels, and downregulation of 6mA modification levels promoted tumorigenesis. Collectively, our results demonstrate that DNA 6mA modification is extensively present in human cells and the decrease of genomic DNA 6mA promotes human tumorigenesis. Copyright © 2018 Elsevier Inc. All rights reserved.


September 22, 2019

Comparative analysis reveals unexpected genome features of newly isolated Thraustochytrids strains: on ecological function and PUFAs biosynthesis.

Thraustochytrids are unicellular fungal-like marine protists with ubiquitous existence in marine environments. They are well-known for their ability to produce high-valued omega-3 polyunsaturated fatty acids (?-3-PUFAs) (e.g., docosahexaenoic acid (DHA)) and hydrolytic enzymes. Thraustochytrid biomass has been estimated to surpass that of bacterioplankton in both coastal and oceanic waters indicating they have an important role in microbial food-web. Nevertheless, the molecular pathway and regulatory network for PUFAs production and the molecular mechanisms underlying ecological functions of thraustochytrids remain largely unknown.The genomes of two thraustochytrids strains (Mn4 and SW8) with ability to produce DHA were sequenced and assembled with a hybrid sequencing approach utilizing Illumina short paired-end reads and Pacific Biosciences long reads to generate a highly accurate genome assembly. Phylogenomic and comparative genomic analyses found that DHA-producing thraustochytrid strains were highly similar and possessed similar gene content. Analysis of the conventional fatty acid synthesis (FAS) and the polyketide synthase (PKS) systems for PUFAs production only detected incomplete and fragmentary pathways in the genome of these two strains. Surprisingly, secreted carbohydrate active enzymes (CAZymes) were found to be significantly depleted in the genomes of these 2 strains as compared to other sequenced relatives. Furthermore, these two strains possess an expanded gene repertoire for signal transduction and self-propelled movement, which could be important for their adaptations to dynamic marine environments.Our results demonstrate the possibility of a third PUFAs synthesis pathway besides previously described FAS and PKS pathways encoded in the genome of these two thraustochytrid strains. Moreover, lack of a complete set of hydrolytic enzymatic machinery for degrading plant-derived organic materials suggests that these two DHA-producing strains play an important role as a nutritional source rather than a nutrient-producer in marine microbial-food web. Results of this study suggest the existence of two types of saprobic thraustochytrids in the world’s ocean. The first group, which does not produce cellulosic enzymes and live as ‘left-over’ scavenger of bacterioplankton, serves as a dietary source for the plankton of higher trophic levels and the other possesses capacity to live on detrital organic matters in the marine ecosystems.


September 22, 2019

Evidence of non-tandemly repeated rDNAs and their intragenomic heterogeneity in Rhizophagus irregularis

Arbuscular mycorrhizal fungus (AMF) species are some of the most widespread symbionts of land plants. Our much improved reference genome assembly of a model AMF, Rhizophagus irregularis DAOM-181602 (total contigs?=?210), facilitated a discovery of repetitive elements with unusual characteristics. R. irregularis has only ten or 11 copies of complete 45S rDNAs, whereas the general eukaryotic genome has tens to thousands of rDNA copies. R. irregularis rDNAs are highly heterogeneous and lack a tandem repeat structure. These findings provide evidence for the hypothesis that rDNA heterogeneity depends on the lack of tandem repeat structures. RNA-Seq analysis confirmed that all rDNA variants are actively transcribed. Observed rDNA/rRNA polymorphisms may modulate translation by using different ribosomes depending on biotic and abiotic interactions. The non-tandem repeat structure and intragenomic heterogeneity of AMF rDNA/rRNA may facilitate successful adaptation to various environmental conditions, increasing host compatibility of these symbiotic fungi.


September 22, 2019

GC content elevates mutation and recombination rates in the yeast Saccharomyces cerevisiae.

The chromosomes of many eukaryotes have regions of high GC content interspersed with regions of low GC content. In the yeast Saccharomyces cerevisiae, high-GC regions are often associated with high levels of meiotic recombination. In this study, we constructed URA3 genes that differ substantially in their base composition [URA3-AT (31% GC), URA3-WT (43% GC), and URA3-GC (63% GC)] but encode proteins with the same amino acid sequence. The strain with URA3-GC had an approximately sevenfold elevated rate of ura3 mutations compared with the strains with URA3-WT or URA3-AT About half of these mutations were single-base substitutions and were dependent on the error-prone DNA polymerase ?. About 30% were deletions or duplications between short (5-10 base) direct repeats resulting from DNA polymerase slippage. The URA3-GC gene also had elevated rates of meiotic and mitotic recombination relative to the URA3-AT or URA3-WT genes. Thus, base composition has a substantial effect on the basic parameters of genome stability and evolution. Copyright © 2018 the Author(s). Published by PNAS.


September 22, 2019

Horizontal transfer of BovB and L1 retrotransposons in eukaryotes.

Transposable elements (TEs) are mobile DNA sequences, colloquially known as jumping genes because of their ability to replicate to new genomic locations. TEs can jump between organisms or species when given a vector of transfer, such as a tick or virus, in a process known as horizontal transfer. Here, we propose that LINE-1 (L1) and Bovine-B (BovB), the two most abundant TE families in mammals, were initially introduced as foreign DNA via ancient horizontal transfer events.Using analyses of 759 plant, fungal and animal genomes, we identify multiple possible L1 horizontal transfer events in eukaryotic species, primarily involving Tx-like L1s in marine eukaryotes. We also extend the BovB paradigm by increasing the number of estimated transfer events compared to previous studies, finding new parasite vectors of transfer such as bed bug, leech and locust, and BovB occurrences in new lineages such as bat and frog. Given that these transposable elements have colonised more than half of the genome sequence in today’s mammals, our results support a role for horizontal transfer in causing long-term genomic change in new host organisms.We describe extensive horizontal transfer of BovB retrotransposons and provide the first evidence that L1 elements can also undergo horizontal transfer. With the advancement of genome sequencing technologies and bioinformatics tools, we anticipate our study to be a valuable resource for inferring horizontal transfer from large-scale genomic data.


September 22, 2019

The genome assembly of the fungal pathogen Pyrenochaeta lycopersici from Single-Molecule Real-Time sequencing sheds new light on its biological complexity.

The first draft genome sequencing of the non-model fungal pathogen Pyrenochaeta lycopersici showed an expansion of gene families associated with heterokaryon incompatibility and lacking of mating-type genes, providing insights into the genetic basis of this “imperfect” fungus which lost the ability to produce the sexual stage. However, due to the Illumina short-read technology, the draft genome was too fragmented to allow a comprehensive characterization of the genome, especially of the repetitive sequence fraction. In this work, the sequencing of another P. lycopersici isolate using long-read Single Molecule Real-Time sequencing technology was performed with the aim of obtaining a gapless genome. Indeed, a gapless genome assembly of 62.7 Mb was obtained, with a fraction of repetitive sequences representing 30% of the total bases. The gene content of the two P. lycopersici isolates was very similar, and the large difference in genome size (about 8 Mb) might be attributable to the high fraction of repetitive sequences detected for the new sequenced isolate. The role of repetitive elements, including transposable elements, in modulating virulence effectors is well established in fungal plant pathogens. Moreover, transposable elements are of fundamental importance in creating and re-modelling genes, especially in imperfect fungi. Their abundance in P. lycopersici, together with the large expansion of heterokaryon incompatibility genes in both sequenced isolates, suggest the presence of possible mechanisms alternative to gene re-assorting mediated by sexual recombination. A quite large fraction (~9%) of repetitive elements in P. lycopersici, has no homology with known classes, strengthening this hypothesis. The availability of a gapless genome of P. lycopersici allowed the in-depth analysis of its genome content, by annotating functional genes and TEs. This goal will be an important resource for shedding light on the evolution of the reproductive and pathogenic behaviour of this soilborne pathogen and the onset of a possible speciation within this species.


September 22, 2019

The Chara genome: Secondary complexity and implications for plant terrestrialization.

Land plants evolved from charophytic algae, among which Charophyceae possess the most complex body plans. We present the genome of Chara braunii; comparison of the genome to those of land plants identified evolutionary novelties for plant terrestrialization and land plant heritage genes. C. braunii employs unique xylan synthases for cell wall biosynthesis, a phragmoplast (cell separation) mechanism similar to that of land plants, and many phytohormones. C. braunii plastids are controlled via land-plant-like retrograde signaling, and transcriptional regulation is more elaborate than in other algae. The morphological complexity of this organism may result from expanded gene families, with three cases of particular note: genes effecting tolerance to reactive oxygen species (ROS), LysM receptor-like kinases, and transcription factors (TFs). Transcriptomic analysis of sexual reproductive structures reveals intricate control by TFs, activity of the ROS gene network, and the ancestral use of plant-like storage and stress protection proteins in the zygote. Copyright © 2018 Elsevier Inc. All rights reserved.


September 22, 2019

The complete mitochondrial genome of the Basidiomycete edible fungus Hypsizygus marmoreus

The complete mitochondrial genome of the edible fungus Hypsizygus marmoreus was published in this paper. It was determined using Pacbio and Illumina sequencing. The complete mitochondrial DNA (mtDNA) is 106,417?bp in length with a GC content of 31.74%, which was the fourth large mitogenome in Agaricales. The circular mitogenome encoded 67 protein-coding genes and one ribosomal RNAs (rns). Among these genes, 13 conserved protein-coding genes were determined in the genome, including 6 subunits of NAD dehydrogenase (nad1-4, 4L and 6), three cytochrome oxidases (cox1-3), one apocytochrome b (cob) and three ATP synthases (atp6, apt 8 and apt 9). The phylogenic analysis confirmed that H. marmoreus (Lyophyllaceae) clustered together with Tricholoma matsutake (Tricholomataceae).


September 22, 2019

Nine draft genome sequences of Claviceps purpurea s.lat., including C. arundinis, C. humidiphila, and C. cf. spartinae, pseudomolecules for the pitch canker pathogen Fusarium circinatum, draft genome of Davidsoniella eucalypti, Grosmannia galeiformis, Quambalaria eucalypti, and Teratosphaeria destructans.

This genome announcement includes draft genomes from Claviceps purpurea s.lat., including C. arundinis, C. humidiphila and C. cf. spartinae. The draft genomes of Davidsoniella eucalypti, Quambalaria eucalypti and Teratosphaeria destructans, all three important eucalyptus pathogens, are presented. The insect associate Grosmannia galeiformis is also described. The pine pathogen genome of Fusarium circinatum has been assembled into pseudomolecules, based on additional sequence data and by harnessing the known synteny within the Fusarium fujikuroi species complex. This new assembly of the F. circinatum genome provides 12 pseudomolecules that correspond to the haploid chromosome number of F. circinatum. These are comparable to other chromosomal assemblies within the FFSC and will enable more robust genomic comparisons within this species complex.


September 22, 2019

Fusarium species complex causing Pokkah Boeng in China

Sugarcane is one of the most important crops for sugar production in sugarcane-growing areas. Many biotic and abiotic stresses affected the sugarcane production which leads to severe losses. Pokkah boeng is now playing a very important role due to its economic threats. Currently, the occurrence and rigorousness of pokkah boeng disease have been spread like wildfire from major sugarcane-growing countries. Pokkah boeng is a fungal disease that can cause serious yield losses in susceptible varieties. Infection of the disease is caused either by spores or ascospores. It may cause serious yield losses in commercial plantings. However, there have been many reported outbreaks of the disease which have looked spectacular but have caused trade and industry loss. Fusarium species complex is the major causal agent of this disease around the world, but some researchers have documented the increased importance of Fusarium. Three Fusarium species have been identified to cause the sugarcane pokkah boeng disease in China. Moreover, Fusarium may be accompanied of its mycotoxin production, genomic sequencing, and association with nitrogen application in China. Many studies on disease investigations, breeding of disease-resistant varieties, and strategy of disease control have also been carried out in China.


September 22, 2019

A chromosome scale assembly of the model desiccation tolerant grass Oropetium thomaeum

Oropetium thomaeum is an emerging model for desiccation tolerance and genome size evolution in grasses. A high-quality draft genome of Oropetium was recently sequenced, but the lack of a chromosome scale assembly has hindered comparative analyses and downstream functional genomics. Here, we reassembled Oropetium, and anchored the genome into ten chromosomes using Hi-C based chromatin interactions. A combination of high-resolution RNAseq data and homology-based gene prediction identified thousands of new, conserved gene models that were absent from the V1 assembly. This includes thousands of new genes with high expression across a desiccation timecourse. The sorghum and Oropetium genomes have a surprising degree of chromosome-level collinearity, and several chromosome pairs have near perfect synteny. Other chromosomes are collinear in the gene rich chromosome arms but have experienced pericentric translocations. Together, these resources will be useful for the grass comparative genomic community and further establish Oropetium as a model resurrection plant.


September 22, 2019

Analysis of the draft genome of the red seaweed Gracilariopsis chorda provides insights into genome size evolution in Rhodophyta.

Red algae (Rhodophyta) underwent two phases of large-scale genome reduction during their early evolution. The red seaweeds did not attain genome sizes or gene inventories typical of other multicellular eukaryotes. We generated a high-quality 92.1 Mb draft genome assembly from the red seaweed Gracilariopsis chorda, including methylation and small (s)RNA data. We analyzed these and other Archaeplastida genomes to address three questions: 1) What is the role of repeats and transposable elements (TEs) in explaining Rhodophyta genome size variation, 2) what is the history of genome duplication and gene family expansion/reduction in these taxa, and 3) is there evidence for TE suppression in red algae? We find that the number of predicted genes in red algae is relatively small (4,803-13,125 genes), particularly when compared with land plants, with no evidence of polyploidization. Genome size variation is primarily explained by TE expansion with the red seaweeds having the largest genomes. Long terminal repeat elements and DNA repeats are the major contributors to genome size growth. About 8.3% of the G. chorda genome undergoes cytosine methylation among gene bodies, promoters, and TEs, and 71.5% of TEs contain methylated-DNA with 57% of these regions associated with sRNAs. These latter results suggest a role for TE-associated sRNAs in RNA-dependent DNA methylation to facilitate silencing. We postulate that the evolution of genome size in red algae is the result of the combined action of TE spread and the concomitant emergence of its epigenetic suppression, together with other important factors such as changes in population size.


Talk with an expert

If you have a question, need to check the status of an order, or are interested in purchasing an instrument, we're here to help.