Menu
September 22, 2019

Genome analysis of Taraxacum kok-saghyz Rodin provides new insights into rubber biosynthesis

The Russian dandelion Taraxacum kok-saghyz Rodin (TKS), a member of the Composite family and a potential alternative source of natural rubber (NR) and inulin, is an ideal model system for studying rubber biosynthesis. Here we present the draft genome of TKS, the first assembled NR-producing weed plant. The draft TKS genome assembly has a length of 1.29 Gb, containing 46,731 predicted protein-coding genes and 68.56% repeats, in which the LTR-RT elements predominantly contribute to the genome enlargement. We analyzed the heterozygous regions/genes, suggesting its possible involvement in inbreeding depression. Through comparative studies between rubber-producing and non-rubber-producing plants, we found that enzymes of the mevalonate (MVA) pathway and rubber elongation might be critical for rubber biosynthesis, and several key isoforms have been isolated showing predominantly expressed in the latex, indicating their crucial functions in rubber biosynthesis. Moreover, for two important families in rubber elongation, the CPT/CPTL and REF/SRPP families, diverse evolutionary tracks have been revealed. These results provide valuable resources and new insights into the mechanism of NR biosynthesis, and facilitate the development of alternative NR producing crops.


September 22, 2019

Aberration or analogy? The atypical plastomes of Geraniaceae

A number of plant groups have been proposed as ideal systems to explore plastid inheritance, plastome evolution and plastome-nuclear genome coevolution. Quick generation times and a compact nuclear genome in Arabidopsis thaliana, the relative ease of plastid isolation from Spinacia oleracea and the tractability of plastid transformation in Nicotiana tabacum are all desirable attributes in a model system; however, these and most other groups all lack novelty in terms of plastome structure and nucleotide sequence evolution. Contemporary sequencing and assembly technologies have facilitated analyses of atypical plastomes and, as predicted by early investigations, Geraniaceae plastomes have experienced unprecedented rearrangements relative to the canonical structure and exhibit remarkably high rates of synonymous and nonsynonymous nucleotide substitutions. While not the only lineage with unusual plastome features, likely no other group represents the array of aberrant phenomena recorded for the family. In this chapter, Geraniaceae plastomes will be discussed and, where possible, compared with other taxa.


September 22, 2019

Genomes of 13 domesticated and wild rice relatives highlight genetic conservation, turnover and innovation across the genus Oryza.

The genus Oryza is a model system for the study of molecular evolution over time scales ranging from a few thousand to 15 million years. Using 13 reference genomes spanning the Oryza species tree, we show that despite few large-scale chromosomal rearrangements rapid species diversification is mirrored by lineage-specific emergence and turnover of many novel elements, including transposons, and potential new coding and noncoding genes. Our study resolves controversial areas of the Oryza phylogeny, showing a complex history of introgression among different chromosomes in the young ‘AA’ subclade containing the two domesticated species. This study highlights the prevalence of functionally coupled disease resistance genes and identifies many new haplotypes of potential use for future crop protection. Finally, this study marks a milestone in modern rice research with the release of a complete long-read assembly of IR 8 ‘Miracle Rice’, which relieved famine and drove the Green Revolution in Asia 50 years ago.


September 22, 2019

Genome sequences of Chlorella sorokiniana UTEX 1602 and Micractinium conductrix SAG 241.80: implications to maltose excretion by a green alga.

Green algae represent a key segment of the global species capable of photoautotrophic-driven biological carbon fixation. Algae partition fixed-carbon into chemical compounds required for biomass, while diverting excess carbon into internal storage compounds such as starch and lipids or, in certain cases, into targeted extracellular compounds. Two green algae were selected to probe for critical components associated with sugar production and release in a model alga. Chlorella sorokiniana UTEX 1602 – which does not release significant quantities of sugars to the extracellular space – was selected as a control to compare with the maltose-releasing Micractinium conductrix SAG 241.80 – which was originally isolated from an endosymbiotic association with the ciliate Paramecium bursaria. Both strains were subjected to three sequencing approaches to assemble their genomes and annotate their genes. This analysis was further complemented with transcriptional studies during maltose release by M. conductrix SAG 241.80 versus conditions where sugar release is minimal. The annotation revealed that both strains contain homologs for the key components of a putative pathway leading to cytosolic maltose accumulation, while transcriptional studies found few changes in mRNA levels for the genes associated with these established intracellular sugar pathways. A further analysis of potential sugar transporters found multiple homologs for SWEETs and tonoplast sugar transporters. The analysis of transcriptional differences revealed a lesser and more measured global response for M. conductrix SAG 241.80 versus C. sorokiniana UTEX 1602 during conditions resulting in sugar release, providing a catalog of genes that might play a role in extracellular sugar transport.© 2017 The Authors The Plant Journal © 2017 John Wiley & Sons Ltd.


September 22, 2019

Comparative genomic analysis reveals the evolution and environmental adaptation strategies of vibrios.

Vibrios are among the most diverse and ecologically important marine bacteria, which have evolved many characteristics and lifestyles to occupy various niches. The relationship between genome features and environmental adaptation strategies is an essential part for understanding the ecological functions of vibrios in the marine system. The advent of complete genome sequencing technology has provided an important method of examining the genetic characteristics of vibrios on the genomic level.Two Vibrio genomes were sequenced and found to occupy many unique orthologues families which absent from the previously genes pool of the complete genomes of vibrios. Comparative genomics analysis found vibrios encompass a steady core-genome and tremendous pan-genome with substantial gene gain and horizontal gene transfer events in the evolutionary history. Evolutionary analysis based on the core-genome tree suggested that V. fischeri emerged ~?385 million years ago, along with the occurrence of cephalopods and the flourish of fish. The relatively large genomes, the high number of 16S rRNA gene copies, and the presence of R-M systems and CRISPR system help vibrios live in various marine environments. Chitin-degrading related genes are carried in nearly all the Vibrio genomes. The number of chitinase genes in vibrios has been extremely expanded compared to which in the most recent ancestor of the genus. The chitinase A genes were estimated to have evolved along with the genus, and have undergone significant purifying selective force to conserve the ancestral state.Vibrios have experienced extremely genome expansion events during their evolutionary history, allowing them to develop various functions to spread globally. Despite their close phylogenetic relationships, vibrios were found to have a tremendous pan-genome with a steady core-genome, which indicates the highly plastic genome of the genus. Additionally, the existence of various chitin-degrading related genes and the expansion of chitinase A in the genus demonstrate the importance of the chitin utilization for vibrios. Defensive systems in the Vibrio genomes may protect them from the invasion of external DNA. These genomic features investigated here provide a better knowledge of how the evolutionary process has forged Vibrio genomes to occupy various niches.


September 22, 2019

Bacterial artificial chromosome clones randomly selected for sequencing reveal genomic differences between soybean cultivars

This study pioneered the use of multiple technologies to combine the bacterial artificial chromosome (BAC) pooling strategy with high-throughput next- and third-generation sequencing technologies to analyse genomic difference. To understand the genetic background of the Chinese soybean cultivar N23601, we built a BAC library and sequenced 10 randomly selected clones followed by de novo assembly. Comparative analysis was conducted against the reference genome of Glycine max var. Williams 82 (2.0). Therefore, our result is an assessment of the reference genome. Our results revealed that 3517 single nucleotide polymorphisms (SNPs) and 662 insertion–deletions (InDels) occurred in ~1.2 Mb of the genomic region and that four of the 10 BAC clones contained 15 large structural variations (72?887?bp) compared with the reference genome. Gene annotation of the reference genome showed that Glyma.18g181000 was missing from the corresponding position of the 10 BAC clones. Additionally, there may be a problem with the assembly of some positions of the reference genome. Several gap regions in the reference genome could be supplemented by using the complete sequence of the 10 BAC clones. We believe that accurate and complete BAC sequence is a valuable resource that contributes to the completeness of the reference genome.


September 22, 2019

Assembly and analysis of a qingke reference genome demonstrate its close genetic relation to modern cultivated barley.

Qingke, the local name of hulless barley in the Tibetan Plateau, is a staple food for Tibetans. The availability of its reference genome sequences could be useful for studies on breeding and molecular evolution. Taking advantage of the third-generation sequencer (PacBio), we de novo assembled a 4.84-Gb genome sequence of qingke, cv. Zangqing320 and anchored a 4.59-Gb sequence to seven chromosomes. Of the 46,787 annotated ‘high-confidence’ genes, 31 564 were validated by RNA-sequencing data of 39 wild and cultivated barley genotypes with wide genetic diversity, and the results were also confirmed by nonredundant protein database from NCBI. As some gaps in the reference genome of Morex were covered in the reference genome of Zangqing320 by PacBio reads, we believe that the Zangqing320 genome provides the useful supplements for the Morex genome. Using the qingke genome as a reference, we conducted a genome comparison, revealing a close genetic relationship between a hulled barley (cv. Morex) and a hulless barley (cv. Zangqing320), which is strongly supported by the low-diversity regions in the two genomes. Considering the origin of Morex from its breeding pedigree, we then demonstrated a close genomic relationship between modern cultivated barley and qingke. Given this genomic relationship and the large genetic diversity between qingke and modern cultivated barley, we propose that qingke could provide elite genes for barley improvement.© 2017 The Authors. Plant Biotechnology Journal published by Society for Experimental Biology and The Association of Applied Biologists and John Wiley & Sons Ltd.


September 22, 2019

Rapid allopolyploid radiation of moonwort ferns (Botrychium; Ophioglossaceae) revealed by PacBio sequencing of homologous and homeologous nuclear regions.

Polyploidy is a major speciation process in vascular plants, and is postulated to be particularly important in shaping the diversity of extant ferns. However, limitations in the availability of bi-parental markers for ferns have greatly limited phylogenetic investigation of polyploidy in this group. With a large number of allopolyploid species, the genus Botrychium is a classic example in ferns where recurrent polyploidy is postulated to have driven frequent speciation events. Here, we use PacBio sequencing and the PURC bioinformatics pipeline to capture all homeologous or allelic copies of four long (~1?kb) low-copy nuclear regions from a sample of 45 specimens (25 diploids and 20 polyploids) representing 37 Botrychium taxa, and three outgroups. This sample includes most currently recognized Botrychium species in Europe and North America, and the majority of our specimens were genotyped with co-dominant nuclear allozymes to ensure species identification. We analyzed the sequence data using maximum likelihood (ML) and Bayesian inference (BI) concatenated-data (“gene tree”) approaches to explore the relationships among Botrychium species. Finally, we estimated divergence times among Botrychium lineages and inferred the multi-labeled polyploid species tree showing the origins of the polyploid taxa, and their relationships to each other and to their diploid progenitors. We found strong support for the monophyly of the major lineages within Botrychium and identified most of the parental donors of the polyploids; these results largely corroborate earlier morphological and allozyme-based investigations. Each polyploid had at least two distinct homeologs, indicating that all sampled polyploids are likely allopolyploids (rather than autopolyploids). Our divergence-time analyses revealed that these allopolyploid lineages originated recently-within the last two million years-and thus that the genus has undergone a recent radiation, correlated with multiple independent allopolyploidizations across the phylogeny. Also, we found strong parental biases in the formation of allopolyploids, with individual diploid species participating multiple times as either the maternal or paternal donor (but not both). Finally, we discuss the role of polyploidy in the evolutionary history of Botrychium and the interspecific reproductive barriers possibly involved in these parental biases. Copyright © 2017 Elsevier Inc. All rights reserved.


September 22, 2019

The hardy rubber tree genome provides insights into the evolution of polyisoprene biosynthesis.

Eucommia ulmoides, also called hardy rubber tree, is an economically important tree; however, the lack of its genome sequence restricts the fundamental biological research and applied studies of this plant species. Here, we present a high-quality assembly of its ~1.2-Gb genome (scaffold N50 = 1.88 Mb) with at least 26 723 predicted genes for E. ulmoides, the first sequenced genome of the order Garryales, which was obtained using an integrated strategy combining Illumina sequencing, PacBio sequencing, and BioNano mapping. As a sister taxon to lamiids and campanulids, E. ulmoides underwent an ancient genome triplication shared by core eudicots but no further whole-genome duplication in the last ~125 million years. E. ulmoides exhibits high expression levels and/or gene number expansion for multiple genes involved in stress responses and the biosynthesis of secondary metabolites, which may account for its considerable environmental adaptability. In contrast to the rubber tree (Hevea brasiliensis), which produces cis-polyisoprene, E. ulmoides has evolved to synthesize long-chain trans-polyisoprene via farnesyl diphosphate synthases (FPSs). Moreover, FPS and rubber elongation factor/small rubber particle protein gene families were expanded independently from the H. brasiliensis lineage. These results provide new insights into the biology of E. ulmoides and the origin of polyisoprene biosynthesis. Copyright © 2017 The Authors. Published by Elsevier Inc. All rights reserved.


September 22, 2019

The complete mitochondrial genome of the hermaphroditic freshwater mussel Anodonta cygnea (Bivalvia: Unionidae): in silico analyses of sex-specific ORFs across order Unionoida.

Doubly uniparental inheritance (DUI) of mitochondrial DNA in bivalves is a fascinating exception to strictly maternal inheritance as practiced by all other animals. Recent work on DUI suggests that there may be unique regions of the mitochondrial genomes that play a role in sex determination and/or sexual development in freshwater mussels (order Unionoida). In this study, one complete mitochondrial genome of the hermaphroditic swan mussel, Anodonta cygnea, is sequenced and compared to the complete mitochondrial genome of the gonochoric duck mussel, Anodonta anatina. An in silico assessment of novel proteins found within freshwater bivalve species (known as F-, H-, and M-open reading frames or ORFs) is conducted, with special attention to putative transmembrane domains (TMs), signal peptides (SPs), signal cleavage sites (SCS), subcellular localization, and potential control regions. Characteristics of TMs are also examined across freshwater mussel lineages.In silico analyses suggests the presence of SPs and SCSs and provides some insight into possible function(s) of these novel ORFs. The assessed confidence in these structures and functions was highly variable, possibly due to the novelty of these proteins. The number and topology of putative TMs appear to be maintained among both F- and H-ORFs, however, this is not the case for M-ORFs. There does not appear to be a typical control region in H-type mitochondrial DNA, especially given the loss of tandem repeats in unassigned regions when compared to F-type mtDNA.In silico analyses provides a useful tool to discover patterns in DUI and to navigate further in situ analyses related to DUI in freshwater mussels. In situ analysis will be necessary to further explore the intracellular localizations and possible role of these open reading frames in the process of sex determination in freshwater mussel.


September 22, 2019

Targeted long-read sequencing of a locus under long-term balancing selection in Capsella.

Rapid advances in short-read DNA sequencing technologies have revolutionized population genomic studies, but there are genomic regions where this technology reaches its limits. Limitations mostly arise due to the difficulties in assembly or alignment to genomic regions of high sequence divergence and high repeat content, which are typical characteristics for loci under strong long-term balancing selection. Studying genetic diversity at such loci therefore remains challenging. Here, we investigate the feasibility and error rates associated with targeted long-read sequencing of a locus under balancing selection. For this purpose, we generated bacterial artificial chromosomes (BACs) containing the Brassicaceae S-locus, a region under strong negative frequency-dependent selection which has previously proven difficult to assemble in its entirety using short reads. We sequence S-locus BACs with single-molecule long-read sequencing technology and conduct de novo assembly of these S-locus haplotypes. By comparing repeated assemblies resulting from independent long-read sequencing runs on the same BAC clone we do not detect any structural errors, suggesting that reliable assemblies are generated, but we estimate an indel error rate of 5.7×10-5 A similar error rate was estimated based on comparison of Illumina short-read sequences and BAC assemblies. Our results show that, until de novo assembly of multiple individuals using long-read sequencing becomes feasible, targeted long-read sequencing of loci under balancing selection is a viable option with low error rates for single nucleotide polymorphisms or structural variation. We further find that short-read sequencing is a valuable complement, allowing correction of the relatively high rate of indel errors that result from this approach. Copyright © 2018 Bachmann et al.


September 22, 2019

Ginseng Genome Database: an open-access platform for genomics of Panax ginseng.

The ginseng (Panax ginseng C.A. Meyer) is a perennial herbaceous plant that has been used in traditional oriental medicine for thousands of years. Ginsenosides, which have significant pharmacological effects on human health, are the foremost bioactive constituents in this plant. Having realized the importance of this plant to humans, an integrated omics resource becomes indispensable to facilitate genomic research, molecular breeding and pharmacological study of this herb.The first draft genome sequences of P. ginseng cultivar “Chunpoong” were reported recently. Here, using the draft genome, transcriptome, and functional annotation datasets of P. ginseng, we have constructed the Ginseng Genome Database http://ginsengdb.snu.ac.kr /, the first open-access platform to provide comprehensive genomic resources of P. ginseng. The current version of this database provides the most up-to-date draft genome sequence (of approximately 3000 Mbp of scaffold sequences) along with the structural and functional annotations for 59,352 genes and digital expression of genes based on transcriptome data from different tissues, growth stages and treatments. In addition, tools for visualization and the genomic data from various analyses are provided. All data in the database were manually curated and integrated within a user-friendly query page.This database provides valuable resources for a range of research fields related to P. ginseng and other species belonging to the Apiales order as well as for plant research communities in general. Ginseng genome database can be accessed at http://ginsengdb.snu.ac.kr /.


September 22, 2019

Gene presence-absence polymorphism in castrating anther-smut fungi: Recent gene Gains and Phylogeographic Structure.

Gene presence-absence polymorphisms segregating within species are a significant source of genetic variation but have been little investigated to date in natural populations. In plant pathogens, the gain or loss of genes encoding proteins interacting directly with the host, such as secreted proteins, probably plays an important role in coevolution and local adaptation. We investigated gene presence-absence polymorphism in populations of two closely related species of castrating anther-smut fungi, Microbotryum lychnidis-dioicae (MvSl) and M. silenes-dioicae (MvSd), from across Europe, on the basis of Illumina genome sequencing data and high-quality genome references. We observed presence-absence polymorphism for 186 autosomal genes (2% of all genes) in MvSl, and only 51 autosomal genes in MvSd. Distinct genes displayed presence-absence polymorphism in the two species. Genes displaying presence-absence polymorphism were frequently located in subtelomeric and centromeric regions and close to repetitive elements, and comparison with outgroups indicated that most were present in a single species, being recently acquired through duplications in multiple-gene families. Gene presence-absence polymorphism in MvSl showed a phylogeographic structure corresponding to clusters detected based on SNPs. In addition, gene absence alleles were rare within species and skewed toward low-frequency variants. These findings are consistent with a deleterious or neutral effect for most gene presence-absence polymorphism. Some of the observed gene loss and gain events may however be adaptive, as suggested by the putative functions of the corresponding encoded proteins (e.g., secreted proteins) or their localization within previously identified selective sweeps. The adaptive roles in plant and anther-smut fungi interactions of candidate genes however need to be experimentally tested in future studies.


September 22, 2019

The complete chloroplast genome of Chrysanthemum boreale (Asteraceae)

Chrysanthemum boreale is a perennial plant in the Asteraceae family that is native to eastern Asia and has both ornamental and herbal uses. Here, we determined the complete chloroplast genome sequence for C. boreale using long-read sequencing. The chloroplast genome was 151,012?bp and consisted of a large single copy (LSC) region (82,817?bp), a small single copy (SSC) region (18,281?bp) and two inverted repeats (IRs) (24,957?bp). It was predicted to contain 131 genes, including 87 protein-coding genes, eight rRNAs and 46 tRNAs. Phylogenetic analysis of chloroplast genomes clustered C. boreale with other Chrysanthemum and Asteraceae species.


September 22, 2019

DNA N6-adenine methylation in Arabidopsis thaliana.

DNA methylation on N6-adenine (6mA) has recently been found to be a potentially epigenetic mark in several unicellular and multicellular eukaryotes. However, its distribution patterns and potential functions in land plants, which are primary producers for most ecosystems, remain largely unknown. Here we report global profiling of 6mA sites at single-nucleotide resolution in the genome of Arabidopsis thaliana at different developmental stages using single-molecule real-time sequencing. 6mA sites are widely distributed across the Arabidopsis genome and enriched over the pericentromeric heterochromatin regions. 6mA occurs more frequently in gene bodies than intergenic regions. Analysis of 6mA methylomes and RNA sequencing data demonstrates that 6mA frequency positively correlates with the gene expression level and the transition from vegetative to reproductive growth in Arabidopsis. Our results uncover 6mA as a DNA mark associated with actively expressed genes in Arabidopsis, suggesting that 6mA serves as a hitherto unknown epigenetic mark in land plants. Copyright © 2018 Elsevier Inc. All rights reserved.


Talk with an expert

If you have a question, need to check the status of an order, or are interested in purchasing an instrument, we're here to help.