Menu
September 22, 2019

Sequencing of Panax notoginseng genome reveals genes involved in disease resistance and ginsenoside biosynthesis

Background: Panax notoginseng is a traditional Chinese herb with high medicinal and economic value. There has been considerable research on the pharmacological activities of ginsenosides contained in Panax spp.; however, very little is known about the ginsenoside biosynthetic pathway. Results: We reported the first de novo genome of 2.36 Gb of sequences from P. notoginseng with 35,451 protein-encoding genes. Compared to other plants, we found notable gene family contraction of disease-resistance genes in P. notoginseng, but notable expansion for several ATP-binding cassette (ABC) transporter subfamilies, such as the Gpdr subfamily, indicating that ABCs might be an additional mechanism for the plant to cope with biotic stress. Combining eight transcriptomes of roots and aerial parts, we identified several key genes, their transcription factor binding sites and all their family members involved in the synthesis pathway of ginsenosides in P. notoginseng, including dammarenediol synthase, CYP716 and UGT71. Conclusions: The complete genome analysis of P. notoginseng, the first in genus Panax, will serve as an important reference sequence for improving breeding and cultivation of this important nutraceutical and medicinal but vulnerable plant species.


September 22, 2019

Fusarium species complex causing Pokkah Boeng in China

Sugarcane is one of the most important crops for sugar production in sugarcane-growing areas. Many biotic and abiotic stresses affected the sugarcane production which leads to severe losses. Pokkah boeng is now playing a very important role due to its economic threats. Currently, the occurrence and rigorousness of pokkah boeng disease have been spread like wildfire from major sugarcane-growing countries. Pokkah boeng is a fungal disease that can cause serious yield losses in susceptible varieties. Infection of the disease is caused either by spores or ascospores. It may cause serious yield losses in commercial plantings. However, there have been many reported outbreaks of the disease which have looked spectacular but have caused trade and industry loss. Fusarium species complex is the major causal agent of this disease around the world, but some researchers have documented the increased importance of Fusarium. Three Fusarium species have been identified to cause the sugarcane pokkah boeng disease in China. Moreover, Fusarium may be accompanied of its mycotoxin production, genomic sequencing, and association with nitrogen application in China. Many studies on disease investigations, breeding of disease-resistant varieties, and strategy of disease control have also been carried out in China.


September 22, 2019

A chromosome scale assembly of the model desiccation tolerant grass Oropetium thomaeum

Oropetium thomaeum is an emerging model for desiccation tolerance and genome size evolution in grasses. A high-quality draft genome of Oropetium was recently sequenced, but the lack of a chromosome scale assembly has hindered comparative analyses and downstream functional genomics. Here, we reassembled Oropetium, and anchored the genome into ten chromosomes using Hi-C based chromatin interactions. A combination of high-resolution RNAseq data and homology-based gene prediction identified thousands of new, conserved gene models that were absent from the V1 assembly. This includes thousands of new genes with high expression across a desiccation timecourse. The sorghum and Oropetium genomes have a surprising degree of chromosome-level collinearity, and several chromosome pairs have near perfect synteny. Other chromosomes are collinear in the gene rich chromosome arms but have experienced pericentric translocations. Together, these resources will be useful for the grass comparative genomic community and further establish Oropetium as a model resurrection plant.


September 22, 2019

Identification of the DNA methyltransferases establishing the methylome of the cyanobacterium Synechocystis sp. PCC 6803.

DNA methylation in bacteria is important for defense against foreign DNA, but is also involved in DNA repair, replication, chromosome partitioning, and regulatory processes. Thus, characterization of the underlying DNA methyltransferases in genetically tractable bacteria is of paramount importance. Here, we characterized the methylome and orphan methyltransferases in the model cyanobacterium Synechocystis sp. PCC 6803. Single molecule real-time (SMRT) sequencing revealed four DNA methylation recognition sequences in addition to the previously known motif m5CGATCG, which is recognized by M.Ssp6803I. For three of the new recognition sequences, we identified the responsible methyltransferases. M.Ssp6803II, encoded by the sll0729 gene, modifies GGm4CC, M.Ssp6803III, encoded by slr1803, represents the cyanobacterial dam-like methyltransferase modifying Gm6ATC, and M.Ssp6803V, encoded by slr6095 on plasmid pSYSX, transfers methyl groups to the bipartite motif GGm6AN7TTGG/CCAm6AN7TCC. The remaining methylation recognition sequence GAm6AGGC is probably recognized by methyltransferase M.Ssp6803IV encoded by slr6050. M.Ssp6803III and M.Ssp6803IV were essential for the viability of Synechocystis, while the strains lacking M.Ssp6803I and M.Ssp6803V showed growth similar to the wild type. In contrast, growth was strongly diminished of the ?sll0729 mutant lacking M.Ssp6803II. These data provide the basis for systematic studies on the molecular mechanisms impacted by these methyltransferases.


September 22, 2019

Analysis of the Gli-D2 locus identifies a genetic target for simultaneously improving the breadmaking and health-related traits of common wheat.

Gliadins are a major component of wheat seed proteins. However, the complex homoeologous Gli-2 loci (Gli-A2, -B2 and -D2) that encode the a-gliadins in commercial wheat are still poorly understood. Here we analyzed the Gli-D2 locus of Xiaoyan 81 (Xy81), a winter wheat cultivar. A total of 421.091 kb of the Gli-D2 sequence was assembled from sequencing multiple bacterial artificial clones, and 10 a-gliadin genes were annotated. Comparative genomic analysis showed that Xy81 carried only eight of the a-gliadin genes of the D genome donor Aegilops tauschii, with two of them each experiencing a tandem duplication. A mutant line lacking Gli-D2 (DLGliD2) consistently exhibited better breadmaking quality and dough functionalities than its progenitor Xy81, but without penalties in other agronomic traits. It also had an elevated lysine content in the grains. Transcriptome analysis verified the lack of Gli-D2 a-gliadin gene expression in DLGliD2. Furthermore, the transcript and protein levels of protein disulfide isomerase were both upregulated in DLGliD2 grains. Consistent with this finding, DLGliD2 had increased disulfide content in the flour. Our work sheds light on the structure and function of Gli-D2 in commercial wheat, and suggests that the removal of Gli-D2 and the gliadins specified by it is likely to be useful for simultaneously enhancing the end-use and health-related traits of common wheat. Because gliadins and homologous proteins are widely present in grass species, the strategy and information reported here may be broadly useful for improving the quality traits of diverse cereal crops.© 2018 The Authors The Plant Journal © 2018 John Wiley & Sons Ltd.


September 22, 2019

Complete sequence of kenaf (Hibiscus cannabinus) mitochondrial genome and comparative analysis with the mitochondrial genomes of other plants.

Plant mitochondrial (mt) genomes are species specific due to the vast of foreign DNA migration and frequent recombination of repeated sequences. Sequencing of the mt genome of kenaf (Hibiscus cannabinus) is essential for elucidating its evolutionary characteristics. In the present study, single-molecule real-time sequencing technology (SMRT) was used to sequence the complete mt genome of kenaf. Results showed that the complete kenaf mt genome was 569,915?bp long and consisted of 62 genes, including 36 protein-coding, 3 rRNA and 23 tRNA genes. Twenty-five introns were found among nine of the 36 protein-coding genes, and five introns were trans-spliced. A comparative analysis with other plant mt genomes showed that four syntenic gene clusters were conserved in all plant mtDNAs. Fifteen chloroplast-derived fragments were strongly associated with mt genes, including the intact sequences of the chloroplast genes psaA, ndhB and rps7. According to the plant mt genome evolution analysis, some ribosomal protein genes and succinate dehydrogenase genes were frequently lost during the evolution of angiosperms. Our data suggest that the kenaf mt genome retained evolutionarily conserved characteristics. Overall, the complete sequencing of the kenaf mt genome provides additional information and enhances our better understanding of mt genomic evolution across angiosperms.


September 22, 2019

Optical and physical mapping with local finishing enables megabase-scale resolution of agronomically important regions in the wheat genome.

Numerous scaffold-level sequences for wheat are now being released and, in this context, we report on a strategy for improving the overall assembly to a level comparable to that of the human genome.Using chromosome 7A of wheat as a model, sequence-finished megabase-scale sections of this chromosome were established by combining a new independent assembly using a bacterial artificial chromosome (BAC)-based physical map, BAC pool paired-end sequencing, chromosome-arm-specific mate-pair sequencing and Bionano optical mapping with the International Wheat Genome Sequencing Consortium RefSeq v1.0 sequence and its underlying raw data. The combined assembly results in 18 super-scaffolds across the chromosome. The value of finished genome regions is demonstrated for two approximately 2.5 Mb regions associated with yield and the grain quality phenotype of fructan carbohydrate grain levels. In addition, the 50 Mb centromere region analysis incorporates cytological data highlighting the importance of non-sequence data in the assembly of this complex genome region.Sufficient genome sequence information is shown to now be available for the wheat community to produce sequence-finished releases of each chromosome of the reference genome. The high-level completion identified that an array of seven fructosyl transferase genes underpins grain quality and that yield attributes are affected by five F-box-only-protein-ubiquitin ligase domain and four root-specific lipid transfer domain genes. The completed sequence also includes the centromere.


September 22, 2019

Protocol: a versatile, inexpensive, high-throughput plant genomic DNA extraction method suitable for genotyping-by-sequencing.

The recent development of next-generation sequencing DNA marker technologies, such as genotyping-by-sequencing (GBS), generates thousands of informative single nucleotide polymorphism markers in almost any species, regardless of genomic resources. This enables poorly resourced or “orphan” crops/species access to high-density, high-throughput marker platforms which have revolutionised population genetics studies and plant breeding. DNA quality underpins success of GBS methods as the DNA must be amenable to restriction enzyme digestion and sequencing. A barrier to implementing GBS technologies is access to inexpensive, high-throughput extraction methods that yield sequencing-quality genomic DNA (gDNA) from plants. Several high-throughput DNA extraction methods are available, but typically provide low yield or poor quality gDNA, or are costly (US$6-$9/sample) for consumables.We modified a non-organic solvent protocol to extract microgram quantities (1-13 µg) of sequencing-quality high molecular weight gDNA inexpensively in 96-well plates from either fresh, freeze-dried or silica gel-dried plant tissue. The protocol was effective for several easy and difficult-to-extract forage, crop, horticultural and common model species including Trifolium, Medicago, Lolium, Secale, Festuca, Malus, Oryza, and Arabidopsis. The extracted DNA was of high molecular weight and digested readily with restriction enzymes. Contrasting with other extraction protocols we assessed, Illumina-based sequencing of GBS libraries developed from this gDNA had very uniform high quality base-calls to the end of sequence reads. Furthermore, DNA extracted using this method has been sequenced successfully with the PacBio long-read platform. The protocol is scalable, readily automated without requirement for fume hoods, requires approximately three hours to process 192 samples (384-576 samples/day), and is inexpensive at US$0.62/sample for consumables.This versatile, scalable and simple protocol yields high molecular weight genomic DNA suitable for restriction enzyme digestion and next-generation sequencing applications including GBS and long-read sequencing platforms such as PacBio. The low cost, high-throughput, and extraction of high quality gDNA from a range of fresh and dried source plant material makes this method suitable for many sequencing and genotyping applications including large-scale sample screening underpinning breeding programmes.


September 22, 2019

The complete mitochondrial genome of the early flowering plant Nymphaea colorata is highly repetitive with low recombination.

Mitochondrial genomes of flowering plants (angiosperms) are highly dynamic in genome structure. The mitogenome of the earliest angiosperm Amborella is remarkable in carrying rampant foreign DNAs, in contrast to Liriodendron, the other only known early angiosperm mitogenome that is described as ‘fossilized’. The distinctive features observed in the two early flowering plant mitogenomes add to the current confusions of what early flowering plants look like. Expanded sampling would provide more details in understanding the mitogenomic evolution of early angiosperms. Here we report the complete mitochondrial genome of water lily Nymphaea colorata from Nymphaeales, one of the three orders of the earliest angiosperms.Assembly of data from Pac-Bio long-read sequencing yielded a circular mitochondria chromosome of 617,195 bp with an average depth of 601×. The genome encoded 41 protein coding genes, 20 tRNA and three rRNA genes with 25 group II introns disrupting 10 protein coding genes. Nearly half of the genome is composed of repeated sequences, which contributed substantially to the intron size expansion, making the gross intron length of the Nymphaea mitochondrial genome one of the longest among angiosperms, including an 11.4-Kb intron in cox2, which is the longest organellar intron reported to date in plants. Nevertheless, repeat mediated homologous recombination is unexpectedly low in Nymphaea evidenced by 74 recombined reads detected from ten recombinationally active repeat pairs among 886,982 repeat pairs examined. Extensive gene order changes were detected in the three early angiosperm mitogenomes, i.e. 38 or 44 events of inversions and translocations are needed to reconcile the mitogenome of Nymphaea with Amborella or Liriodendron, respectively. In contrast to Amborella with six genome equivalents of foreign mitochondrial DNA, not a single horizontal gene transfer event was observed in the Nymphaea mitogenome.The Nymphaea mitogenome resembles the other available early angiosperm mitogenomes by a similarly rich 64-coding gene set, and many conserved gene clusters, whereas stands out by its highly repetitive nature and resultant remarkable intron expansions. The low recombination level in Nymphaea provides evidence for the predominant master conformation in vivo with a highly substoichiometric set of rearranged molecules.


September 22, 2019

Orphan legumes growing in dry environments: Marama bean as a case study.

Plants have developed morphological, physiological, biochemical, cellular, and molecular mechanisms to survive in drought-stricken environments with little or no water caused by below-average precipitation. In this mini-review, we highlight the characteristics that allows marama bean [Tylosema esculentum (Burchell) Schreiber], an example of an orphan legume native to arid regions of southwestern Southern Africa, to flourish under an inhospitable climate and dry soil conditions where no other agricultural crop competes in this agro-ecological zone. Orphan legumes are often better suited to withstand such harsh growth environments due to development of survival strategies using a combination of different traits and responses. Recent findings on questions on marama bean speciation, hybridization, population dynamics, and the evolutionary history of the bean and mechanisms by which the bean is able to extract and conserve water and nutrients from its environment as well as aspects of morphological and physiological adaptation will be reviewed. The importance of the soil microbiome and the genetic diversity in this species, and their interplay, as a reservoir for improvement will also be considered. In particular, the application of the newly established marama bean genome sequence will facilitate both the identification of important genes involved in the interaction with the soil microbiome and the identification of the diversity within the wild germplasm for genes involved drought tolerance. Since predicted future changes in climatic conditions, with less water availability for plant growth, will severely affect agricultural productivity, an understanding of the mechanisms of unique adaptations in marama bean to such conditions may also provide insights as to how to improve the performance of the major crops.


September 22, 2019

Comprehensive evaluation of the host responses to infection with differentially virulent classical swine fever virus strains in pigs.

Classical swine fever virus (CSFV) infection causes most variable clinical syndromes from chronic or latent infection to acute death, and it is generally acknowledged that the course of disease is affected by both virus and host factors. To compare host immune responses to differentially virulent CSFV strains in pigs, fifteen 8-week-old specific-pathogen-free pigs were randomly divided into four groups and inoculated with the CSFV Shimen strain (a highly virulent strain), the HLJZZ2014 strain (a moderately virulent strains), C-strain (an avirulent strain), and DMEM (mock control), respectively. Infection with the Shimen or HLJZZ2014 strain resulted in fever, clinical signs and histopathological lesions, which were not observed in the C-strain-inoculated pigs, though low viral genome copies were detected in the peripheral blood and tissue samples. The data showed that the virulence of the strains affected the outcome of duration and intensity of the disease rather than the tissue tropism of the virus. Furthermore, leukopenia, lymphocytopenia, differentiation of T-cells, and the secretion of cytokines associated with inflammation or apoptosis such as interferon alpha (IFN-a), tumor necrosis factor alpha (TNF-a), interleukin 2 (IL-2), IL-4, IL-6, and IL-10 were induced by the virulent CSFV infection, the differences reflected in onset and extent of the regulation. Taken together, our results revealed that the major differences among the three strains resided in the kinetics of host response to the infection: severe and immediate with the highly virulent strain, while progressive and delayed with the moderately virulent one. This comparative study will help to dissect the pathogenesis of CSFV. Copyright © 2018 Elsevier B.V. All rights reserved.


September 22, 2019

Population genomics of Culiseta melanura, the principal vector of Eastern equine encephalitis virus in the United States.

Eastern Equine Encephalitis (EEE) (Togaviridae, Alphavirus) is a highly pathogenic mosquito-borne arbovirus that circulates in an enzootic cycle involving Culiseta melanura mosquitoes and wild Passeriformes birds in freshwater swamp habitats. Recently, the northeastern United States has experienced an intensification of virus activity with increased human involvement and northward expansion into new regions. In addition to its principal role in enzootic transmission of EEE virus among avian hosts, recent studies on the blood-feeding behavior of Cs. melanura throughout its geographic range suggest that this mosquito may also be involved in epizootic / epidemic transmission to equines and humans in certain locales. Variations in blood feeding behavior may be a function of host availability, environmental factors, and/or underlying genetic differences among regional populations. Despite the importance of Cs. melanura in transmission and maintenance of EEE virus, the genetics of this species remains largely unexplored.To investigate the occurrence of genetic variation in Cs. melanura, the genome of this mosquito vector was sequenced resulting in a draft genome assembly of 1.28 gigabases with a contig N50 of 93.36 kilobases. Populations of Cs. melanura from 10 EEE virus foci in the eastern North America were genotyped with double-digest RAD-seq. Following alignment of reads to the reference genome, variant calling, and filtering, 40,384 SNPs were retained for downstream analyses. Subsequent analyses revealed genetic differentiation between northern and southern populations of this mosquito species. Moreover, limited fine-scale population structure was detected throughout northeastern North America, suggesting local differentiation of populations but also a history of ancestral polymorphism or contemporary gene flow. Additionally, a genetically distinct cluster was identified predominantly at two northern sites.This study elucidates the first evidence of fine-scale population structure in Cs. melanura throughout its eastern range and detects evidence of gene flow between populations in northeastern North America. This investigation provides the groundwork for examining the consequences of genetic variations in the populations of this mosquito species that could influence vector-host interactions and the risk of human and equine infection with EEE virus.


September 22, 2019

First draft genome assembly of the Argane tree (Argania spinosa)

Background: The Argane tree (Argania spinosa L. Skeels) is an endemic tree of southwestern Morocco that plays an important socioeconomic and ecologic role for a dense human population in an arid zone. Several studies confirmed the importance of this species as a food and feed source and as a resource for both pharmaceutical and cosmetic compounds. Unfortunately, the argane tree ecosystem is facing significant threats from environmental changes (global warming, over-population) and over-exploitation. Limited research has been conducted, however, on argane tree genetics and genomics, which hinders its conservation and genetic improvement. Methods: Here, we present a draft genome assembly of A. spinosa. A reliable reference genome of A. spinosa was created using a hybrid de novo assembly approach combining short and long sequencing reads. Results: In total, 144 Gb Illumina HiSeq reads and 7.2 Gb PacBio reads were produced and assembled. The final draft genome comprises 75 327 scaffolds totaling 671 Mb with an N50 of 49 916 kb. The draft assembly is close to the genome size estimated by k-mers distribution and covers 89% of complete and 4.3 % of partial Arabidopsis orthologous groups in BUSCO. Conclusion: The A. spinosa genome will be useful for assessing biodiversity leading to efficient conservation of this endangered endemic tree. Furthermore, the genome may enable genome-assisted cultivar breeding, and provide a better understanding of important metabolic pathways and their underlying genes for both cosmetic and pharmacological purposes.


September 22, 2019

Opposite polarity monospore genome de novo sequencing and comparative analysis reveal the possible heterothallic life cycle of Morchella importuna.

Morchella is a popular edible fungus worldwide due to its rich nutrition and unique flavor. Many research efforts were made on the domestication and cultivation of Morchella all over the world. In recent years, the cultivation of Morchella was successfully commercialized in China. However, the biology is not well understood, which restricts the further development of the morel fungus cultivation industry. In this paper, we performed de novo sequencing and assembly of the genomes of two monospores with a different mating type (M04M24 and M04M26) isolated from the commercially cultivated strain M04. Gene annotation and comparative genome analysis were performed to study differences in CAZyme (Carbohydrate-active enzyme) enzyme content, transcription factors, duplicated sequences, structure of mating type sites, and differences at the gene and functional levels between the two monospore strains of M. importuna. Results showed that the de novo assembled haploid M04M24 and M04M26 genomes were 48.98 and 51.07 Mb, respectively. A complete fine physical map of M. importuna was obtained from genome coverage and gene completeness evaluation. A total of 10,852 and 10,902 common genes and 667 and 868 endemic genes were identified from the two monospore strains, respectively. The Gene Ontology (GO) and KAAS (KEGG Automatic Annotation Serve) enrichment analyses showed that the endemic genes performed different functions. The two monospore strains had 99.22% collinearity with each other, accompanied with certain position and rearrangement events. Analysis of complete mating-type loci revealed that the two monospore M. importuna strains contained an independent mating-type structure and remained conserved in sequence and location. The phylogenetic and divergence time of M. importuna was analyzed at the whole-genome level for the first time. The bifurcation time of morel and tuber was estimated to be 201.14 million years ago (Mya); the two monospore strains with a different mating type represented the evolution of different nuclei, and the single copy homologous genes between them were also different due to a genetic differentiation distance about 0.65 Mya. Compared with truffles, M. importuna had an extension of 28 clusters of orthologous genes (COGs) and a contraction of two COGs. The two different polar nuclei with different degrees of contraction and expansion suggested that they might have undergone different evolutionary processes. The different mating-type structures, together with the functional clustering and enrichment analysis results of the endemic genes of the two different polar nuclei, imply that M. importuna might be a heterothallic fungus and the interaction between the endemic genes may be necessary for its complete life history. Studies on the genome of M. importuna facilitate a better understanding of morel biology and evolution.


September 22, 2019

Groundnut entered post-genome sequencing era: Opportunities and challenges in translating genomic information from genome to field

Cultivated groundnut or peanut (Arachis hypogaea) is an allopolyploid crop with a large complex genome and genetic barrier for exchanging genetic diversity from its wild relatives due to ploidy differences. Optimum genetic and genomic resources are key for accelerating the process for trait mapping and gene discovery and deploying diagnostic markers in genomics-assisted breeding. The better utilization of different aspects of peanut biology such as genetics, genomics, transcriptomics, proteomics, epigenomics, metabolomics, and interactomics can be of great help to groundnut genetic improvement program across the globe. The availability of high-quality reference genome is core to all the “omics” approaches, and hence optimum genomic resources are a must for fully exploiting the potential of modern science into conventional breeding. In this context, groundnut is passing through a very critical and transformational phase by making available the required genetic and genomic resources such as reference genomes of progenitors, resequencing of diverse lines, transcriptome resources, germplasm diversity panel, and multi-parent genetic populations for conducting high-resolution trait mapping, identification of associated markers, and development of diagnostic markers for selected traits. Lastly, the available resources have been deployed in translating genomic information from genome to field by developing improved groundnut lines with enhanced resistance to root-knot nematode, rust, and late leaf spot and high oleic acid. In addition, the International Peanut Genome Initiative (IPGI) have made available the high-quality reference genome for cultivated tetraploid groundnut which will facilitate better utilization of genetic resources in groundnut improvement. In parallel, the development of high-density genotyping platforms, such as Axiom_Arachis array with 58 K SNPs, and constitution of training population will initiate the deployment of the modern breeding approach, genomic selection, for achieving higher genetic gains in less time with more precision.


Talk with an expert

If you have a question, need to check the status of an order, or are interested in purchasing an instrument, we're here to help.