Menu
September 22, 2019

Whole-genome landscape of Medicago truncatula symbiotic genes.

Advances in deciphering the functional architecture of eukaryotic genomes have been facilitated by recent breakthroughs in sequencing technologies, enabling a more comprehensive representation of genes and repeat elements in genome sequence assemblies, as well as more sensitive and tissue-specific analyses of gene expression. Here we show that PacBio sequencing has led to a substantially improved genome assembly of Medicago truncatula A17, a legume model species notable for endosymbiosis studies1, and has enabled the identification of genome rearrangements between genotypes at a near-base-pair resolution. Annotation of the new M. truncatula genome sequence has allowed for a thorough analysis of transposable elements and their dynamics, as well as the identification of new players involved in symbiotic nodule development, in particular 1,037 upregulated long non-coding RNAs (lncRNAs). We have also discovered that a substantial proportion (~35% and 38%, respectively) of the genes upregulated in nodules or expressed in the nodule differentiation zone colocalize in genomic clusters (270 and 211, respectively), here termed symbiotic islands. These islands contain numerous expressed lncRNA genes and display differentially both DNA methylation and histone marks. Epigenetic regulations and lncRNAs are therefore attractive candidate elements for the orchestration of symbiotic gene expression in the M. truncatula genome.


September 22, 2019

Desiccation Tolerance Evolved through Gene Duplication and Network Rewiring in Lindernia.

Although several resurrection plant genomes have been sequenced, the lack of suitable dehydration-sensitive outgroups has limited genomic insights into the origin of desiccation tolerance. Here, we utilized a comparative system of closely related desiccation-tolerant (Lindernia brevidens) and -sensitive (Lindernia subracemosa) species to identify gene- and pathway-level changes associated with the evolution of desiccation tolerance. The two high-quality Lindernia genomes we assembled are largely collinear, and over 90% of genes are conserved. L. brevidens and L. subracemosa have evidence of an ancient, shared whole-genome duplication event, and retained genes have neofunctionalized, with desiccation-specific expression in L. brevidens Tandem gene duplicates also are enriched in desiccation-associated functions, including a dramatic expansion of early light-induced proteins from 4 to 26 copies in L. brevidens A comparative differential gene coexpression analysis between L. brevidens and L. subracemosa supports extensive network rewiring across early dehydration, desiccation, and rehydration time courses. Many LATE EMBRYOGENESIS ABUNDANT genes show significantly higher expression in L. brevidens compared with their orthologs in L. subracemosa Coexpression modules uniquely upregulated during desiccation in L. brevidens are enriched with seed-specific and abscisic acid-associated cis-regulatory elements. These modules contain a wide array of seed-associated genes that have no expression in the desiccation-sensitive L. subracemosa Together, these findings suggest that desiccation tolerance evolved through a combination of gene duplications and network-level rewiring of existing seed desiccation pathways.© 2018 American Society of Plant Biologists. All rights reserved.


September 22, 2019

The genomic landscape of molecular responses to natural drought stress in Panicum hallii

Environmental stress is a major driver of ecological community dynamics and agricultural productivity. This is especially true for soil water availability, because drought is the greatest abiotic inhibitor of worldwide crop yields. Here, we test the genetic basis of drought responses in the genetic model for C4perennial grasses, Panicum hallii, through population genomics, field-scale gene-expression (eQTL) analysis, and comparison of two complete genomes. While gene expression networks are dominated by local cis-regulatory elements, we observe three genomic hotspots of unlinked trans-regulatory loci. These regulatory hubs are four times more drought responsive than the genome-wide average. Additionally, cis- and trans-regulatory networks are more likely to have opposing effects than expected under neutral evolution, supporting a strong influence of compensatory evolution and stabilizing selection. These results implicate trans-regulatory evolution as a driver of drought responses and demonstrate the potential for crop improvement in drought-prone regions through modification of gene regulatory networks.


September 22, 2019

N6-methyladenine DNA methylation in Japonica and Indica rice genomes and its association with gene expression, plant development, and stress responses.

N6-Methyladenine (6mA) DNA methylation has recently been implicated as a potential new epigenetic marker in eukaryotes, including the dicot model Arabidopsis thaliana. However, the conservation and divergence of 6mA distribution patterns and functions in plants remain elusive. Here we report high-quality 6mA methylomes at single-nucleotide resolution in rice based on substantially improved genome sequences of two rice cultivars, Nipponbare (Nip; Japonica) and 93-11 (Indica). Analysis of 6mA genomic distribution and its association with transcription suggest that 6mA distribution and function is rather conserved between rice and Arabidopsis. We found that 6mA levels are positively correlated with the expression of key stress-related genes, which may be responsible for the difference in stress tolerance between Nip and 93-11. Moreover, we showed that mutations in DDM1 cause defects in plant growth and decreased 6mA level. Our results reveal that 6mA is a conserved DNA modification that is positively associated with gene expression and contributes to key agronomic traits in plants. Copyright © 2018 The Author. Published by Elsevier Inc. All rights reserved.


September 22, 2019

How resurrection plants survive being hung out to dry.

Resurrection plants have the unique ability to survive extreme dehydration (desiccation), lying dormant for months or sometimes years until rehydration is possible. This formidable survival strategy has independently evolved several times across the land plant phylogeny, and several phylogenetically diverse resurrection plant genomes have been sequenced and assembled in an attempt to understand the causal genetic mechanisms. Large-scale comparisons across each of these phylogenetically distant resurrection plant genomes reveals that some conserved molecular signatures may underlie desiccation tolerance (Illing et al., 2005; Zhang and Bartels, 2018), but overall the genes, networks, and regulatory factors that underlie desiccation tolerance remain largely unknown.


September 22, 2019

Approaches for surveying cosmic radiation damage in large populations of Arabidopsis thaliana seeds-Antarctic balloons and particle beams.

The Cosmic Ray Exposure Sequencing Science (CRESS) payload system is a proof of concept experiment to assess the genomic impact of space radiation on seeds. CRESS was designed as a secondary payload for the December 2016 high-altitude, high-latitude, and long-duration balloon flight carrying the Boron And Carbon Cosmic Rays in the Upper Stratosphere (BACCUS) experimental hardware. Investigation of the biological effects of Galactic Cosmic Radiation (GCR), particularly those of ions with High-Z and Energy (HZE), is of interest due to the genomic damage this type of radiation inflicts. The biological effects of upper-stratospheric mixed radiation above Antarctica (ANT) were sampled using Arabidopsis thaliana seeds and were compared to those resulting from a controlled simulation of GCR at Brookhaven National Laboratory (BNL) and to laboratory control seed. The payload developed for Antarctica exposure was broadly designed to 1U CubeSat specifications (10cmx10cmx10cm, =1.33kg), maintained 1 atm internal pressure, and carried an internal cargo of four seed trays (about 580,000 seeds) and twelve CR-39 Solid-State Nuclear Track Detectors (SSNTDs). The irradiated seeds were recovered, sterilized and grown on Petri plates for phenotypic screening. BNL and ANT M0 seeds showed significantly reduced germination rates and elevated somatic mutation rates when compared to non-irradiated controls, with the BNL mutation rate also being significantly higher than that of ANT. Genomic DNA from mutants of interest was evaluated with whole-genome sequencing using PacBio SMRT technology. Sequence data revealed the presence of an array of genome structural variants in the genomes of M0 and M1 mutant plants.


September 22, 2019

Sex chromosome evolution via two genes

The origin of sex chromosomes has been hypothesized to involve the linkage of factors with antagonistic effects on male and female function. Garden asparagus (Asparagus officinalis L.) is an ideal species to test this hypothesis, as the X and Y chromosomes are cytologically homomorphic and recently evolved from an ancestral autosome pair in association with a shift from hermaphroditism to dioecy. Mutagenesis screens paired with single-molecule fluorescence in situ hybridization (smFISH) directly implicate Y-specific genes that respectively suppress female organ development and are necessary for male gametophyte development. Comparison of contiguous X and Y chromosome shows that loss of recombination between the genes suppressing female function (SUPPRESSOR OF FEMALE FUNCTION, SOFF) and promoting male function (TAPETAL DEVELOPMENT AND FUNCTION 1, aspTDF1) is due to hemizygosity. We also experimentally demonstrate the function of aspTDF1. These finding provide direct evidence that sex chromosomes can evolve from autosomes via two sex determination genes: a dominant suppressor of femaleness and a promoter of maleness.


September 22, 2019

Glyphosate resistance and EPSPS gene duplication: Convergent evolution in multiple plant species.

One of the increasingly widespread mechanisms of resistance to the herbicide glyphosate is copy number variation (CNV) of the 5-enolpyruvylshikimate-3-phosphate synthase (EPSPS) gene. EPSPS gene duplication has been reported in eight weed species, ranging from 3-5 extra copies to more than 150 extra copies. In the case of Palmer amaranth (Amaranthus palmeri), a section of >300 kb containing EPSPS and many other genes has been replicated and inserted at new loci throughout the genome, resulting in significant increase in total genome size. The replicated sequence contains several classes of mobile genetic elements including helitrons, raising the intriguing possibility of extra-chromosomal replication of the EPSPS-containing sequence. In kochia (Kochia scoparia), from three to more than 10 extra EPSPS copies are arranged as a tandem gene duplication at one locus. In the remaining six weed species that exhibit EPSPS gene duplication, little is known about the underlying mechanisms of gene duplication or their entire sequence. There is mounting evidence that adaptive gene amplification is an important mode of evolution in the face of intense human-mediated selection pressure. The convergent evolution of CNVs for glyphosate resistance in weeds, through at least two different mechanisms, may be indicative of a more general importance for this mechanism of adaptation in plants. CNVs warrant further investigation across plant functional genomics for adaptation to biotic and abiotic stresses, particularly for adaptive evolution on rapid time scales.© The American Genetic Association 2017. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.


September 21, 2019

in silico Whole Genome Sequencer & Analyzer (iWGS): a computational pipeline to guide the design and analysis of de novo genome sequencing studies.

The availability of genomes across the tree of life is highly biased toward vertebrates, pathogens, human disease models, and organisms with relatively small and simple genomes. Recent progress in genomics has enabled the de novo decoding of the genome of virtually any organism, greatly expanding its potential for understanding the biology and evolution of the full spectrum of biodiversity. The increasing diversity of sequencing technologies, assays, and de novo assembly algorithms have augmented the complexity of de novo genome sequencing projects in non-model organisms. To reduce the costs and challenges in de novo genome sequencing projects and streamline their experimental design and analysis, we developed iWGS (in silico Whole Genome Sequencer and Analyzer), an automated pipeline for guiding the choice of appropriate sequencing strategy and assembly protocols. iWGS seamlessly integrates the four key steps of a de novo genome sequencing project: data generation (through simulation), data quality control, de novo assembly, and assembly evaluation and validation. The last three steps can also be applied to the analysis of real data. iWGS is designed to enable the user to have great flexibility in testing the range of experimental designs available for genome sequencing projects, and supports all major sequencing technologies and popular assembly tools. Three case studies illustrate how iWGS can guide the design of de novo genome sequencing projects and evaluate the performance of a wide variety of user-specified sequencing strategies and assembly protocols on genomes of differing architectures. iWGS, along with a detailed documentation, is freely available at https://github.com/zhouxiaofan1983/iWGS. Copyright © 2016 Author et al.


September 21, 2019

Complete chloroplast genome sequence of the red silk cotton tree (Bombax ceiba)

Bombax ceiba L. is a beautiful and deciduous tree with great ecological and economic importance. The third generation sequencing of chloroplast genome of B. ceiba was conducted on the PacBio sequencing platform (Pacific Biosciences). The complete chloroplast genome was 158,997?bp, which contains a large single-copy (LSC) region (89,021?bp), a small single-copy (SSC) region (21,110?bp), and two inverted repeats (IRs) (24,433?bp). In total, 116 genes were annotated, including 81 protein-coding genes, eight rRNA genes, and 27 tRNA genes. The phylogenetic tree showed that B. ceiba was closely clustered with one clade of Malvaceae.


September 21, 2019

A distinct and genetically diverse lineage of the hybrid fungal pathogen Verticillium longisporum population causes stem striping in British oilseed rape.

Population genetic structures illustrate evolutionary trajectories of organisms adapting to differential environmental conditions. Verticillium stem striping disease on oilseed rape was mainly observed in continental Europe, but has recently emerged in the United Kingdom. The disease is caused by the hybrid fungal species Verticillium longisporum that originates from at least three separate hybridization events, yet hybrids between Verticillium progenitor species A1 and D1 are mainly responsible for Verticillium stem striping. We reveal a hitherto un-described dichotomy within V. longisporum lineage A1/D1 that correlates with the geographic distribution of the isolates with an ‘A1/D1 West’ and an ‘A1/D1 East’ cluster. Genome comparison between representatives of the A1/D1 West and East clusters excluded population distinctiveness through separate hybridization events. Remarkably, the A1/D1 West population that is genetically more diverse than the entire A1/D1 East cluster caused the sudden emergence of Verticillium stem striping in the UK, whereas in continental Europe Verticillium stem striping is predominantly caused by the more genetically uniform A1/D1 East population. The observed genetic diversity of the A1/D1 West population argues against a recent introduction of the pathogen into the UK, but rather suggests that the pathogen previously established in the UK and remained latent or unnoticed as oilseed rape pathogen until recently.© 2017 The Authors. Environmental Microbiology published by Society for Applied Microbiology and John Wiley & Sons Ltd.


September 21, 2019

Assessing genome assembly quality using the LTR Assembly Index (LAI).

Assembling a plant genome is challenging due to the abundance of repetitive sequences, yet no standard is available to evaluate the assembly of repeat space. LTR retrotransposons (LTR-RTs) are the predominant interspersed repeat that is poorly assembled in draft genomes. Here, we propose a reference-free genome metric called LTR Assembly Index (LAI) that evaluates assembly continuity using LTR-RTs. After correcting for LTR-RT amplification dynamics, we show that LAI is independent of genome size, genomic LTR-RT content, and gene space evaluation metrics (i.e., BUSCO and CEGMA). By comparing genomic sequences produced by various sequencing techniques, we reveal the significant gain of assembly continuity by using long-read-based techniques over short-read-based methods. Moreover, LAI can facilitate iterative assembly improvement with assembler selection and identify low-quality genomic regions. To apply LAI, intact LTR-RTs and total LTR-RTs should contribute at least 0.1% and 5% to the genome size, respectively. The LAI program is freely available on GitHub: https://github.com/oushujun/LTR_retriever.


September 21, 2019

Assembling large genomes with single-molecule sequencing and locality-sensitive hashing.

Long-read, single-molecule real-time (SMRT) sequencing is routinely used to finish microbial genomes, but available assembly methods have not scaled well to larger genomes. We introduce the MinHash Alignment Process (MHAP) for overlapping noisy, long reads using probabilistic, locality-sensitive hashing. Integrating MHAP with the Celera Assembler enabled reference-grade de novo assemblies of Saccharomyces cerevisiae, Arabidopsis thaliana, Drosophila melanogaster and a human hydatidiform mole cell line (CHM1) from SMRT sequencing. The resulting assemblies are highly continuous, include fully resolved chromosome arms and close persistent gaps in these reference genomes. Our assembly of D. melanogaster revealed previously unknown heterochromatic and telomeric transition sequences, and we assembled low-complexity sequences from CHM1 that fill gaps in the human GRCh38 reference. Using MHAP and the Celera Assembler, single-molecule sequencing can produce de novo near-complete eukaryotic assemblies that are 99.99% accurate when compared with available reference genomes.


September 21, 2019

Phased diploid genome assembly with single-molecule real-time sequencing.

While genome assembly projects have been successful in many haploid and inbred species, the assembly of noninbred or rearranged heterozygous genomes remains a major challenge. To address this challenge, we introduce the open-source FALCON and FALCON-Unzip algorithms (https://github.com/PacificBiosciences/FALCON/) to assemble long-read sequencing data into highly accurate, contiguous, and correctly phased diploid genomes. We generate new reference sequences for heterozygous samples including an F1 hybrid of Arabidopsis thaliana, the widely cultivated Vitis vinifera cv. Cabernet Sauvignon, and the coral fungus Clavicorona pyxidata, samples that have challenged short-read assembly approaches. The FALCON-based assemblies are substantially more contiguous and complete than alternate short- or long-read approaches. The phased diploid assembly enabled the study of haplotype structure and heterozygosities between homologous chromosomes, including the identification of widespread heterozygous structural variation within coding sequences.


September 21, 2019

Population sequencing reveals clonal diversity and ancestral inbreeding in the grapevine cultivar Chardonnay.

Chardonnay is the basis of some of the world’s most iconic wines and its success is underpinned by a historic program of clonal selection. There are numerous clones of Chardonnay available that exhibit differences in key viticultural and oenological traits that have arisen from the accumulation of somatic mutations during centuries of asexual propagation. However, the genetic variation that underlies these differences remains largely unknown. To address this knowledge gap, a high-quality, diploid-phased Chardonnay genome assembly was produced from single-molecule real time sequencing, and combined with re-sequencing data from 15 different Chardonnay clones. There were 1620 markers identified that distinguish the 15 clones. These markers were reliably used for clonal identification of independently sourced genomic material, as well as in identifying a potential genetic basis for some clonal phenotypic differences. The predicted parentage of the Chardonnay haplomes was elucidated by mapping sequence data from the predicted parents of Chardonnay (Gouais blanc and Pinot noir) against the Chardonnay reference genome. This enabled the detection of instances of heterosis, with differentially-expanded gene families being inherited from the parents of Chardonnay. Most surprisingly however, the patterns of nucleotide variation present in the Chardonnay genome indicate that Pinot noir and Gouais blanc share an extremely high degree of kinship that has resulted in the Chardonnay genome displaying characteristics that are indicative of inbreeding.


Talk with an expert

If you have a question, need to check the status of an order, or are interested in purchasing an instrument, we're here to help.