Menu
July 19, 2019

Advances in Sequencing and Resequencing in Crop Plants.

DNA sequencing technologies have changed the face of biological research over the last 20 years. From reference genomes to population level resequencing studies, these technologies have made significant contributions to our understanding of plant biology and evolution. As the technologies have increased in power, the breadth and complexity of the questions that can be asked has increased. Along with this, the challenges of managing unprecedented quantities of sequence data are mounting. This chapter describes a few aspects of the journey so far and looks forward to what may lie ahead.


July 19, 2019

RNAi is a critical determinant of centromere evolution in closely related fungi.

The centromere DNA locus on a eukaryotic chromosome facilitates faithful chromosome segregation. Despite performing such a conserved function, centromere DNA sequence as well as the organization of sequence elements is rapidly evolving in all forms of eukaryotes. The driving force that facilitates centromere evolution remains an enigma. Here, we studied the evolution of centromeres in closely related species in the fungal phylum of Basidiomycota. Using ChIP-seq analysis of conserved inner kinetochore proteins, we identified centromeres in three closely related Cryptococcus species: two of which are RNAi-proficient, while the other lost functional RNAi. We find that the centromeres in the RNAi-deficient species are significantly shorter than those of the two RNAi-proficient species. While centromeres are LTR retrotransposon-rich in all cases, the RNAi-deficient species lost all full-length retroelements from its centromeres. In addition, centromeres in RNAi-proficient species are associated with a significantly higher level of cytosine DNA modifications compared with those of RNAi-deficient species. Furthermore, when an RNAi-proficient Cryptococcus species and its RNAi-deficient mutants were passaged under similar conditions, the centromere length was found to be occasionally shortened in RNAi mutants. In silico analysis of predicted centromeres in a group of closely related Ustilago species, also belonging to the Basidiomycota, were found to have undergone a similar transition in the centromere length in an RNAi-dependent fashion. Based on the correlation found in two independent basidiomycetous species complexes, we present evidence suggesting that the loss of RNAi and cytosine DNA methylation triggered transposon attrition, which resulted in shortening of centromere length during evolution. Copyright © 2018 the Author(s). Published by PNAS.


July 19, 2019

Genome sequence of the progenitor of wheat A subgenome Triticum urartu.

Triticum urartu (diploid, AA) is the progenitor of the A subgenome of tetraploid (Triticum turgidum, AABB) and hexaploid (Triticum aestivum, AABBDD) wheat1,2. Genomic studies of T. urartu have been useful for investigating the structure, function and evolution of polyploid wheat genomes. Here we report the generation of a high-quality genome sequence of T. urartu by combining bacterial artificial chromosome (BAC)-by-BAC sequencing, single molecule real-time whole-genome shotgun sequencing 3 , linked reads and optical mapping4,5. We assembled seven chromosome-scale pseudomolecules and identified protein-coding genes, and we suggest a model for the evolution of T. urartu chromosomes. Comparative analyses with genomes of other grasses showed gene loss and amplification in the numbers of transposable elements in the T. urartu genome. Population genomics analysis of 147 T. urartu accessions from across the Fertile Crescent showed clustering of three groups, with differences in altitude and biostress, such as powdery mildew disease. The T. urartu genome assembly provides a valuable resource for studying genetic variation in wheat and related grasses, and promises to facilitate the discovery of genes that could be useful for wheat improvement.


July 19, 2019

Resequencing of 243 diploid cotton accessions based on an updated A genome identifies the genetic basis of key agronomic traits.

The ancestors of Gossypium arboreum and Gossypium herbaceum provided the A subgenome for the modern cultivated allotetraploid cotton. Here, we upgraded the G. arboreum genome assembly by integrating different technologies. We resequenced 243?G. arboreum and G. herbaceum accessions to generate a map of genome variations and found that they are equally diverged from Gossypium raimondii. Independent analysis suggested that Chinese G. arboreum originated in South China and was subsequently introduced to the Yangtze and Yellow River regions. Most accessions with domestication-related traits experienced geographic isolation. Genome-wide association study (GWAS) identified 98 significant peak associations for 11 agronomically important traits in G. arboreum. A nonsynonymous substitution (cysteine-to-arginine substitution) of GaKASIII seems to confer substantial fatty acid composition (C16:0 and C16:1) changes in cotton seeds. Resistance to fusarium wilt disease is associated with activation of GaGSTF9 expression. Our work represents a major step toward understanding the evolution of the A genome of cotton.


July 19, 2019

The Rosa genome provides new insights into the domestication of modern roses.

Roses have high cultural and economic importance as ornamental plants and in the perfume industry. We report the rose whole-genome sequencing and assembly and resequencing of major genotypes that contributed to rose domestication. We generated a homozygous genotype from a heterozygous diploid modern rose progenitor, Rosa chinensis ‘Old Blush’. Using single-molecule real-time sequencing and a meta-assembly approach, we obtained one of the most comprehensive plant genomes to date. Diversity analyses highlighted the mosaic origin of ‘La France’, one of the first hybrids combining the growth vigor of European species and the recurrent blooming of Chinese species. Genomic segments of Chinese ancestry identified new candidate genes for recurrent blooming. Reconstructing regulatory and secondary metabolism pathways allowed us to propose a model of interconnected regulation of scent and flower color. This genome provides a foundation for understanding the mechanisms governing rose traits and should accelerate improvement in roses, Rosaceae and ornamentals.


July 19, 2019

Introduction: The host-associated microbiome: Pattern, process and function.

An explosion of studies in recent years has established the ubiquity of host-associated microbes and their centrality to host biology (McFall-Ngai et al., 2013; Russell, Dubilier, & Rudgers, 2014). Microbes aid in digestion, modulate development, contribute to host immunity, mediate abiotic stress and more. While relationships with host-associated microbes are ubiquitous and important, they are cer- tainly not monolithic. Characterizing the microbial diversity associ- ated with an ever-broadening array of hosts (diverse animals, plants, algae and protists) has shown that essential functions can be per- formed by microbes that are integrated with the host to varying degrees, ranging from embedded endosymbionts to a variable cast of transient microbes acquired from the environment. The maturing host–microbiome field is now developing a mechanistic understand- ing of host/microbe relationships across this spectrum and the cross- talk mediating these interactions. Similarly, studies across systems are illuminating the ecological and evolutionary factors that shape host–microbe interactions today and providing hints into the origins of specific relationships.


July 19, 2019

Long-read sequencing and de novo genome assembly of Ammopiptanthus nanus, a desert shrub.

Ammopiptanthus nanus is a rare broad-leaved shrub that is found in the desert and arid regions of Central Asia. This plant species exhibits extremely high tolerance to drought and freezing and has been used in abiotic tolerance research in plants. As a relic of the tertiary period, A. nanus is of great significance to plant biogeographic research in the ancient Mediterranean region. Here, we report a draft genome assembly using the Pacific Biosciences (PacBio) platform and gene annotation for A. nanus.A total of 64.72 Gb of raw PacBio sequel reads were generated from four 20-kb libraries. After filtering, 64.53 Gb of clean reads were obtained, giving 72.59× coverage depth. Assembly using Canu gave an assembly length of 823.74 Mb, with a contig N50 of 2.76 Mb. The final size of the assembled A. nanus genome was close to the 889 Mb estimated by k-mer analysis. The gene annotation completeness was evaluated using Benchmarking Universal Single-Copy Orthologs; 1,327 of the 1,440 conserved genes (92.15%) could be found in the A. nanus assembly. Genome annotation revealed that 74.08% of the A. nanus genome is composed of repetitive elements and 53.44% is composed of long terminal repeat elements. We predicted ?37,188 protein-coding genes, of which 96.53% were functionally annotated.The genomic sequences of A. nanus could be a valuable source for comparative genomic analysis in the legume family and will be useful for understanding the phylogenetic relationships of the Thermopsideae and the evolutionary response of plant species to the Qinghai Tibetan Plateau uplift.


July 19, 2019

Surfing the genomic new wave.

In the last decade, high-throughput sequencing approaches have revolutionized the field of plant genomics. With the pace of technical improvement showing no sign of slowing what advances could be just around the corner.


July 19, 2019

Fern genomes elucidate land plant evolution and cyanobacterial symbioses.

Ferns are the closest sister group to all seed plants, yet little is known about their genomes other than that they are generally colossal. Here, we report on the genomes of Azolla filiculoides and Salvinia cucullata (Salviniales) and present evidence for episodic whole-genome duplication in ferns-one at the base of ‘core leptosporangiates’ and one specific to Azolla. One fern-specific gene that we identified, recently shown to confer high insect resistance, seems to have been derived from bacteria through horizontal gene transfer. Azolla coexists in a unique symbiosis with N2-fixing cyanobacteria, and we demonstrate a clear pattern of cospeciation between the two partners. Furthermore, the Azolla genome lacks genes that are common to arbuscular mycorrhizal and root nodule symbioses, and we identify several putative transporter genes specific to Azolla-cyanobacterial symbiosis. These genomic resources will help in exploring the biotechnological potential of Azolla and address fundamental questions in the evolution of plant life.


July 19, 2019

Extensive intraspecific gene order and gene structural variations between Mo17 and other maize genomes.

Maize is an important crop with a high level of genome diversity and heterosis. The genome sequence of a typical female line, B73, was previously released. Here, we report a de novo genome assembly of a corresponding male representative line, Mo17. More than 96.4% of the 2,183?Mb assembled genome can be accounted for by 362 scaffolds in ten pseudochromosomes with 38,620 annotated protein-coding genes. Comparative analysis revealed large gene-order and gene structural variations: approximately 10% of the annotated genes were mutually nonsyntenic, and more than 20% of the predicted genes had either large-effect mutations or large structural variations, which might cause considerable protein divergence between the two inbred lines. Our study provides a high-quality reference-genome sequence of an important maize germplasm, and the intraspecific gene order and gene structural variations identified should have implications for heterosis and genome evolution.


July 19, 2019

Identification and analysis of adenine N6-methylation sites in the rice genome.

DNA N6-methyladenine (6mA) is a non-canonical DNA modification that is present at low levels in different eukaryotes1-8, but its prevalence and genomic function in higher plants are unclear. Using mass spectrometry, immunoprecipitation and validation with analysis of single-molecule real-time sequencing, we observed that about 0.2% of all adenines are 6mA methylated in the rice genome. 6mA occurs most frequently at GAGG motifs and is mapped to about 20% of genes and 14% of transposable elements. In promoters, 6mA marks silent genes, but in bodies correlates with gene activity. 6mA overlaps with 5-methylcytosine (5mC) at CG sites in gene bodies and is complementary to 5mC at CHH sites in transposable elements. We show that OsALKBH1 may be potentially involved in 6mA demethylation in rice. The results suggest that 6mA is complementary to 5mC as an epigenomic mark in rice and reinforce a distinct role for 6mA as a gene expression-associated epigenomic mark in eukaryotes.


July 19, 2019

A near complete, chromosome-scale assembly of the black raspberry (Rubus occidentalis) genome.

The fragmented nature of most draft plant genomes has hindered downstream gene discovery, trait mapping for breeding, and other functional genomics applications. There is a pressing need to improve or finish draft plant genome assemblies.Here, we present a chromosome-scale assembly of the black raspberry genome using single-molecule real-time Pacific Biosciences sequencing and high-throughput chromatin conformation capture (Hi-C) genome scaffolding. The updated V3 assembly has a contig N50 of 5.1 Mb, representing an ~200-fold improvement over the previous Illumina-based version. Each of the 235 contigs was anchored and oriented into seven chromosomes, correcting several major misassemblies. Black raspberry V3 contains 47 Mb of new sequences including large pericentromeric regions and thousands of previously unannotated protein-coding genes. Among the new genes are hundreds of expanded tandem gene arrays that were collapsed in the Illumina-based assembly. Detailed comparative genomics with the high-quality V4 woodland strawberry genome (Fragaria vesca) revealed near-perfect 1:1 synteny with dramatic divergence in tandem gene array composition. Lineage-specific tandem gene arrays in black raspberry are related to agronomic traits such as disease resistance and secondary metabolite biosynthesis.The improved resolution of tandem gene arrays highlights the need to reassemble these highly complex and biologically important regions in draft plant genomes. The updated, high-quality black raspberry reference genome will be useful for comparative genomics across the horticulturally important Rosaceae family and enable the development of marker assisted breeding in Rubus.


July 19, 2019

How well can we create phased, diploid, human genomes?: An assessment of FALCON-Unzip phasing using a human trio

Long read sequencing technology has allowed researchers to create de novo assemblies with impressive continuity[1,2]. This advancement has dramatically increased the number of reference genomes available and hints at the possibility of a future where personal genomes are assembled rather than resequenced. In 2016 Pacific Biosciences released the FALCON-Unzip framework, which can provide long, phased haplotype contigs from de novo assemblies. This phased genome algorithm enhances the accuracy of highly heterozygous organisms and allows researchers to explore questions that require haplotype information such as allele-specific expression and regulation. However, validation of this technique has been limited to small genomes or inbred individuals[3]. As a roadmap to personal genome assembly and phasing, we assess the phasing accuracy of FALCON-Unzip in humans using publicly available data for the Ashkenazi trio from the Genome in a Bottle Consortium[4]. To assess the accuracy of the Unzip algorithm, we assembled the genome of the son using FALCON and FALCON Unzip, genotyped publicly available short read data for the mother and the father, and observed the inheritance pattern of the parental SNPs along the phased genome of the son. We found that 72.8% of haplotype contigs share SNPs with only one parent suggesting that these contigs are correctly phased. Most mis-phased SNPs are random but present in high frequency toward the end of haplotype contigs. Approximately 20.7% of mis-phased haplotype contigs contain clusters of mis-phased SNPs, suggesting that haplotypes were mis-joined by FALCON-Unzip. Mis-joined boundaries in those contigs are located in areas of low SNP density. This research demonstrates that the FALCON-Unzip algorithm can be used to create long and accurate haplotypes for humans and identifies problematic regions that could benefit in future improvement.


July 19, 2019

Accelerated ex situ breeding of GBSS- and PTST1-edited cassava for modified starch.

Crop diversification required to meet demands for food security and industrial use is often challenged by breeding time and amenability of varieties to genome modification. Cassava is one such crop. Grown for its large starch-rich storage roots, it serves as a staple food and a commodity in the multibillion-dollar starch industry. Starch is composed of the glucose polymers amylopectin and amylose, with the latter strongly influencing the physicochemical properties of starch during cooking and processing. We demonstrate that CRISPR-Cas9 (clustered regularly interspaced short palindromic repeats/CRISPR-associated protein 9)-mediated targeted mutagenesis of two genes involved in amylose biosynthesis, PROTEIN TARGETING TO STARCH (PTST1) or GRANULE BOUND STARCH SYNTHASE (GBSS), can reduce or eliminate amylose content in root starch. Integration of the Arabidopsis FLOWERING LOCUS T gene in the genome-editing cassette allowed us to accelerate flowering-an event seldom seen under glasshouse conditions. Germinated seeds yielded S1, a transgene-free progeny that inherited edited genes. This attractive new plant breeding technique for modified cassava could be extended to other crops to provide a suite of novel varieties with useful traits for food and industrial applications.


July 19, 2019

From short reads to chromosome-scale genome assemblies.

A high-quality, annotated genome assembly is the foundation for many downstream studies. However, obtaining such an assembly is a complex, reiterative process that requires the assimilation of high-quality data and combines different approaches and data types. While some software packages incorporating multiple steps of genome assembly are commercially available, they may not be flexible enough to be routinely applied to all organisms, particularly to nonmodel species such as pathogenic oomycetes and fungi. If researchers understand and apply the most appropriate, currently available tools for each step, it is possible to customize parameters and optimize results for their organism of study. Based on our experience of de novo assembly and annotation of several oomycete species, this chapter provides a modular workflow from processing of raw reads, to initial assembly generation, through optimization, chromosome-scale scaffolding and annotation, outlining input and output data as well as examples and alternative software used for each step. The accompanying Notes provide background information for each step as well as alternative options. The final result of this workflow could be an annotated, high-quality, validated, chromosome-scale assembly or a draft assembly of sufficient quality to meet specific needs of a project.


Talk with an expert

If you have a question, need to check the status of an order, or are interested in purchasing an instrument, we're here to help.