Menu
July 7, 2019  |  

Exploiting next-generation sequencing to solve the haplotyping puzzle in polyploids: a simulation study.

Haplotypes are the units of inheritance in an organism, and many genetic analyses depend on their precise determination. Methods for haplotyping single individuals use the phasing information available in next-generation sequencing reads, by matching overlapping single-nucleotide polymorphisms while penalizing post hoc nucleotide corrections made. Haplotyping diploids is relatively easy, but the complexity of the problem increases drastically for polyploid genomes, which are found in both model organisms and in economically relevant plant and animal species. Although a number of tools are available for haplotyping polyploids, the effects of the genomic makeup and the sequencing strategy followed on the accuracy of these methods have hitherto not been thoroughly evaluated.We developed the simulation pipeline haplosim to evaluate the performance of three haplotype estimation algorithms for polyploids: HapCompass, HapTree and SDhaP, in settings varying in sequencing approach, ploidy levels and genomic diversity, using tetraploid potato as the model. Our results show that sequencing depth is the major determinant of haplotype estimation quality, that 1?kb PacBio circular consensus sequencing reads and Illumina reads with large insert-sizes are competitive and that all methods fail to produce good haplotypes when ploidy levels increase. Comparing the three methods, HapTree produces the most accurate estimates, but also consumes the most resources. There is clearly room for improvement in polyploid haplotyping algorithms.


July 7, 2019  |  

Current advances in genome sequencing of common wheat and its ancestral species

Common wheat is an important and widely cultivated food crop throughout the world. Much progress has been made in regard to wheat genome sequencing in the last decade. Starting from the sequencing of single chromosomes/chromosome arms whole genome sequences of common wheat and its diploid and tetraploid ancestors have been decoded along with the development of sequencing and assembling technologies. In this review, we give a brief summary on international progress in wheat genome sequencing, and mainly focus on reviewing the effort and contributions made by Chinese scientists.


July 7, 2019  |  

Development of molecular markers linked to powdery mildew resistance GenePm4bby combining SNP discovery from transcriptome sequencing data with bulked segregant analysis (BSR-Seq) in wheat.

Powdery mildew resistance genePm4b, originating fromTriticum persicum, is effective against the prevalentBlumeria graminisf. sp.tritici(Bgt) isolates from certain regions of wheat production in China. The lack of tightly linked molecular markers with the target gene prevents the precise identification ofPm4bduring the application of molecular marker-assisted selection (MAS). The strategy that combines the RNA-Seq technique and the bulked segregant analysis (BSR-Seq) was applied in an F2:3mapping population (237 families) derived from a pair of isogenic lines VPM1/7*Bainong 3217 F4(carryingPm4b) and Bainong 3217 to develop more closely linked molecular markers. RNA-Seq analysis of the two phenotypically contrasting RNA bulks prepared from the representative F2:3families generated 20,745,939 and 25,867,480 high-quality read pairs, and 82.8 and 80.2% of them were uniquely mapped to the wheat whole genome draft assembly for the resistant and susceptible RNA bulks, respectively. Variant calling identified 283,866 raw single nucleotide polymorphisms (SNPs) and InDels between the two bulks. The SNPs that were closely associated with the powdery mildew resistance were concentrated on chromosome 2AL. Among the 84 variants that were potentially associated with the disease resistance trait, 46 variants were enriched in an about 25 Mb region at the distal end of chromosome arm 2AL. FourPm4b-linked SNP markers were developed from these variants. Based on the sequences of Chinese Spring where these polymorphic SNPs were located, 98 SSR primer pairs were designed to develop distal markers flanking thePm4bgene. Three SSR markers,Xics13,Xics43, andXics76, were incorporated in the new genetic linkage map, which locatedPm4bin a 3.0 cM genetic interval spanning a 6.7 Mb physical genomic region. This region had a collinear relationship withBrachypodium distachyonchromosome 5, rice chromosome 4, and sorghum chromosome 6. Seven genes associated with disease resistance were predicted in this collinear genomic region, which included C2 domain protein, peroxidase activity protein, protein kinases of PKc_like super family, Mlo family protein, and catalytic domain of the serine/threonine kinases (STKc_IRAK like super family). The markers developed in the present study facilitate identification ofPm4bduring its MAS practice.


July 7, 2019  |  

Identification and expression analysis of wheat TaGF14 genes.

The 14-3-3 gene family members play key roles in various cellular processes. However, little is known about the numbers and roles of 14-3-3 genes in wheat. The aims of this study were to identify TaGF14 numbers in wheat by searching its whole genome through blast, to study the phylogenetic relationships with other plant species and to discuss the functions of TaGF14s. The results showed that common wheat harbored 20 TaGF14 genes, located on wheat chromosome groups 2, 3, 4, and 7. Out of them, eighteen TaGF14s are non-e proteins, and two wheat TaGF14 genes, TaGF14i and TaGF14f, are e proteins. Phylogenetic analysis indicated that these genes were divided into six clusters: cluster 1 (TaGF14d, TaGF14g, TaGF14j, TaGF14h, TaGF14c, and TaGF14n); cluster 2 (TaGF14k); cluster 3 (TaGF14b, TaGF14l, TaGF14m, and TaGF14s); cluster 4 (TaGF14a, TaGF14e, and TaGF14r); cluster 5 (TaGF14i and TaGF14f); and cluster 6 (TaGF14o, TaGF14p, TaGF14q, and TaGF14t). Tissue-specific gene expressions suggested that all TaGF14s were likely constitutively expressed, except two genes, i.e., TaGF14p and TaGF14f. And the highest amount of TaGF14 transcripts were observed in developing grains at 20 days post anthesis (DPA), especially for TaGF14j and TaGF14l. After drought stress, five genes, i.e., TaGF14c, TaGF14d, TaGF14g, TaGF14h, and TaGF14j, were up-regulated expression under drought stress for both 1 and 6 h, suggesting these genes played vital role in combating against drought stress. However, all the TaGF14s were down-regulated expression under heat stress for both 1 and 6 h, indicating TaGF14s may be negatively associated with heat stress by reducing the expression to combat heat stress or through other pathways. These results suggested that cluster 1, e.g., TaGF14j, may participate in the whole wheat developing stages, e.g., grain-filling (starch biosynthesis) and may also participate in combating against drought stress. Subsequently, a homolog of TaGF14j, TaGF14-JM22, were cloned by RACE and used to validate its function. Immunoblotting results showed that TaGF14-JM22 protein, closely related to TaGF14d, TaGF14g, and TaGF14j, can interact with AGP-L, SSI, SSII, SBEIIa, and SBEIIb in developing grains, suggesting that TaGF14s located on group 4 may be involved in starch biosynthesis. Therefore, it is possible to develop starch-rich wheat cultivars by modifying TaGF14s.


July 7, 2019  |  

Sustaining global agriculture through rapid detection and deployment of genetic resistance to deadly crop diseases.

Contents Summary 45 I. Introduction 45 II. Targeted chromosome-based cloning via long-range assembly (TACCA) 46 III. Resistance gene cloning through mutational mapping (MutMap) 47 IV. Cloning through mutant chromosome sequencing (MutChromSeq) 47 V. Rapid cloning through resistance gene enrichment and sequencing (RenSeq) 49 VI. Cloning resistance genes through transcriptome profiling (RNAseq) 49 VII. Resistance gene deployment strategies 49 VIII. Conclusions 50 Acknowledgements 50 References 50 SUMMARY: Genetically encoded resistance is a major component of crop disease management. Historically, gene loci conferring resistance to pathogens have been identified through classical genetic methods. In recent years, accelerated gene cloning strategies have become available through advances in sequencing, gene capture and strategies for reducing genome complexity. Here, I describe these approaches with key emphasis on the isolation of resistance genes to the cereal crop diseases that are an ongoing threat to global food security. Rapid gene isolation enables their efficient deployment through marker-assisted selection and transgenic technology. Together with innovations in genome editing and progress in pathogen virulence studies, this creates further opportunities to engineer long-lasting resistance. These approaches will speed progress towards a future of farming using fewer pesticides.© 2017 Commonwealth of Australia. New Phytologist © 2017 New Phytologist Trust.


July 7, 2019  |  

Satellite DNA evolution: old ideas, new approaches.

A substantial portion of the genomes of most multicellular eukaryotes consists of large arrays of tandemly repeated sequence, collectively called satellite DNA. The processes generating and maintaining different satellite DNA abundances across lineages are important to understand as satellites have been linked to chromosome mis-segregation, disease phenotypes, and reproductive isolation between species. While much theory has been developed to describe satellite evolution, empirical tests of these models have fallen short because of the challenges in assessing satellite repeat regions of the genome. Advances in computational tools and sequencing technologies now enable identification and quantification of satellite sequences genome-wide. Here, we describe some of these tools and how their applications are furthering our knowledge of satellite evolution and function. Copyright © 2018 Elsevier Ltd. All rights reserved.


July 7, 2019  |  

The case for not masking away repetitive DNA

In the course of analyzing whole-genome data, it is common practice to mask or filter out repetitive regions of a genome, such as transposable elements and endogenous retroviruses, in order to focus only on genes and thus simplify the results. This Commentary is a plea from one member of the Mobile DNA community to all gene-centric researchers: please do not ignore the repetitive fraction of the genome. Please stop narrowing your findings by only analyzing a minority of the genome, and instead broaden your analyses to include the rich biology of repetitive and mobile DNA. In this article, I present four arguments supporting a case for retaining repetitive DNA in your genome-wide analysis.


July 7, 2019  |  

The challenge of analyzing the sugarcane genome.

Reference genome sequences have become key platforms for genetics and breeding of the major crop species. Sugarcane is probably the largest crop produced in the world (in weight of crop harvested) but lacks a reference genome sequence. Sugarcane has one of the most complex genomes in crop plants due to the extreme level of polyploidy. The genome of modern sugarcane hybrids includes sub-genomes from two progenitors Saccharum officinarum and S. spontaneum with some chromosomes resulting from recombination between these sub-genomes. Advancing DNA sequencing technologies and strategies for genome assembly are making the sugarcane genome more tractable. Advances in long read sequencing have allowed the generation of a more complete set of sugarcane gene transcripts. This is supporting transcript profiling in genetic research. The progenitor genomes are being sequenced. A monoploid coverage of the hybrid genome has been obtained by sequencing BAC clones that cover the gene space of the closely related sorghum genome. The complete polyploid genome is now being sequenced and assembled. The emerging genome will allow comparison of related genomes and increase understanding of the functioning of this polyploidy system. Sugarcane breeding for traditional sugar and new energy and biomaterial uses will be enhanced by the availability of these genomic resources.


July 7, 2019  |  

Fast-SG: an alignment-free algorithm for hybrid assembly.

Long-read sequencing technologies are the ultimate solution for genome repeats, allowing near reference-level reconstructions of large genomes. However, long-read de novo assembly pipelines are computationally intense and require a considerable amount of coverage, thereby hindering their broad application to the assembly of large genomes. Alternatively, hybrid assembly methods that combine short- and long-read sequencing technologies can reduce the time and cost required to produce de novo assemblies of large genomes.Here, we propose a new method, called Fast-SG, that uses a new ultrafast alignment-free algorithm specifically designed for constructing a scaffolding graph using light-weight data structures. Fast-SG can construct the graph from either short or long reads. This allows the reuse of efficient algorithms designed for short-read data and permits the definition of novel modular hybrid assembly pipelines. Using comprehensive standard datasets and benchmarks, we show how Fast-SG outperforms the state-of-the-art short-read aligners when building the scaffoldinggraph and can be used to extract linking information from either raw or error-corrected long reads. We also show how a hybrid assembly approach using Fast-SG with shallow long-read coverage (5X) and moderate computational resources can produce long-range and accurate reconstructions of the genomes of Arabidopsis thaliana (Ler-0) and human (NA12878).Fast-SG opens a door to achieve accurate hybrid long-range reconstructions of large genomes with low effort, high portability, and low cost.


July 7, 2019  |  

Genome-wide characterization and phylogenetic analysis of GSK gene family in three species of cotton: evidence for a role of some GSKs in fiber development and responses to stress

Background: The glycogen synthase kinase 3/shaggy kinase (GSK3) is a serine/threonine kinase with important roles in animals. Although GSK3 genes have been studied for more than 30years, plant GSK genes have been studied only since the last decade. Previous research has confirmed that plant GSK genes are involved in diverse processes, including floral development, brassinosteroid signaling, and responses to abiotic stresses. Result: In this study, 20, 15 (including 5 different transcripts) and 10 GSK genes were identified in G. hirsutum, G. raimondii and G. arboreum, respectively. A total of 65 genes from Arabidopsis, rice, and cotton were classified into 4 clades. High similarities were found in GSK3 protein sequences, conserved motifs, and gene structures, as well as good concordance in gene pairwise comparisons (G. hirsutum vs. G. arboreum, G. hirsutum vs. G. raimondii, and G. arboreum vs. G. raimondii) were observed. Whole genome duplication (WGD) within At and Dt sub-genomes has been central to the expansion of the GSK gene family. Furthermore, GhSK genes showed diverse expression patterns in various tissues. Additionally, the expression profiles of GhSKs under different stress treatments demonstrated that many are stress-responsive genes. However, none were induced by brassinolide treatment. Finally, nine co-expression sub- networks were observed for GhSKs and the functional annotations of these genes suggested that some GhSKs might be involved in cotton fiber development. Conclusion: In this present work, we identified 45 GSK genes from three cotton species, which were divided into four clades. The gene features, muti-alignment, conversed motifs, and syntenic blocks indicate that they have been highly conserved during evolution. Whole genome duplication was determined to be the dominant factor for GSK gene family expansion. The analysis of co-expressed sub-networks and tissue-specific expression profiles suggested functions of GhSKs during fiber development. Moreover, their different responses to various abiotic stresses indicated great functional diversity amongst the GhSKs. Briefly, data presented herein may serve as the basis for future functional studies of GhSKs.


Talk with an expert

If you have a question, need to check the status of an order, or are interested in purchasing an instrument, we're here to help.