Menu
July 7, 2019

Proteogenomics produces comprehensive and highly accurate protein-coding gene annotation in a complete genome assembly of Malassezia sympodialis.

Complete and accurate genome assembly and annotation is a crucial foundation for comparative and functional genomics. Despite this, few complete eukaryotic genomes are available, and genome annotation remains a major challenge. Here, we present a complete genome assembly of the skin commensal yeast Malassezia sympodialis and demonstrate how proteogenomics can substantially improve gene annotation. Through long-read DNA sequencing, we obtained a gap-free genome assembly for M. sympodialis (ATCC 42132), comprising eight nuclear and one mitochondrial chromosome. We also sequenced and assembled four M. sympodialis clinical isolates, and showed their value for understanding Malassezia reproduction by confirming four alternative allele combinations at the two mating-type loci. Importantly, we demonstrated how proteomics data could be readily integrated with transcriptomics data in standard annotation tools. This increased the number of annotated protein-coding genes by 14% (from 3612 to 4113), compared to using transcriptomics evidence alone. Manual curation further increased the number of protein-coding genes by 9% (to 4493). All of these genes have RNA-seq evidence and 87% were confirmed by proteomics. The M. sympodialis genome assembly and annotation presented here is at a quality yet achieved only for a few eukaryotic organisms, and constitutes an important reference for future host-microbe interaction studies.© The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.


July 7, 2019

The origin, diversification and adaptation of a major mangrove clade (Rhizophoreae) revealed by whole-genome sequencing

Mangroves invade some very marginal habitats for woody plants—at the interface between land and sea. Since mangroves anchor tropical coastal communities globally, their origin, diversification and adaptation are of scientific significance, particularly at a time of global climate change. In this study, a combination of single-molecule long reads and the more conventional short reads are generated from Rhizophora apiculata for the de novo assembly of its genome to a near chromosome level. The longest scaffold, N50 and N90 for the R. apiculata genome, are 13.3 Mb, 5.4 Mb and 1.0 Mb, respectively. Short reads for the genomes and transcriptomes of eight related species are also generated. We find that the ancestor of Rhizophoreae experienced a whole-genome duplication ~70 Myrs ago, which is followed rather quickly by colonization and species diversification. Mangroves exhibit pan-exome modifications of amino acid (AA) usage as well as unusual AA substitutions among closely related species. The usage and substitution of AAs, unique among plants surveyed, is correlated with the rapid evolution of proteins in mangroves. A small subset of these substitutions is associated with mangroves’ highly specialized traits (vivipary and red bark) thought to be adaptive in the intertidal habitats. Despite the many adaptive features, mangroves are among the least genetically diverse plants, likely the result of continual habitat turnovers caused by repeated rises and falls of sea level in the geologically recent past. Mangrove genomes thus inform about their past evolutionary success as well as portend a possibly difficult future.


July 7, 2019

Complete genome sequence of the drought resistance-promoting endophyte Klebsiella sp. LTGPAF-6F.

Bacterial endophytes with capacity to promote plant growth and improve plant tolerance against biotic and abiotic stresses have importance in agricultural practice and phytoremediation. A plant growth-promoting endophyte named Klebsiella sp. LTGPAF-6F, which was isolated from the roots of the desert plant Alhagi sparsifolia in north-west China, exhibits the ability to enhance the growth of wheat under drought stress. The complete genome sequence of this strain consists of one circular chromosome and two circular plasmids. From the genome, we identified genes related to the plant growth promotion and stress tolerance, such as nitrogen fixation, production of indole-3-acetic acid, acetoin, 2,3-butanediol, spermidine and trehalose. This genome sequence provides a basis for understanding the beneficial interactions between LTGPAF-6F and host plants, and will facilitate its applications as biotechnological agents in agriculture. Copyright © 2017 Elsevier B.V. All rights reserved.


July 7, 2019

A large gene family in fission yeast encodes spore killers that subvert Mendel’s law.

Spore killers in fungi are selfish genetic elements that distort Mendelian segregation in their favor. It remains unclear how many species harbor them and how diverse their mechanisms are. Here, we discover two spore killers from a natural isolate of the fission yeast Schizosaccharomyces pombe. Both killers belong to the previously uncharacterized wtf gene family with 25 members in the reference genome. These two killers act in strain-background-independent and genome-location-independent manners to perturb the maturation of spores not inheriting them. Spores carrying one killer are protected from its killing effect but not that of the other killer. The killing and protecting activities can be uncoupled by mutation. The numbers and sequences of wtf genes vary considerably between S. pombe isolates, indicating rapid divergence. We propose that wtf genes contribute to the extensive intraspecific reproductive isolation in S. pombe, and represent ideal models for understanding how segregation-distorting elements act and evolve.


July 7, 2019

The complete genome sequence of Bacillus velezensis 9912D reveals its biocontrol mechanism as a novel commercial biological fungicide agent.

A Bacillus sp. 9912 mutant, 9912D, was approved as a new biological fungicide agent by the Ministry of Agriculture of the People’s Republic of China in 2016 owing to its excellent inhibitory effect on various plant pathogens and being environment-friendly. Here, we present the genome of 9912D with a circular chromosome having 4436 coding DNA sequences (CDSs), and a circular plasmid encoding 59 CDSs. This strain was finally designated as Bacillus velezensis based on phylogenomic analyses. Genome analysis revealed a total of 19 candidate gene clusters involved in secondary metabolite biosynthesis, including potential new type II lantibiotics. The absence of fengycin biosynthetic gene cluster is noteworthy. Our data offer insights into the genetic, biological and physiological characteristics of this strain and aid in deeper understanding of its biocontrol mechanism. Copyright © 2017 Elsevier B.V. All rights reserved.


July 7, 2019

Phylogenomic analysis supports multiple instances of polyphyly in the oomycete peronosporalean lineage.

The study of biological diversification of oomycetes has been a difficult task for more than a century. Pioneer researchers used morphological characters to describe this heterogeneous group, and physiological and genetic tools expanded knowledge of these microorganisms. However, research on oomycete diversification is limited by conflicting phylogenies. Using whole genomic data from 17 oomycete taxa, we obtained a dataset of 277 core orthologous genes shared among these genomes. Analyses of this dataset resulted in highly congruent and strongly supported estimates of oomycete phylogeny when we used concatenated maximum likelihood and coalescent-based methods; the one important exception was the position of Albugo. Our results supported the position of Phytopythium vexans (formerly in Pythium clade K) as a sister clade to the Phytophthora-Hyaloperonospora clade. The remaining clades comprising Pythium sensu lato formed two monophyletic groups. One group was composed of three taxa that correspond to Pythium clades A, B and C, and the other group contained taxa representing clades F, G and I, in agreement with previous Pythium phylogenies. However, the group containing Pythium clades F, G and I was placed as sister to the Phytophthora-Hyaloperonospora-Phytopythium clade, thus confirming the lack of monophyly of Pythium sensu lato. Multispecies coalescent methods revealed that the white blister rust, Albugo laibachii, could not be placed with a high degree of confidence. Our analyses show that genomic data can resolve the oomycete phylogeny and provide a phylogenetic framework to study the evolution of oomycete lifestyles. Copyright © 2017 Elsevier Inc. All rights reserved.


July 7, 2019

Chromosome-level genome assembly and transcriptome of the green alga Chromochloris zofingiensis illuminates astaxanthin production.

Microalgae have potential to help meet energy and food demands without exacerbating environmental problems. There is interest in the unicellular green alga Chromochloris zofingiensis, because it produces lipids for biofuels and a highly valuable carotenoid nutraceutical, astaxanthin. To advance understanding of its biology and facilitate commercial development, we present a C. zofingiensis chromosome-level nuclear genome, organelle genomes, and transcriptome from diverse growth conditions. The assembly, derived from a combination of short- and long-read sequencing in conjunction with optical mapping, revealed a compact genome of ~58 Mbp distributed over 19 chromosomes containing 15,274 predicted protein-coding genes. The genome has uniform gene density over chromosomes, low repetitive sequence content (~6%), and a high fraction of protein-coding sequence (~39%) with relatively long coding exons and few coding introns. Functional annotation of gene models identified orthologous families for the majority (~73%) of genes. Synteny analysis uncovered localized but scrambled blocks of genes in putative orthologous relationships with other green algae. Two genes encoding beta-ketolase (BKT), the key enzyme synthesizing astaxanthin, were found in the genome, and both were up-regulated by high light. Isolation and molecular analysis of astaxanthin-deficient mutants showed that BKT1 is required for the production of astaxanthin. Moreover, the transcriptome under high light exposure revealed candidate genes that could be involved in critical yet missing steps of astaxanthin biosynthesis, including ABC transporters, cytochrome P450 enzymes, and an acyltransferase. The high-quality genome and transcriptome provide insight into the green algal lineage and carotenoid production.


July 7, 2019

Evolutionary strata on young mating-type chromosomes despite the lack of sexual antagonism.

Sex chromosomes can display successive steps of recombination suppression known as “evolutionary strata,” which are thought to result from the successive linkage of sexually antagonistic genes to sex-determining genes. However, there is little evidence to support this explanation. Here we investigate whether evolutionary strata can evolve without sexual antagonism using fungi that display suppressed recombination extending beyond loci determining mating compatibility despite lack of male/female roles associated with their mating types. By comparing full-length chromosome assemblies from five anther-smut fungi with or without recombination suppression in their mating-type chromosomes, we inferred the ancestral gene order and derived chromosomal arrangements in this group. This approach shed light on the chromosomal fusion underlying the linkage of mating-type loci in fungi and provided evidence for multiple clearly resolved evolutionary strata over a range of ages (0.9-2.1 million years) in mating-type chromosomes. Several evolutionary strata did not include genes involved in mating-type determination. The existence of strata devoid of mating-type genes, despite the lack of sexual antagonism, calls for a unified theory of sex-related chromosome evolution, incorporating, for example, the influence of partially linked deleterious mutations and the maintenance of neutral rearrangement polymorphism due to balancing selection on sexes and mating types.


July 7, 2019

N-glycan maturation mutants in Lotus japonicus for basic and applied glycoprotein research.

Studies of protein N-glycosylation are important for answering fundamental questions on the diverse functions of glycoproteins in plant growth and development. Here we generated and characterised a comprehensive collection of Lotus japonicusLORE1 insertion mutants, each lacking the activity of one of the 12 enzymes required for normal N-glycan maturation in the glycosylation machinery. The inactivation of the individual genes resulted in altered N-glycan patterns as documented using mass spectrometry and glycan-recognising antibodies, indicating successful identification of null mutations in the target glyco-genes. For example, both mass spectrometry and immunoblotting experiments suggest that proteins derived from the a1,3-fucosyltransferase (Lj3fuct) mutant completely lacked a1,3-core fucosylation. Mass spectrometry also suggested that the Lotus japonicus convicilin 2 was one of the main glycoproteins undergoing differential expression/N-glycosylation in the mutants. Demonstrating the functional importance of glycosylation, reduced growth and seed production phenotypes were observed for the mutant plants lacking functional mannosidase I, N-acetylglucosaminyltransferase I, and a1,3-fucosyltransferase, even though the relative protein composition and abundance appeared unaffected. The strength of our N-glycosylation mutant platform is the broad spectrum of resulting glycoprotein profiles and altered physiological phenotypes that can be produced from single, double, triple and quadruple mutants. This platform will serve as a valuable tool for elucidating the functional role of protein N-glycosylation in plants. Furthermore, this technology can be used to generate stable plant mutant lines for biopharmaceutical production of glycoproteins displaying relative homogeneous and mammalian-like N-glycosylation features.© 2017 The Authors The Plant Journal © 2017 John Wiley & Sons Ltd.


July 7, 2019

ALUMINUM RESISTANCE TRANSCRIPTION FACTOR 1 (ART1) contributes to natural variation in aluminum resistance in diverse genetic backgrounds of rice (O. sativa)

Abstract Transcription factors (TFs) regulate the expression of other genes to indirectly mediate stress resistance mechanisms. Therefore, when studying TF-mediated stress resistance, it is important to understand how TFs interact with genes in the genetic background. Here, we fine-mapped the aluminum (Al) resistance QTL Alt12.1 to a 44-kb region containing six genes. Among them is ART1, which encodes a C2H2-type zinc finger TF required for Al resistance in rice. The mapping parents, Al-resistant cv Azucena (tropical japonica) and Al-sensitive cv IR64 (indica), have extensive sequence polymorphism within the ART1 coding region, but similar ART1 expression levels. Using reciprocal near-isogenic lines (NILs) we examined how allele-swapping the Alt12.1 locus would affect plant responses to Al. Analysis of global transcriptional responses to Al stress in roots of the NILs alongside their recurrent parents demonstrated that the presence of the Alt12.1 from Al-resistant Azucena led to greater changes in gene expression in response to Al when compared to the Alt12.1 from IR64 in both genetic backgrounds. The presence of the ART1 allele from the opposite parent affected the expression of several genes not previously implicated in rice Al tolerance. We highlight examples where putatively functional variation in cis-regulatory regions of ART1-regulated genes interacts with ART1 to determine gene expression in response to Al. This ART1–promoter interaction may be associated with transgressive variation for Al resistance in the Azucena × IR64 population. These results illustrate how ART1 interacts with the genetic background to contribute to quantitative phenotypic variation in rice Al resistance.


July 7, 2019

Genetic control of plasticity of oil yield for combined abiotic stresses using a joint approach of crop modelling and genome-wide association.

Understanding the genetic basis of phenotypic plasticity is crucial for predicting and managing climate change effects on wild plants and crops. Here, we combined crop modelling and quantitative genetics to study the genetic control of oil yield plasticity for multiple abiotic stresses in sunflower. First, we developed stress indicators to characterize 14 environments for three abiotic stresses (cold, drought and nitrogen) using the SUNFLO crop model and phenotypic variations of three commercial varieties. The computed plant stress indicators better explain yield variation than descriptors at the climatic or crop levels. In those environments, we observed oil yield of 317 sunflower hybrids and regressed it with three selected stress indicators. The slopes of cold stress norm reaction were used as plasticity phenotypes in the following genome-wide association study. Among the 65 534 tested Single Nucleotide Polymorphisms (SNPs), we identified nine quantitative trait loci controlling oil yield plasticity to cold stress. Associated single nucleotide polymorphisms are localized in genes previously shown to be involved in cold stress responses: oligopeptide transporters, lipid transfer protein, cystatin, alternative oxidase or root development. This novel approach opens new perspectives to identify genomic regions involved in genotype-by-environment interaction of a complex traits to multiple stresses in realistic natural or agronomical conditions.© 2017 John Wiley & Sons Ltd.


July 7, 2019

Generation of a collection of mutant tomato lines using pooled CRISPR libraries.

The high efficiency of clustered regularly interspaced short palindromic repeats (CRISPR)-mediated mutagenesis in plants enables the development of high-throughput mutagenesis strategies. By transforming pooled CRISPR libraries into tomato (Solanum lycopersicum), collections of mutant lines were generated with minimal transformation attempts and in a relatively short period of time. Identification of the targeted gene(s) was easily determined by sequencing the incorporated guide RNA(s) in the primary transgenic events. From a single transformation with a CRISPR library targeting the immunity-associated leucine-rich repeat subfamily XII genes, heritable mutations were recovered in 15 of the 54 genes targeted. To increase throughput, a second CRISPR library was made containing three guide RNAs per construct to target 18 putative transporter genes. This resulted in stable mutations in 15 of the 18 targeted genes, with some primary transgenic plants having as many as five mutated genes. Furthermore, the redundancy in this collection of plants allowed for the association of aberrant T0 phenotypes with the underlying targeted genes. Plants with mutations in a homolog of an Arabidopsis (Arabidopsis thaliana) boron efflux transporter displayed boron deficiency phenotypes. The strategy described here provides a technically simple yet high-throughput approach for generating a collection of lines with targeted mutations and should be applicable to any plant transformation system.© 2017 American Society of Plant Biologists. All Rights Reserved.


July 7, 2019

Genomics-enabled analysis of the emergent disease cotton bacterial blight.

Cotton bacterial blight (CBB), an important disease of (Gossypium hirsutum) in the early 20th century, had been controlled by resistant germplasm for over half a century. Recently, CBB re-emerged as an agronomic problem in the United States. Here, we report analysis of cotton variety planting statistics that indicate a steady increase in the percentage of susceptible cotton varieties grown each year since 2009. Phylogenetic analysis revealed that strains from the current outbreak cluster with race 18 Xanthomonas citri pv. malvacearum (Xcm) strains. Illumina based draft genomes were generated for thirteen Xcm isolates and analyzed along with 4 previously published Xcm genomes. These genomes encode 24 conserved and nine variable type three effectors. Strains in the race 18 clade contain 3 to 5 more effectors than other Xcm strains. SMRT sequencing of two geographically and temporally diverse strains of Xcm yielded circular chromosomes and accompanying plasmids. These genomes encode eight and thirteen distinct transcription activator-like effector genes. RNA-sequencing revealed 52 genes induced within two cotton cultivars by both tested Xcm strains. This gene list includes a homeologous pair of genes, with homology to the known susceptibility gene, MLO. In contrast, the two strains of Xcm induce different clade III SWEET sugar transporters. Subsequent genome wide analysis revealed patterns in the overall expression of homeologous gene pairs in cotton after inoculation by Xcm. These data reveal important insights into the Xcm-G. hirsutum disease complex and strategies for future development of resistant cultivars.


July 7, 2019

Sequencing the genomic regions flanking S-linked PvGLO sequences confirms the presence of two GLO loci, one of which lies adjacent to the style-length determinant gene CYP734A50.

Primula vulgaris contains two GLOBOSA loci, one located adjacent to the style length determinant gene CYP734A50 which lies within the S -locus. Using a combination of BAC walking and PacBio sequencing, we have sequenced two substantial genomic contigs in and around the S-locus of Primula vulgaris. Using these data, we were able to demonstrate that two alleles of PvGlo (P) as well as PvGlo (T) can be present in the genome of a single plant, providing empirical evidence that these two forms of the MADS-box gene GLOBOSA are separate loci and not allelic as previously reported. We propose they should be renamed PvGLO1 and PvGLO2. BAC contigs extending from each GLOBOSA locus were identified and fully sequenced. No homologous genes were found between the contigs other than the GLOBOSA genes themselves, consistent with their identity as separate loci. Exons of the recently identified style-length determinant gene CYP734A50 were identified on one end of the contig containing PvGLO2 and these genes are adjacent in the genome, suggesting that PvGLO2 lies either within or at least very close to the S-locus. Current evidence suggests that both CYP734A50 and GLO2 are specific to the S-morph mating type and are hemizygous rather than heterozygous in the Primula genome. This finding contrasts classical models of the HSI locus, which propose that components of the S-locus are allelic, suggesting that these models may need to be reconsidered.


July 7, 2019

Euglena gracilis genome and transcriptome: organelles, nuclear genome assembly strategies and initial features.

Euglena gracilis is a major component of the aquatic ecosystem and together with closely related species, is ubiquitous worldwide. Euglenoids are an important group of protists, possessing a secondarily acquired plastid and are relatives to the Kinetoplastidae, which themselves have global impact as disease agents. To understand the biology of E. gracilis, as well as to provide further insight into the evolution and origins of the Kinetoplastidae, we embarked on sequencing the nuclear genome; the plastid and mitochondrial genomes are already in the public domain. Earlier studies suggested an extensive nuclear DNA content, with likely a high degree of repetitive sequence, together with significant extrachromosomal elements. To produce a list of coding sequences we have combined transcriptome data from both published and new sources, as well as embarked on de novo sequencing using a combination of 454, Illumina paired end libraries and long PacBio reads. Preliminary analysis suggests a surprisingly large genome approaching 2 Gbp, with a highly fragmented architecture and extensive repeat composition. Over 80% of the RNAseq reads from E. gracilis maps to the assembled genome sequence, which is comparable with the well assembled genomes of T. brucei and T. cruzi. In order to achieve this level of assembly we employed multiple informatics pipelines, which are discussed here. Finally, as a preliminary view of the genome architecture, we discuss the tubulin and calmodulin genes, which highlight potential novel splicing mechanisms.


Talk with an expert

If you have a question, need to check the status of an order, or are interested in purchasing an instrument, we're here to help.