Menu
July 7, 2019

A precise chloroplast genome of Nelumbo nucifera (Nelumbonaceae) evaluated with Sanger, Illumina MiSeq, and PacBio RS II sequencing platforms: insight into the plastid evolution of basal eudicots.

BackgroundThe chloroplast genome is important for plant development and plant evolution. Nelumbo nucifera is one member of relict plants surviving from the late Cretaceous. Recently, a new sequencing platform PacBio RS II, known as `SMRT (Single Molecule, Real-Time) sequencing¿, has been developed. Using the SMRT sequencing to investigate the chloroplast genome of N. nucifera will help to elucidate the plastid evolution of basal eudicots.ResultsThe sizes of the de novo assembled complete chloroplast genome of N. nucifera were 163,307 bp, 163,747 bp and 163,600 bp with average depths of coverage of 7×, 712× and 105× sequenced by Sanger, Illumina MiSeq and PacBio RS II, respectively. The precise chloroplast genome of N. nucifera was obtained from PacBio RS II data proofread by Illumina MiSeq reads, with a quadripartite structure containing a large single copy region (91,846 bp) and a small single copy region (19,626 bp) separated by two inverted repeat regions (26,064 bp). The genome contains 113 different genes, including four distinct rRNAs, 30 distinct tRNAs and 79 distinct peptide-coding genes. A phylogenetic analysis of 133 taxa from 56 orders indicated that Nelumbo with an age of 177 million years is a sister clade to Platanus, which belongs to the basal eudicots. Basal eudicots began to emerge during the early Jurassic with estimated divergence times at 197 million years using MCMCTree. IR expansions/contractions within the basal eudicots seem to have occurred independently.ConclusionsBecause of long reads and lack of bias in coverage of AT-rich regions, PacBio RS II showed a great promise for highly accurate `finished¿ genomes, especially for a de novo assembly of genomes. N. nucifera is one member of basal eudicots, however, evolutionary analyses of IR structural variations of N. nucifera and other basal eudicots suggested that IR expansions/contractions occurred independently in these basal eudicots or were caused by independent insertions and deletions. The precise chloroplast genome of N. nucifera will present new information for structural variation of chloroplast genomes and provide new insight into the evolution of basal eudicots at the primary sequence and structural level.


July 7, 2019

A Y-chromosome-encoded small RNA acts as a sex determinant in persimmons.

In plants, multiple lineages have evolved sex chromosomes independently, providing a powerful comparative framework, but few specific determinants controlling the expression of a specific sex have been identified. We investigated sex determinants in the Caucasian persimmon, Diospyros lotus, a dioecious plant with heterogametic males (XY). Male-specific short nucleotide sequences were used to define a male-determining region. A combination of transcriptomics and evolutionary approaches detected a Y-specific sex-determinant candidate, OGI, that displays male-specific conservation among Diospyros species. OGI encodes a small RNA targeting the autosomal MeGI gene, a homeodomain transcription factor regulating anther fertility in a dosage-dependent fashion. This identification of a feminizing gene suppressed by a Y-chromosome-encoded small RNA contributes to our understanding of the evolution of sex chromosome systems in higher plants. Copyright © 2014, American Association for the Advancement of Science.


July 7, 2019

Pseudomonas syringae CC1557: a highly virulent strain with an unusually small type III effector repertoire that includes a novel effector.

Both type III effector proteins and nonribosomal peptide toxins play important roles for Pseudomonas syringae pathogenicity in host plants, but whether and how these pathways interact to promote infection remains unclear. Genomic evidence from one clade of P. syringae suggests a tradeoff between the total number of type III effector proteins and presence of syringomycin, syringopeptin, and syringolin A toxins. Here, we report the complete genome sequence from P. syringae CC1557, which contains the lowest number of known type III effectors to date and has also acquired genes similar to sequences encoding syringomycin pathways from other strains. We demonstrate that this strain is pathogenic on Nicotiana benthamiana and that both the type III secretion system and a new type III effector, hopBJ1, contribute to pathogenicity. We further demonstrate that activity of HopBJ1 is dependent on residues structurally similar to the catalytic site of Escherichia coli CNF1 toxin. Taken together, our results provide additional support for a negative correlation between type III effector repertoires and the potential to produce syringomycin-like toxins while also highlighting how genomic synteny and bioinformatics can be used to identify and characterize novel virulence proteins.


July 7, 2019

Comparative genome sequencing reveals genomic signature of extreme desiccation tolerance in the anhydrobiotic midge.

Anhydrobiosis represents an extreme example of tolerance adaptation to water loss, where an organism can survive in an ametabolic state until water returns. Here we report the first comparative analysis examining the genomic background of extreme desiccation tolerance, which is exclusively found in larvae of the only anhydrobiotic insect, Polypedilum vanderplanki. We compare the genomes of P. vanderplanki and a congeneric desiccation-sensitive midge P. nubifer. We determine that the genome of the anhydrobiotic species specifically contains clusters of multi-copy genes with products that act as molecular shields. In addition, the genome possesses several groups of genes with high similarity to known protective proteins. However, these genes are located in distinct paralogous clusters in the genome apart from the classical orthologues of the corresponding genes shared by both chironomids and other insects. The transcripts of these clustered paralogues contribute to a large majority of the mRNA pool in the desiccating larvae and most likely define successful anhydrobiosis. Comparison of expression patterns of orthologues between two chironomid species provides evidence for the existence of desiccation-specific gene expression systems in P. vanderplanki.


July 7, 2019

Transcriptome analysis of the entomopathogenic oomycete Lagenidium giganteum reveals putative virulence factors.

A combination of 454 pyrosequencing and Sanger sequencing was used to sample and characterize the transcriptome of the entomopathogenic oomycete Lagenidium giganteum. More than 50,000 high-throughput reads were annotated through homology searches. Several selected reads served as seeds for the amplification and sequencing of full-length transcripts. Phylogenetic analyses inferred from full-length cellulose synthase alignments revealed that L giganteum is nested within the peronosporalean galaxy and as such appears to have evolved from a phytopathogenic ancestor. In agreement with the phylogeny reconstructions, full-length L. giganteum oomycete effector orthologs, corresponding to the cellulose-binding elicitor lectin (CBEL), crinkler (CRN), and elicitin proteins, were characterized by domain organizations similar to those of pathogenicity factors of plant-pathogenic oomycetes. Importantly, the L. giganteum effectors provide a basis for detailing the roles of canonical CRN, CBEL, and elicitin proteins in the infectious process of an oomycete known principally as an animal pathogen. Finally, phylogenetic analyses and genome mining identified members of glycoside hydrolase family 5 subfamily 27 (GH5_27) as putative virulence factors active on the host insect cuticle, based in part on the fact that GH5_27 genes are shared by entomopathogenic oomycetes and fungi but are underrepresented in nonentomopathogenic genomes. The genomic resources gathered from the L. giganteum transcriptome analysis strongly suggest that filamentous entomopathogens (oomycetes and fungi) exhibit convergent evolution: they have evolved independently from plant-associated microbes, have retained genes indicative of plant associations, and may share similar cores of virulence factors, such as GH5_27 enzymes, that are absent from the genomes of their plant-pathogenic relatives. Copyright © 2014, American Society for Microbiology. All Rights Reserved.


July 7, 2019

The first 50 plant genomes

Fifty-five plant genomes have been published to date representing 49 different species (Table 1 includes PubMed IDs for complete reference). What have we learned from the first wave of plant genomes? It has been said that plant genome papers (and genome papers in general) are dry and lack “biology” and that the days of high impact plant genome papers are drawing to a close unless they explore significant biology. However, with each new genome, earlier observations are refined and plant genome papers continue to reveal novel aspects of genome biology. For example, the tomato and banana genome papers refined current thinking on the whole genome duplications (WGD) that shaped dicot and monocot genome evolution (D’Hont et al., 2012; Tomato Genome Consortium, 2012). These observations were enabled not only by high quality genome assemblies but also by a greater number of genomes available for com- parisons. In addition, the initial round of plant genomes enabled the first generation of functional genomics that helped to define the roles of hundreds of genes, provided unprecedented access to sequence-based markers for breeding, and provided glimpses into plant evolutionary history. More genomes, representing the diverse array of species in Viridiplantae are still required to gain a full understanding of plant genome structure, evolution, and complexity.


July 7, 2019

Whole-exome targeted sequencing of the uncharacterized pine genome.

The large genome size of many species hinders the development and application of genomic tools to study them. For instance, loblolly pine (Pinus taeda L.), an ecologically and economically important conifer, has a large and yet uncharacterized genome of 21.7 Gbp. To characterize the pine genome, we performed exome capture and sequencing of 14 729 genes derived from an assembly of expressed sequence tags. Efficiency of sequence capture was evaluated and shown to be similar across samples with increasing levels of complexity, including haploid cDNA, haploid genomic DNA and diploid genomic DNA. However, this efficiency was severely reduced for probes that overlapped multiple exons, presumably because intron sequences hindered probe:exon hybridizations. Such regions could not be entirely avoided during probe design, because of the lack of a reference sequence. To improve the throughput and reduce the cost of sequence capture, a method to multiplex the analysis of up to eight samples was developed. Sequence data showed that multiplexed capture was reproducible among 24 haploid samples, and can be applied for high-throughput analysis of targeted genes in large populations. Captured sequences were de novo assembled, resulting in 11 396 expanded and annotated gene models, significantly improving the knowledge about the pine gene space. Interspecific capture was also evaluated with over 98% of all probes designed from P. taeda that were efficient in sequence capture, were also suitable for analysis of the related species Pinus elliottii Engelm.© 2013 The Authors The Plant Journal © 2013 John Wiley & Sons Ltd.


July 7, 2019

Genome of an arbuscular mycorrhizal fungus provides insight into the oldest plant symbiosis.

The mutualistic symbiosis involving Glomeromycota, a distinctive phylum of early diverging Fungi, is widely hypothesized to have promoted the evolution of land plants during the middle Paleozoic. These arbuscular mycorrhizal fungi (AMF) perform vital functions in the phosphorus cycle that are fundamental to sustainable crop plant productivity. The unusual biological features of AMF have long fascinated evolutionary biologists. The coenocytic hyphae host a community of hundreds of nuclei and reproduce clonally through large multinucleated spores. It has been suggested that the AMF maintain a stable assemblage of several different genomes during the life cycle, but this genomic organization has been questioned. Here we introduce the 153-Mb haploid genome of Rhizophagus irregularis and its repertoire of 28,232 genes. The observed low level of genome polymorphism (0.43 SNP per kb) is not consistent with the occurrence of multiple, highly diverged genomes. The expansion of mating-related genes suggests the existence of cryptic sex-related processes. A comparison of gene categories confirms that R. irregularis is close to the Mucoromycotina. The AMF obligate biotrophy is not explained by genome erosion or any related loss of metabolic complexity in central metabolism, but is marked by a lack of genes encoding plant cell wall-degrading enzymes and of genes involved in toxin and thiamine synthesis. A battery of mycorrhiza-induced secreted proteins is expressed in symbiotic tissues. The present comprehensive repertoire of R. irregularis genes provides a basis for future research on symbiosis-related mechanisms in Glomeromycota.


July 7, 2019

The genome of the anaerobic fungus Orpinomyces sp. strain C1A reveals the unique evolutionary history of a remarkable plant biomass degrader.

Anaerobic gut fungi represent a distinct early-branching fungal phylum (Neocallimastigomycota) and reside in the rumen, hindgut, and feces of ruminant and nonruminant herbivores. The genome of an anaerobic fungal isolate, Orpinomyces sp. strain C1A, was sequenced using a combination of Illumina and PacBio single-molecule real-time (SMRT) technologies. The large genome (100.95 Mb, 16,347 genes) displayed extremely low G+C content (17.0%), large noncoding intergenic regions (73.1%), proliferation of microsatellite repeats (4.9%), and multiple gene duplications. Comparative genomic analysis identified multiple genes and pathways that are absent in Dikarya genomes but present in early-branching fungal lineages and/or nonfungal Opisthokonta. These included genes for posttranslational fucosylation, the production of specific intramembrane proteases and extracellular protease inhibitors, the formation of a complete axoneme and intraflagellar trafficking machinery, and a near-complete focal adhesion machinery. Analysis of the lignocellulolytic machinery in the C1A genome revealed an extremely rich repertoire, with evidence of horizontal gene acquisition from multiple bacterial lineages. Experimental analysis indicated that strain C1A is a remarkable biomass degrader, capable of simultaneous saccharification and fermentation of the cellulosic and hemicellulosic fractions in multiple untreated grasses and crop residues examined, with the process significantly enhanced by mild pretreatments. This capability, acquired during its separate evolutionary trajectory in the rumen, along with its resilience and invasiveness compared to prokaryotic anaerobes, renders anaerobic fungi promising agents for consolidated bioprocessing schemes in biofuels production.


July 7, 2019

Next-generation sequencing and large genome assemblies.

The next-generation sequencing (NGS) revolution has drastically reduced time and cost requirements for sequencing of large genomes, and also qualitatively changed the problem of assembly. This article reviews the state of the art in de novo genome assembly, paying particular attention to mammalian-sized genomes. The strengths and weaknesses of the main sequencing platforms are highlighted, leading to a discussion of assembly and the new challenges associated with NGS data. Current approaches to assembly are outlined and the various software packages available are introduced and compared. The question of whether quality assemblies can be produced using short-read NGS data alone, or whether it must be combined with more expensive sequencing techniques, is considered. Prospects for future assemblers and tests of assembly performance are also discussed.


July 7, 2019

Next-generation polyploid phylogenetics: rapid resolution of hybrid polyploid complexes using PacBio single-molecule sequencing.

Difficulties in generating nuclear data for polyploids have impeded phylogenetic study of these groups. We describe a high-throughput protocol and an associated bioinformatics pipeline (Pipeline for Untangling Reticulate Complexes (Purc)) that is able to generate these data quickly and conveniently, and demonstrate its efficacy on accessions from the fern family Cystopteridaceae. We conclude with a demonstration of the downstream utility of these data by inferring a multi-labeled species tree for a subset of our accessions. We amplified four c. 1-kb-long nuclear loci and sequenced them in a parallel-tagged amplicon sequencing approach using the PacBio platform. Purc infers the final sequences from the raw reads via an iterative approach that corrects PCR and sequencing errors and removes PCR-mediated recombinant sequences (chimeras). We generated data for all gene copies (homeologs, paralogs, and segregating alleles) present in each of three sets of 50 mostly polyploid accessions, for four loci, in three PacBio runs (one run per set). From the raw sequencing reads, Purc was able to accurately infer the underlying sequences. This approach makes it easy and economical to study the phylogenetics of polyploids, and, in conjunction with recent analytical advances, facilitates investigation of broad patterns of polyploid evolution.© 2016 The Authors. New Phytologist © 2016 New Phytologist Trust.


July 7, 2019

Current status of genome sequencing and its applications in aquaculture

Aquaculture is the fastest-growing food production sector in agriculture, with great potential to meet projected protein needs of human beings. Aquaculture is facing several challenges, including lack of a sufficient number of genetically improved species, lack of species-specific feeds, high mortality due to diseases and pollution of ecosystems. The rapid development of sequencing technologies has revolutionized biological sciences, and supplied necessary tools to tackle these challenges in aquaculture and thus ensure its sustainability and profitability. So far, draft genomes have been published in over 24 aquaculture species, and used to address important issues related to aquaculture. We briefly review the advances of next generation sequencing technologies, and summarize the status of whole genome sequencing and its general applications (i.e. establishing reference genomes and discovering DNA markers) and specific applications in tackling some important issues (e.g. breeding, diseases, sex determination & maturation) related to aquaculture. For sequencing a new genome, we recommend the use of 100–200 × short reads using Illumina and 50–60 × long reads with PacBio sequencing technologies. For identification of a large number of SNPs, resequencing pooled DNA samples from different populations is the most cost-effective way. We also discuss the challenges and future directions of whole genome sequencing in aquaculture.


July 7, 2019

A gapless genome sequence of the fungus Botrytis cinerea.

Following earlier incomplete and fragmented versions of a genome sequence for the grey mould Botrytis cinerea, we here report a gapless, near-finished genome sequence for B. cinerea strain B05.10. The assembly comprises 18 chromosomes and was confirmed by an optical map and a genetic map based on ~75 000 SNP markers. All chromosomes contain fully assembled centromeric regions, and 10 chromosomes have telomeres on both ends. The genetic map consisted of 4153 cM and comparison of genetic distances with the physical distances identified 40 recombination hotspots. The linkage map also identified two mutations, located in the previously described genes Bos1 and BcsdhB, that confer resistance to the fungicides boscalid and iprodione. The genome was predicted to encode 11 701 proteins. RNAseq data from >20 different samples were used to validate and improve gene models. Manual curation of chromosome 1 revealed interesting features, such as the occurrence of a dicistronic transcript and fully overlapping genes in opposite orientations, as well as many spliced antisense transcripts. Manual curation also revealed that UTRs of genes can be complex and long, with many UTRs exceeding lengths of 1 kb and possessing multiple introns. Community annotation is in progress. This article is protected by copyright. All rights reserved. © 2016 BSPP AND JOHN WILEY & SONS LTD.


July 7, 2019

Divergent evolution of multiple virus-resistance genes from a progenitor in Capsicum spp.

Plants have evolved hundreds of nucleotide-binding and leucine-rich domain proteins (NLRs) as potential intracellular immune receptors, but the evolutionary mechanism leading to the ability to recognize specific pathogen effectors is elusive. Here, we cloned Pvr4 (a Potyvirus resistance gene in Capsicum annuum) and Tsw (a Tomato spotted wilt virus resistance gene in Capsicum chinense) via a genome-based approach using independent segregating populations. The genes both encode typical NLRs and are located at the same locus on pepper chromosome 10. Despite the fact that these two genes recognize completely different viral effectors, the genomic structures and coding sequences of the two genes are strikingly similar. Phylogenetic studies revealed that these two immune receptors diverged from a progenitor gene of a common ancestor. Our results suggest that sequence variations caused by gene duplication and neofunctionalization may underlie the evolution of the ability to specifically recognize different effectors. These findings thereby provide insight into the divergent evolution of plant immune receptors.© 2016 The Authors. New Phytologist © 2016 New Phytologist Trust.


July 7, 2019

Strategies for complete plastid genome sequencing.

Plastid sequencing is an essential tool in the study of plant evolution. This high-copy organelle is one of the most technically accessible regions of the genome, and its sequence conservation makes it a valuable region for comparative genome evolution, phylogenetic analysis and population studies. Here, we discuss recent innovations and approaches for de novo plastid assembly that harness genomic tools. We focus on technical developments including low-cost sequence library preparation approaches for genome skimming, enrichment via hybrid baits and methylation-sensitive capture, sequence platforms with higher read outputs and longer read lengths, and automated tools for assembly. These developments allow for a much more streamlined assembly than via conventional short-range PCR. Although newer methods make complete plastid sequencing possible for any land plant or green alga, there are still challenges for producing finished plastomes particularly from herbarium material or from structurally divergent plastids such as those of parasitic plants.© 2016 The Authors. Molecular Ecology Resources Published by John Wiley & Sons Ltd.


Talk with an expert

If you have a question, need to check the status of an order, or are interested in purchasing an instrument, we're here to help.