The long read lengths of PacBio’s SMRT Sequencing enable detection of linked mutations across multiple kilobases of sequence. This feature is particularly useful in the context of protein engineering, where large numbers of similar constructs are generated routinely to explore the effects of mutations on function and stability. We have developed a PCR-based barcoded sequencing method to generate high quality, full-length sequence data for batches of constructs generated in a common backbone. Individual barcodes are coupled to primers targeting a common region of the vector of interest. The amplified products are pooled into a single DNA library, and sequencing data are clustered by barcode to generate multi-molecule consensus sequences for each construct present in the pool. As a proof-of-concept dataset, we have generated a library of 384 randomly mutated variants of the Phi29 DNA polymerase, a 575 amino acid protein encoded by a 1.7 kb gene. These variants were amplified with a set of barcoded primers, and the resulting library was sequenced on a single SMRT Cell. The data produced sequences that were completely concordant with independent Sanger sequencing, for a 100% accurate reconstruction of the set of clones.
This documentary film features the wave of cutting-edge technologies that now provide the opportunity to create predictive models of living systems, and gain wisdom about the fundamental nature of life…
Part I of The New Biology documentary. This documentary film features the wave of cutting-edge technologies that now provide the opportunity to create predictive models of living systems, and gain…
To make improvements to crops like corn, soybeans, and canola, scientists at Corteva are building a compendium of crop genomics resources to provide actionable sequence info for genetic discovery, gene-editing,…
Webinar: Small Bodies, Big Genomes: Overcoming Large DNA Input Requirements for Long-Read Genome Assembly
In this webinar we present the low DNA input workflow, from DNA requirements through library preparation and sequencing, for generation of high-quality genome assemblies of small-bodied organisms. In addition, we…
Chlorella vulgaris genome assembly and annotation reveals the molecular basis for metabolic acclimation to high light conditions.
Chlorella vulgaris is a fast-growing fresh-water microalga cultivated at the industrial scale for applications ranging from food to biofuel production. To advance our understanding of its biology and to establish genetics tools for biotechnological manipulation, we sequenced the nuclear and organelle genomes of Chlorella vulgaris 211/11P by combining next generation sequencing and optical mapping of isolated DNA molecules. This hybrid approach allowed to assemble the nuclear genome in 14 pseudo-molecules with an N50 of 2.8 Mb and 98.9% of scaffolded genome. The integration of RNA-seq data obtained at two different irradiances of growth (high light-HL versus low light -LL) enabled to identify 10,724 nuclear genes, coding for 11,082 transcripts. Moreover 121 and 48 genes were respectively found in the chloroplast and mitochondrial genome. Functional annotation and expression analysis of nuclear, chloroplast and mitochondrial genome sequences revealed peculiar features of Chlorella vulgaris. Evidence of horizontal gene transfers from chloroplast to mitochondrial genome was observed. Furthermore, comparative transcriptomic analyses of LL vs HL provide insights into the molecular basis for metabolic rearrangement in HL vs. LL conditions leading to enhanced de novo fatty acid biosynthesis and triacylglycerol accumulation. The occurrence of a cytosolic fatty acid biosynthetic pathway can be predicted and its upregulation upon HL exposure is observed, consistent with increased lipid amount under HL. These data provide a rich genetic resource for future genome editing studies, and potential targets for biotechnological manipulation of Chlorella vulgaris or other microalgae species to improve biomass and lipid productivity.This article is protected by copyright. All rights reserved.
Rational development of transformation in Clostridium thermocellum ATCC 27405 via complete methylome analysis and evasion of native restriction-modification systems.
A major barrier to both metabolic engineering and fundamental biological studies is the lack of genetic tools in most microorganisms. One example is Clostridium thermocellum ATCC 27405T, where genetic tools are not available to help validate decades of hypotheses. A significant barrier to DNA transformation is restriction-modification systems, which defend against foreign DNA methylated differently than the host. To determine the active restriction-modification systems in this strain, we performed complete methylome analysis via single-molecule, real-time sequencing to detect 6-methyladenine and 4-methylcytosine and the rarely used whole-genome bisulfite sequencing to detect 5-methylcytosine. Multiple active systems were identified, and corresponding DNA methyltransferases were expressed from the Escherichia coli chromosome to mimic the C. thermocellum methylome. Plasmid methylation was experimentally validated and successfully electroporated into C. thermocellum ATCC 27405. This combined approach enabled genetic modification of the C. thermocellum-type strain and acts as a blueprint for transformation of other non-model microorganisms.
Transcriptional initiation of a small RNA, not R-loop stability, dictates the frequency of pilin antigenic variation in Neisseria gonorrhoeae.
Neisseria gonorrhoeae, the sole causative agent of gonorrhea, constitutively undergoes diversification of the Type IV pilus. Gene conversion occurs between one of the several donor silent copies located in distinct loci and the recipient pilE gene, encoding the major pilin subunit of the pilus. A guanine quadruplex (G4) DNA structure and a cis-acting sRNA (G4-sRNA) are located upstream of the pilE gene and both are required for pilin antigenic variation (Av). We show that the reduced sRNA transcription lowers pilin Av frequencies. Extended transcriptional elongation is not required for Av, since limiting the transcript to 32 nt allows for normal Av frequencies. Using chromatin immunoprecipitation (ChIP) assays, we show that cellular G4s are less abundant when sRNA transcription is lower. In addition, using ChIP, we demonstrate that the G4-sRNA forms a stable RNA:DNA hybrid (R-loop) with its template strand. However, modulating R-loop levels by controlling RNase HI expression does not alter G4 abundance quantified through ChIP. Since pilin Av frequencies were not altered when modulating R-loop levels by controlling RNase HI expression, we conclude that transcription of the sRNA is necessary, but stable R-loops are not required to promote pilin Av. © 2019 John Wiley & Sons Ltd.
Over the past decade, RNA sequencing (RNA-seq) has become an indispensable tool for transcriptome-wide analysis of differential gene expression and differential splicing of mRNAs. However, as next-generation sequencing technologies have developed, so too has RNA-seq. Now, RNA-seq methods are available for studying many different aspects of RNA biology, including single-cell gene expression, translation (the translatome) and RNA structure (the structurome). Exciting new applications are being explored, such as spatial transcriptomics (spatialomics). Together with new long-read and direct RNA-seq technologies and better computational tools for data analysis, innovations in RNA-seq are contributing to a fuller understanding of RNA biology, from questions such as when and where transcription occurs to the folding and intermolecular interactions that govern RNA function.
Forest tree species are increasingly subject to severe mortalities from exotic pests, diseases, and invasive organisms, accelerated by climate change. Forest health issues are threatening multiple species and ecosystem sustainability globally. While sources of resistance may be available in related species, or among surviving trees, introgression of resistance genes into threatened tree species in reasonable time frames requires genome-wide breeding tools. Asian species of chestnut (Castanea spp.) are being employed as donors of disease resistance genes to restore native chestnut species in North America and Europe. To aid in the restoration of threatened chestnut species, we present the assembly of a reference genome with chromosome-scale sequences for Chinese chestnut (C. mollissima), the disease-resistance donor for American chestnut restoration. We also demonstrate the value of the genome as a platform for research and species restoration, including new insights into the evolution of blight resistance in Asian chestnut species, the locations in the genome of ecologically important signatures of selection differentiating American chestnut from Chinese chestnut, the identification of candidate genes for disease resistance, and preliminary comparisons of genome organization with related species.
Whole-genome analysis of New Delhi Metallo-Beta-Lactamase-1-producing Acinetobacter haemolyticus from China.
Infections caused by multi-drug resistant Acinetobacter spp. has aroused worldwide attention. With the increasing isolation of non-baumannii Acinetobacter, the nature of infection and resistance associated with them needs to be elaborated. This study aimed to analyze the characteristics of New Delhi Metallo-Beta-Lactamase-1 (NDM-1)-producing Acinetobacter haemolyticus (named sz1652) isolated from Shenzhen city, China.Antibiotic spectrum was analyzed after antimicrobial susceptibility test. Combined disk test (CDT) was used to detecting the metallo-beta-lactamases (MBLs). Transferability of carbapenem resistance was tested by filter mating experiments and plasmid transformation assays. Whole-genome sequencing (WGS) was performed using HiSeq 2000 and PacBio RS system.The A. haemolyticus strain sz1652 was resistant to carbapenems and other tested agents except for amikacin, tigecycline and colistin. The production of MBLs was confirmed by CDT. Transfer of carbapenem resistance was not successful. WGS analysis showed the genome of sz1652 was comprised of chromosome and two plasmids, and sixteen genomic islands (GIs) were predicted. Genes associated with resistance were found in this strain including the beta-lactamase genes blaNDM-1, blaOXA-214 and blaLRA-18, the ?uoroquinolone resistant-related mutations [GyrA subunits (Ser81Ile) and ParC subunits (Ser84Tyr)], and efflux pump genes related to tetracycline and macrolide resistance. Analysis of the genetic environment showed that blaNDM-1was embedded in Tn125 transposon. The Tn125 structure was chromosomally located and shared more than 99% sequence identity with previously reported blaNDM-1 carrying region.The NDM-1-producing A.haemolyticus coexisted multiple durg-resistant determinants. The acquisition of the blaNDM-1 gene was probably facilitated by Tn125 in this strain. Non-A.baumannii species also contain GIs.Copyright © 2019. Published by Elsevier Ltd.
Defining transgene insertion sites and off-target effects of homology-based gene silencing informs the use of functional genomics tools in Phytophthora infestans.
DNA transformation and homology-based transcriptional silencing are frequently used to assess gene function in Phytophthora. Since unplanned side-effects of these tools are not well-characterized, we used P. infestans to study plasmid integration sites and whether knockdowns caused by homology-dependent silencing spreads to other genes. Insertions occurred both in gene-dense and gene-sparse regions but disproportionately near the 5′ ends of genes, which disrupted native coding sequences. Microhomology at the recombination site between plasmid and chromosome was common. Studies of transformants silenced for twelve different gene targets indicated that neighbors within 500-nt were often co-silenced, regardless of whether hairpin or sense constructs were employed and the direction of transcription of the target. However, cis-spreading of silencing did not occur in all transformants obtained with the same plasmid. Genome-wide studies indicated that unlinked genes with partial complementarity with the silencing-inducing transgene were not usually down-regulated. We learned that hairpin or sense transgenes were not co-silenced with the target in all transformants, which informs how screens for silencing should be performed. We conclude that transformation and gene silencing can be reliable tools for functional genomics in Phytophthora but must be used carefully, especially by testing for the spread of silencing to genes flanking the target.
BjuWRR1, a CC-NB-LRR gene identified in Brassica juncea, confers resistance to white rust caused by Albugo candida.
BjuWRR1, a CNL-type R gene, was identified from an east European gene pool line of Brassica juncea and validated for conferring resistance to white rust by genetic transformation. White rust caused by the oomycete pathogen Albugo candida is a significant disease of crucifer crops including Brassica juncea (mustard), a major oilseed crop of the Indian subcontinent. Earlier, a resistance-conferring locus named AcB1-A5.1 was mapped in an east European gene pool line of B. juncea-Donskaja-IV. This line was tested along with some other lines of B. juncea (AABB), B. rapa (AA) and B. nigra (BB) for resistance to six isolates of A. candida collected from different mustard growing regions of India. Donskaja-IV was found to be completely resistant to all the tested isolates. Sequencing of a BAC spanning the locus AcB1-A5.1 showed the presence of a single CC-NB-LRR protein encoding R gene. The genomic sequence of the putative R gene with its native promoter and terminator was used for the genetic transformation of a susceptible Indian gene pool line Varuna and was found to confer complete resistance to all the isolates. This is the first white rust resistance-conferring gene described from Brassica species and has been named BjuWRR1. Allelic variants of the gene in B. juncea germplasm and orthologues in the Brassicaceae genomes were studied to understand the evolutionary dynamics of the BjuWRR1 gene.
Brassica napus (AACC, 2n = 38) is an important oilseed crop grown worldwide. However, little is known about the population evolution of this species, the genomic difference between its major genetic groups, such as European and Asian rapeseed, and the impacts of historical large-scale introgression events on this young tetraploid. In this study, we reported the de novo assembly of the genome sequences of an Asian rapeseed (B. napus), Ningyou 7, and its four progenitors and compared these genomes with other available genomic data from diverse European and Asian cultivars. Our results showed that Asian rapeseed originally derived from European rapeseed but subsequently significantly diverged, with rapid genome differentiation after hybridization and intensive local selective breeding. The first historical introgression of B. rapa dramatically broadened the allelic pool but decreased the deleterious variations of Asian rapeseed. The second historical introgression of the double-low traits of European rapeseed (canola) has reshaped Asian rapeseed into two groups (double-low and double-high), accompanied by an increase in genetic load in the double-low group. This study demonstrates distinctive genomic footprints and deleterious SNP (single nucleotide polymorphism) variants for local adaptation by recent intra- and interspecies introgression events and provides novel insights for understanding the rapid genome evolution of a young allopolyploid crop. © 2019 The Authors. Plant Biotechnology Journal published by Society for Experimental Biology and The Association of Applied Biologists and John Wiley & Sons Ltd.
Optimized Cas9 expression systems for highly efficient Arabidopsis genome editing facilitate isolation of complex alleles in a single generation.
Genetic resources for the model plant Arabidopsis comprise mutant lines defective in almost any single gene in reference accession Columbia. However, gene redundancy and/or close linkage often render it extremely laborious or even impossible to isolate a desired line lacking a specific function or set of genes from segregating populations. Therefore, we here evaluated strategies and efficiencies for the inactivation of multiple genes by Cas9-based nucleases and multiplexing. In first attempts, we succeeded in isolating a mutant line carrying a 70 kb deletion, which occurred at a frequency of ~?1.6% in the T2 generation, through PCR-based screening of numerous individuals. However, we failed to isolate a line lacking Lhcb1 genes, which are present in five copies organized at two loci in the Arabidopsis genome. To improve efficiency of our Cas9-based nuclease system, regulatory sequences controlling Cas9 expression levels and timing were systematically compared. Indeed, use of DD45 and RPS5a promoters improved efficiency of our genome editing system by approximately 25-30-fold in comparison to the previous ubiquitin promoter. Using an optimized genome editing system with RPS5a promoter-driven Cas9, putatively quintuple mutant lines lacking detectable amounts of Lhcb1 protein represented approximately 30% of T1 transformants. These results show how improved genome editing systems facilitate the isolation of complex mutant alleles, previously considered impossible to generate, at high frequency even in a single (T1) generation.