July 7, 2019

Vibrio anguillarum is genetically and phenotypically unaffected by long-term continuous exposure to the antibacterial compound tropodithietic acid.

Minimizing the use of antibiotics in the food production chain is essential for limiting the development and spread of antibiotic-resistant bacteria. One alternative intervention strategy is the use of probiotic bacteria, and bacteria of the marine Roseobacter clade are capable of antagonizing fish-pathogenic vibrios in fish larvae and live feed cultures for fish larvae. The antibacterial compound tropodithietic acid (TDA), an antiporter that disrupts the proton motive force, is key in the antibacterial activity of several roseobacters. Introducing probiotics on a larger scale requires understanding of any potential side effects of long-term exposure of the pathogen to the probionts or any compounds they produce. Here we exposed the fish pathogen Vibrio anguillarum to TDA for several hundred generations in an adaptive evolution experiment. No tolerance or resistance arose during the 90 days of exposure, and whole-genome sequencing of TDA-exposed lineages and clones revealed few mutational changes, compared to lineages grown without TDA. Amino acid-changing mutations were found in two to six different genes per clone; however, no mutations appeared unique to the TDA-exposed lineages or clones. None of the virulence genes of V. anguillarum was affected, and infectivity assays using fish cell lines indicated that the TDA-exposed lineages and clones were less invasive than the wild-type strain. Thus, long-term TDA exposure does not appear to result in TDA resistance and the physiology of V. anguillarum appears unaffected, supporting the application of TDA-producing roseobacters as probiotics in aquaculture.

It is important to limit the use of antibiotics in our food production, to reduce the risk of bacteria developing antibiotic resistance. We showed previously that marine bacteria of the Roseobacter clade can prevent or reduce bacterial diseases in fish larvae, acting as probiotics. Roseobacters produce the antimicrobial compound tropodithietic acid (TDA), and we were concerned regarding whether long-term exposure to this compound could induce resistance or affect the disease-causing ability of the fish pathogen. Therefore, we exposed the fish pathogen Vibrio anguillarum to increasing TDA concentrations over 3 months. We did not see the development of any resistance to TDA, and subsequent infection assays revealed that none of the TDA-exposed clones had increased virulence toward fish cells. Hence, this study supports the use of roseobacters as a non-risk-based disease control measure in aquaculture. Copyright © 2016, American Society for Microbiology. All Rights Reserved.


July 7, 2019

Evidence for an opportunistic and endophytic lifestyle of the Bursaphelenchus xylophilus-associated bacteria Serratia marcescens PWN146 isolated from wilting Pinus pinaster.

Pine wilt disease (PWD) results from the interaction of three elements: the pathogenic nematode, Bursaphelenchus xylophilus; the insect-vector, Monochamus sp.; and the host tree, mostly Pinus species. Bacteria isolated from B. xylophilus may be a fourth element in this complex disease. However, the precise role of bacteria in this interaction is unclear, as both plant-beneficial and plant-pathogenic bacteria may be associated with PWD. Using whole genome sequencing and phenotypic characterization, we were able to investigate in more detail the genetic repertoire of Serratia marcescens PWN146, a bacterium associated with B. xylophilus. We show clear evidence that S. marcescens PWN146 is able to withstand and colonize the plant environment without any deleterious effects on a susceptible host (Pinus thunbergii), on B. xylophilus, or on the nematode model C. elegans. This bacterium is able to tolerate growth in the presence of xenobiotic/organic compounds, and to use phenylacetic acid as a carbon source. Furthermore, we present a detailed list of the potential capabilities of S. marcescens PWN146 to interfere with plant metabolism via hormonal pathways and/or nutritional acquisition, and to be competitive against other bacteria and/or fungi in terms of resource acquisition or production of antimicrobial compounds. Further investigation is required to understand the role of bacteria in PWD. We have now reinforced the theory that B. xylophilus-associated bacteria may have a plant origin.


July 7, 2019

The Ditylenchus destructor genome provides new insights into the evolution of plant parasitic nematodes.

Plant-parasitic nematodes were found in 4 of the 12 clades of phylum Nematoda. These nematodes in different clades may have originated independently from their free-living fungivorous ancestors. However, the exact evolutionary process of these parasites is unclear. Here, we sequenced the genome of a migratory plant nematode, Ditylenchus destructor. We performed comparative genomics among the free-living nematode Caenorhabditis elegans and all the plant nematodes with genome sequences available. We found that, compared with C. elegans, the core developmental control processes underwent heavy reduction, though most signal transduction pathways were conserved. We also found that D. destructor contained more homologs of the key genes in the above processes than the other plant nematodes. We suggest that Ditylenchus spp. may be an intermediate evolutionary stage between free-living nematodes that feed on fungi and obligate plant-parasitic nematodes. Based on the facts that D. destructor can feed on fungi and has a relatively short life cycle, and that it has similar features to both C. elegans and sedentary plant-parasitic nematodes from clade 12, we propose it as a new model to study the biology and biocontrol of plant nematodes and the interaction between nematodes and plants. © 2016 The Author(s).


July 7, 2019

Strategies for sequence assembly of plant genomes

The field of plant genome assembly has greatly benefited from the development and widespread adoption of next-generation DNA sequencing platforms. Very high sequencing throughputs and low costs per nucleotide have considerably reduced the technical and budgetary constraints associated with early assembly projects done primarily with a traditional Sanger-based approach. Those improvements led to a sharp increase in the number of plant genomes being sequenced, including large and complex genomes of economically important crops. Although next-generation DNA sequencing has considerably improved our understanding of the overall structure and dynamics of many plant genomes, severe limitations still remain because next-generation DNA sequencing reads typically are shorter than Sanger reads. In addition, the software tools used to de novo assemble sequences are not necessarily designed to optimize the use of short reads. These cause challenges, common to many plant species with large genome sizes, high repeat contents, polyploidy and genome-wide duplications. This chapter provides an overview of historical and current methods used to sequence and assemble plant genomes, along with new solutions offered by the emergence of technologies such as single molecule sequencing and optical mapping to address the limitations of current sequence assemblies.


July 7, 2019

Complete genome sequences of the Serratia plymuthica strains 3Rp8 and 3Re4-18, two rhizosphere bacteria with antagonistic activity towards fungal phytopathogens and plant growth promoting abilities.

The Serratia plymuthica strains 3Rp8 and 3Re4-18 are motile, Gram-negative, non-sporulating bacteria. Strain 3Rp8 was isolated from the rhizosphere of Brassica napus L. and strain 3Re4-18 from the endorhiza of Solanum tuberosum L. Studies have shown in vitro activity against the soil-borne fungi Verticillium dahliae Kleb., Rhizoctonia solani Kühn, and Sclerotinia sclerotiorum. Here, we announce and describe the complete genome sequence of S. plymuthica 3Rp8 consisting of a single circular chromosome of 5.5 Mb that encodes 4954 protein-coding and 108 RNA-only encoding genes and of S. plymuthica 3Re4-18 consisting of a single circular chromosome of 5.4 Mb that encodes 4845 protein-coding and 109 RNA-only encoding genes. The whole genome sequences and annotations are available in NCBI under the locus numbers CP012096 and CP012097, respectively. The genome analyses revealed genes putatively responsible for the promising plant growth promoting and biocontrol properties including predicting factors such as secretion systems, iron scavenging siderophores, chitinases, secreted proteases, glucanases and non-ribosomal peptide synthetases, as well as unique genomic islands.


July 7, 2019

Crystal structures of the TRIC trimeric intracellular cation channel orthologues.

Ca²⁺ release from the sarcoplasmic reticulum (SR) and endoplasmic reticulum (ER) is crucial for muscle contraction, cell growth, apoptosis, learning and memory. The trimeric intracellular cation (TRIC) channels were recently identified as cation channels balancing the SR and ER membrane potentials, and are implicated in Ca²⁺ signaling and homeostasis. Here we present the crystal structures of prokaryotic TRIC channels in the closed state and structure-based functional analyses of prokaryotic and eukaryotic TRIC channels. Each trimer subunit consists of seven transmembrane (TM) helices with two inverted repeated regions. The electrophysiological, biochemical and biophysical analyses revealed that TRIC channels possess an ion-conducting pore within each subunit, and that the trimer formation contributes to the stability of the protein. The symmetrically related TM2 and TM5 helices are kinked at the conserved glycine clusters, and these kinks are important for the channel activity. Furthermore, the kinks of the TM2 and TM5 helices generate lateral fenestrations at each subunit interface. Unexpectedly, these lateral fenestrations are occupied with lipid molecules. This study provides the structural and functional framework for the molecular mechanism of this ion channel superfamily.


July 7, 2019

Improved assembly of noisy long reads by k-mer validation.

Genome assembly depends critically on read length. Two recent technologies, from Pacific Biosciences (PacBio) and Oxford Nanopore, produce read lengths >20 kb, which yield de novo genome assemblies with vastly greater contiguity than those based on Sanger, Illumina, or other technologies. However, the very high error rates of these two new technologies (~15% per base) make assembly imprecise at repeats longer than the read length and computationally expensive. Here we show that the contiguity and quality of the assembly of these noisy long reads can be significantly improved at a minimal cost, by leveraging the low error rate and low cost of Illumina short reads. Namely, k-mers from the PacBio raw reads that are not present in Illumina reads (which account for ~95% of the distinct k-mers) are deemed sequencing errors and ignored at the seed alignment step. By focusing on the ~5% of k-mers that are error free, read overlap sensitivity is dramatically increased. Of equal importance, the validation procedure can be extended to exclude repetitive k-mers, which prevents read miscorrection at repeats and further improves the resulting assemblies. We tested the k-mer validation procedure using one long-read technology (PacBio) and one assembler (MHAP/Celera Assembler), but it is very likely to yield analogous improvements with alternative long-read technologies and assemblers, such as Oxford Nanopore and BLASR/DALIGNER/Falcon, respectively. © 2016 Carvalho et al.; Published by Cold Spring Harbor Laboratory Press.
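
The validation idea in this abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the function names (`kmers`, `build_trusted_kmers`, `validated_seeds`) and the `max_count` repeat cutoff are hypothetical, reverse complements are ignored, and plain in-memory sets stand in for whatever k-mer index a real assembler would use.

```python
from collections import Counter


def kmers(seq, k):
    """Yield every overlapping k-mer of seq, left to right."""
    for i in range(len(seq) - k + 1):
        yield seq[i:i + k]


def build_trusted_kmers(short_reads, k, max_count=None):
    """Collect k-mers seen in the accurate short reads.

    If max_count is given, k-mers seen more often than that are treated
    as repetitive and excluded (a hypothetical cutoff for illustration).
    """
    counts = Counter(km for read in short_reads for km in kmers(read, k))
    return {km for km, c in counts.items()
            if max_count is None or c <= max_count}


def validated_seeds(long_read, trusted, k):
    """Keep only (position, k-mer) seeds of the noisy long read that are
    also present in the trusted set; all other k-mers are ignored."""
    return [(i, km) for i, km in enumerate(kmers(long_read, k))
            if km in trusted]


short_reads = ["ACGTAC", "TACTAC"]
noisy_long_read = "ACGTTAC"  # contains k-mers absent from the short reads

all_trusted = build_trusted_kmers(short_reads, k=3)
non_repetitive = build_trusted_kmers(short_reads, k=3, max_count=2)

print(validated_seeds(noisy_long_read, all_trusted, k=3))
print(validated_seeds(noisy_long_read, non_repetitive, k=3))
```

In this toy input, the k-mers `GTT` and `TTA` never occur in the short reads, so they are dropped as presumed errors; with the repeat cutoff, `TAC` (seen three times) is dropped as well.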


July 7, 2019

Complete genomic and transcriptional landscape analysis using third-generation sequencing: a case study of Saccharomyces cerevisiae CEN.PK113-7D.

Completion of eukaryal genomes can be a difficult task because of the highly repetitive sequences along the chromosomes and the short read lengths of second-generation sequencing. Saccharomyces cerevisiae strain CEN.PK113-7D, widely used as a model organism and a cell factory, was selected for this study to demonstrate the superior capability of very long sequence reads for de novo genome assembly. We generated long reads using two common third-generation sequencing technologies (Oxford Nanopore Technology (ONT) and Pacific Biosciences (PacBio)) and used short reads obtained using Illumina sequencing for error correction. Assembly of the reads derived from all three technologies resulted in complete sequences for all 16 yeast chromosomes, as well as the mitochondrial chromosome, in one step. Further, we identified three types of DNA methylation (5mC, 4mC and 6mA). Comparison between the reference strain S288C and strain CEN.PK113-7D identified chromosomal rearrangements against a background of similar gene content between the two strains. We identified full-length transcripts through ONT direct RNA sequencing technology. This allows for the identification of transcriptional landscapes, including untranslated regions (UTRs) (5′ UTR and 3′ UTR) as well as differential gene expression quantification. About 91% of the predicted transcripts could be consistently detected across biological replicates grown either on glucose or ethanol. Direct RNA sequencing identified many polyadenylated non-coding RNAs, rRNAs, telomere-RNA, long non-coding RNA and antisense RNA. This work demonstrates a strategy to obtain complete genome sequences and transcriptional landscapes that can be applied to other eukaryal organisms.


July 7, 2019

Inferring synteny between genome assemblies: a systematic evaluation.

Genome assemblies across all domains of life are being produced routinely. Initial analysis of a new genome usually includes annotation and comparative genomics. Synteny provides a framework in which conservation of homologous genes and gene order is identified between genomes of different species. The availability of human and mouse genomes paved the way for algorithm development in large-scale synteny mapping, which eventually became an integral part of comparative genomics. Synteny analysis is regularly performed on assembled sequences that are fragmented, neglecting the fact that most methods were developed using complete genomes. It is unknown to what extent draft assemblies lead to errors in such analysis. We fragmented genome assemblies of model nematodes to various extents and conducted synteny identification and downstream analysis. We first show that synteny between species can be underestimated by up to 40% and find disagreements between popular tools that infer synteny blocks. This inconsistency and further demonstration of erroneous gene ontology enrichment tests raise questions about the robustness of previous synteny analysis when gold standard genome sequences remain limited. In addition, assembly scaffolding using a reference guided approach with a closely related species may result in chimeric scaffolds with inflated assembly metrics if a true evolutionary relationship was overlooked. Annotation quality, however, has minimal effect on synteny if the assembled genome is highly contiguous. Our results show that a minimum N50 of 1 Mb is required for robust downstream synteny analysis, which emphasizes the importance of gold standard genomes to the science community, and should be achieved given the current progress in sequencing technology.
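
For readers unfamiliar with the N50 metric named in this abstract, the following Python sketch computes it and shows how fragmenting an assembly lowers it. The `fragment` helper cuts sequences into fixed-size pieces purely for illustration; the abstract does not say how the authors fragmented their assemblies, so this scheme and the example lengths are assumptions.

```python
def n50(lengths):
    """Return the N50: the length of the shortest sequence in the smallest
    set of longest sequences that together cover at least half of the
    total assembly length."""
    total = sum(lengths)
    running = 0
    for length in sorted(lengths, reverse=True):
        running += length
        if 2 * running >= total:
            return length
    raise ValueError("n50() needs at least one sequence length")


def fragment(lengths, max_piece):
    """Cut every sequence into pieces of at most max_piece bases
    (a hypothetical, deterministic fragmentation scheme)."""
    pieces = []
    for length in lengths:
        full, rest = divmod(length, max_piece)
        pieces.extend([max_piece] * full)
        if rest:
            pieces.append(rest)
    return pieces


# Made-up scaffold lengths in bases, for illustration only.
assembly = [5_000_000, 3_000_000, 2_000_000]
fragmented = fragment(assembly, 800_000)

print(n50(assembly))    # 5000000
print(n50(fragmented))  # 800000, below the 1 Mb threshold in the abstract
```

The total assembly length is unchanged by fragmentation; only contiguity, as summarized by N50, drops.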


July 7, 2019

Ten steps to get started in Genome Assembly and Annotation.

As a part of the ELIXIR-EXCELERATE efforts in capacity building, we present here 10 steps to facilitate researchers getting started in genome assembly and genome annotation. The guidelines given are broadly applicable, intended to be stable over time, and cover all aspects from start to finish of a general assembly and annotation project. Intrinsic properties of genomes are discussed, as is the importance of using high quality DNA. Different sequencing technologies and generally applicable workflows for genome assembly are also detailed. We cover structural and functional annotation and encourage readers to also annotate transposable elements, something that is often omitted from annotation workflows. The importance of data management is stressed, and we give advice on where to submit data and how to make your results Findable, Accessible, Interoperable, and Reusable (FAIR).


July 7, 2019

Satellite DNA evolution: old ideas, new approaches.

A substantial portion of the genomes of most multicellular eukaryotes consists of large arrays of tandemly repeated sequence, collectively called satellite DNA. The processes generating and maintaining different satellite DNA abundances across lineages are important to understand as satellites have been linked to chromosome mis-segregation, disease phenotypes, and reproductive isolation between species. While much theory has been developed to describe satellite evolution, empirical tests of these models have fallen short because of the challenges in assessing satellite repeat regions of the genome. Advances in computational tools and sequencing technologies now enable identification and quantification of satellite sequences genome-wide. Here, we describe some of these tools and how their applications are furthering our knowledge of satellite evolution and function. Copyright © 2018 Elsevier Ltd. All rights reserved.


July 7, 2019

Regulation of neuronal differentiation, function, and plasticity by alternative splicing.

Posttranscriptional mechanisms provide powerful means to expand the coding power of genomes. In nervous systems, alternative splicing has emerged as a fundamental mechanism not only for the diversification of protein isoforms but also for the spatiotemporal control of transcripts. Thus, alternative splicing programs play instructive roles in the development of neuronal cell type-specific properties, neuronal growth, self-recognition, synapse specification, and neuronal network function. Here we discuss the most recent genome-wide efforts on mapping RNA codes and RNA-binding proteins for neuronal alternative splicing regulation. We illustrate how alternative splicing shapes key steps of neuronal development, neuronal maturation, and synaptic properties. Finally, we highlight efforts to dissect the spatiotemporal dynamics of alternative splicing and their potential contribution to neuronal plasticity and the mature nervous system. Expected final online publication date for the Annual Review of Cell and Developmental Biology Volume 34 is October 6, 2018. Please see http://www.annualreviews.org/page/journal/pubdates for revised estimates.


July 7, 2019

Genome size estimation of Chinese cultured Artemisia annua L.

Almost all of the antimalarial artemisinin is extracted from the traditional Chinese medicinal plant Artemisia annua L. However, because genomic information is insufficient and the genetic background is unresolved, the regulatory mechanism of the artemisinin biosynthetic pathway is not yet clear. The genome size of genuine A. annua plants is an especially important and fundamental parameter, which is helpful for further genomic studies of artemisinin biosynthesis and improvement. In the current study, the genome sizes of A. annua samples, collected and identified by barcoding, were estimated to be 1.38-1.49 Gb by flow cytometry (FCM), with Nipponbare as the benchmark calibration standard and soybean and maize as two internal standards, used individually and simultaneously. The genome size estimate for seven A. annua strains from five Chinese provinces (Shandong, Hunan, Chongqing, Sichuan, and Hainan), with a low coefficient of variation (CV = 2.96%), was relatively accurate and was 12.87% (220 Mb) less than previous reports on a foreign A. annua species measured with a single control. This facilitates the scheduling of the A. annua whole genome sequencing project, the optimization of assembly methods, and insight into its subsequent genetics and evolution.


July 7, 2019

Approximate, simultaneous comparison of microbial genome architectures via syntenic anchoring of quiver representations

Motivation: A long-standing limitation in comparative genomic studies is the dependency on a reference genome, which hinders the spectrum of genetic diversity that can be identified across a population of organisms. This is especially true in the microbial world where genome architectures can significantly vary. There is therefore a need for computational methods that can simultaneously analyze the architectures of multiple genomes without introducing bias from a reference.

Results: In this article, we present Ptolemy: a novel method for studying the diversity of genome architectures (such as structural variation and pan-genomes) across a collection of microbial assemblies without the need of a reference. Ptolemy is a ‘top-down’ approach to compare whole genome assemblies. Genomes are represented as labeled multi-directed graphs, known as quivers, which are then merged into a single, canonical quiver by identifying ‘gene anchors’ via synteny analysis. The canonical quiver represents an approximate, structural alignment of all genomes in a given collection encoding structural variation across (sub-) populations within the collection. We highlight various applications of Ptolemy by analyzing structural variation and the pan-genomes of different datasets composed of Mycobacterium, Saccharomyces, Escherichia and Shigella species. Our results show that Ptolemy is flexible and can handle both conserved and highly dynamic genome architectures. Ptolemy is user-friendly (it requires only a FASTA-formatted assembly along with a corresponding GFF-formatted file) and resource-friendly (it can align 24 genomes in ~10 mins with four CPUs and <2 GB of RAM).
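
The merging step described in this abstract can be pictured with a small Python sketch. This is not Ptolemy's code or data model. It assumes the gene anchors are already known and supplied as a dictionary, whereas the abstract states that Ptolemy infers them via synteny analysis. Parallel edges of the multi-directed graph are collapsed into one edge labeled with the set of genomes that carry it. All names and the toy genomes are hypothetical.

```python
from collections import defaultdict


def canonical_quiver(genomes, anchors):
    """Merge per-genome gene orders into one labeled directed graph.

    genomes: {genome_name: [gene_id, ...]} in chromosomal order.
    anchors: {(genome_name, gene_id): shared_label} for genes judged
             equivalent across genomes (assumed given here).
    Returns {(node_u, node_v): {genome_name, ...}}.
    """
    edges = defaultdict(set)
    for name, genes in genomes.items():
        # Anchored genes share a node; unanchored genes stay genome-specific.
        nodes = [anchors.get((name, g), f"{name}:{g}") for g in genes]
        for u, v in zip(nodes, nodes[1:]):
            edges[(u, v)].add(name)
    return dict(edges)


def variable_edges(quiver, n_genomes):
    """Edges not shared by every genome, a crude proxy for structural variation."""
    return {edge: names for edge, names in quiver.items()
            if len(names) < n_genomes}


genomes = {"g1": ["a", "b", "c", "d"], "g2": ["a", "c", "d"]}
anchors = {("g1", "a"): "A", ("g2", "a"): "A",
           ("g1", "c"): "C", ("g2", "c"): "C",
           ("g1", "d"): "D", ("g2", "d"): "D"}

quiver = canonical_quiver(genomes, anchors)
print(quiver[("C", "D")])  # edge supported by both genomes
print(sorted(variable_edges(quiver, n_genomes=2)))
```

Here gene `b` is present only in `g1`, so the path A → g1:b → C and the direct edge A → C appear as alternative routes between the same anchors, while C → D is shared.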


July 7, 2019

Measuring the mappability spectrum of reference genome assemblies

The ability to infer actionable information from genomic variation data in a resequencing experiment relies on accurately aligning the sequences to a reference genome. However, this accuracy is inherently limited by the quality of the reference assembly and the repetitive content of the subject’s genome. As long read sequencing technologies become more widespread, it is crucial to investigate the expected improvements in alignment accuracy and variant analysis over existing short read methods. The ability to quantify the read length and error rate necessary to uniquely map regions of interest in a sequence allows users to make informed decisions regarding experiment design and provides useful metrics for comparing the magnitude of repetition across different reference assemblies. To this end we have developed NEAT-Repeat, a toolkit for exhaustively identifying the minimum read length required to uniquely map each position of a reference sequence given a specified error rate. Using these tools we computed the “mappability spectrum” for ten reference sequences, including human and a range of plants and animals, quantifying the theoretical improvements in alignment accuracy that would result from sequencing with longer reads or reads with fewer base-calling errors. Our inclusion of read length and error rate builds upon existing methods for mappability tracks based on uniqueness or aligner-specific mapping scores, and thus enables more comprehensive analysis. We apply our mappability results to whole-genome variant call data, and demonstrate that variants called with low mapping and genotype quality scores are disproportionately found in reference regions that require long reads to be uniquely covered. We propose that our mappability metrics provide a valuable supplement to established variant filtering and annotation pipelines by supplying users with an additional metric related to read mapping quality.
NEAT-Repeat can process large and repetitive genomes, such as those of corn and soybean, in a tractable amount of time by leveraging efficient methods for edit distance computation as well as running multiple jobs in parallel. NEAT-Repeat is written in Python 2.7 and C++, and is available at https://github.com/zstephens/neat-repeat.
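
To make the quantity described in this abstract concrete, here is a naive Python sketch of the minimum read length needed to map each position uniquely. It is not NEAT-Repeat's algorithm: it assumes an error rate of zero (exact matches only), considers the forward strand only, and uses brute-force substring counting that is practical only for toy sequences. The function names are hypothetical.

```python
def count_occurrences(seq, sub):
    """Count occurrences of sub in seq, including overlapping ones."""
    count = 0
    start = seq.find(sub)
    while start != -1:
        count += 1
        start = seq.find(sub, start + 1)
    return count


def min_unique_lengths(seq):
    """For each start position, return the shortest read length whose
    sequence occurs exactly once in seq, or None if even the full suffix
    starting there is repeated elsewhere."""
    n = len(seq)
    result = []
    for i in range(n):
        found = None
        for length in range(1, n - i + 1):
            if count_occurrences(seq, seq[i:i + length]) == 1:
                found = length
                break
        result.append(found)
    return result


print(min_unique_lengths("ACACGT"))  # [3, 2, 3, 2, 1, 1]
print(min_unique_lengths("AAA"))     # [3, None, None]
```

Positions inside the repeated `AC` unit need longer reads than the unique `G` and `T` at the end. Tallying these per-position values over a whole reference would give a rough, error-free analogue of the mappability spectrum the abstract describes.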

