Menu
April 21, 2020

A High-Quality Grapevine Downy Mildew Genome Assembly Reveals Rapidly Evolving and Lineage-Specific Putative Host Adaptation Genes.

Downy mildews are obligate biotrophic oomycete pathogens that cause devastating plant diseases on economically important crops. Plasmopara viticola is the causal agent of grapevine downy mildew, a major disease in vineyards worldwide. We sequenced the genome of Pl. viticola with PacBio long reads and obtained a new 92.94?Mb assembly with high contiguity (359 scaffolds for a N50 of 706.5?kb) due to a better resolution of repeat regions. This assembly presented a high level of gene completeness, recovering 1,592 genes encoding secreted proteins involved in plant-pathogen interactions. Plasmopara viticola had a two-speed genome architecture, with secreted protein-encoding genes preferentially located in gene-sparse, repeat-rich regions and evolving rapidly, as indicated by pairwise dN/dS values. We also used short reads to assemble the genome of Plasmopara muralis, a closely related species infecting grape ivy (Parthenocissus tricuspidata). The lineage-specific proteins identified by comparative genomics analysis included a large proportion of RxLR cytoplasmic effectors and, more generally, genes with high dN/dS values. We identified 270 candidate genes under positive selection, including several genes encoding transporters and components of the RNA machinery potentially involved in host specialization. Finally, the Pl. viticola genome assembly generated here will allow the development of robust population genomics approaches for investigating the mechanisms involved in adaptation to biotic and abiotic selective pressures in this species. © The Author(s) 2019. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.


April 21, 2020

Human contamination in bacterial genomes has created thousands of spurious proteins.

Contaminant sequences that appear in published genomes can cause numerous problems for downstream analyses, particularly for evolutionary studies and metagenomics projects. Our large-scale scan of complete and draft bacterial and archaeal genomes in the NCBI RefSeq database reveals that 2250 genomes are contaminated by human sequence. The contaminant sequences derive primarily from high-copy human repeat regions, which themselves are not adequately represented in the current human reference genome, GRCh38. The absence of the sequences from the human assembly offers a likely explanation for their presence in bacterial assemblies. In some cases, the contaminating contigs have been erroneously annotated as containing protein-coding sequences, which over time have propagated to create spurious protein “families” across multiple prokaryotic and eukaryotic genomes. As a result, 3437 spurious protein entries are currently present in the widely used nr and TrEMBL protein databases. We report here an extensive list of contaminant sequences in bacterial genome assemblies and the proteins associated with them. We found that nearly all contaminants occurred in small contigs in draft genomes, which suggests that filtering out small contigs from draft genome assemblies may mitigate the issue of contamination while still keeping nearly all of the genuine genomic sequences. © 2019 Breitwieser et al.; Published by Cold Spring Harbor Laboratory Press.


April 21, 2020

The Reference Genome Sequence of Scutellaria baicalensis Provides Insights into the Evolution of Wogonin Biosynthesis.

Scutellaria baicalensis Georgi is important in Chinese traditional medicine where preparations of dried roots, “Huang Qin,” are used for liver and lung complaints and as complementary cancer treatments. We report a high-quality reference genome sequence for S. baicalensis where 93% of the 408.14-Mb genome has been assembled into nine pseudochromosomes with a super-N50 of 33.2 Mb. Comparison of this sequence with those of closely related species in the order Lamiales, Sesamum indicum and Salvia splendens, revealed that a specialized metabolic pathway for the synthesis of 4′-deoxyflavone bioactives evolved in the genus Scutellaria. We found that the gene encoding a specific cinnamate coenzyme A ligase likely obtained its new function following recent mutations, and that four genes encoding enzymes in the 4′-deoxyflavone pathway are present as tandem repeats in the genome of S. baicalensis. Further analyses revealed that gene duplications, segmental duplication, gene amplification, and point mutations coupled to gene neo- and subfunctionalizations were involved in the evolution of 4′-deoxyflavone synthesis in the genus Scutellaria. Our study not only provides significant insight into the evolution of specific flavone biosynthetic pathways in the mint family, Lamiaceae, but also will facilitate the development of tools for enhancing bioactive productivity by metabolic engineering in microbes or by molecular breeding in plants. The reference genome of S. baicalensis is also useful for improving the genome assemblies for other members of the mint family and offers an important foundation for decoding the synthetic pathways of bioactive compounds in medicinal plants.Copyright © 2019 The Authors. Published by Elsevier Inc. All rights reserved.


April 21, 2020

Long-read sequence and assembly of segmental duplications.

We have developed a computational method based on polyploid phasing of long sequence reads to resolve collapsed regions of segmental duplications within genome assemblies. Segmental Duplication Assembler (SDA; https://github.com/mvollger/SDA ) constructs graphs in which paralogous sequence variants define the nodes and long-read sequences provide attraction and repulsion edges, enabling the partition and assembly of long reads corresponding to distinct paralogs. We apply it to single-molecule, real-time sequence data from three human genomes and recover 33-79 megabase pairs (Mb) of duplications in which approximately half of the loci are diverged (<99.8%) compared to the reference genome. We show that the corresponding sequence is highly accurate (>99.9%) and that the diverged sequence corresponds to copy-number-variable paralogs that are absent from the human reference genome. Our method can be applied to other complex genomes to resolve the last gene-rich gaps, improve duplicate gene annotation, and better understand copy-number-variant genetic diversity at the base-pair level.


April 21, 2020

Transmission of ciprofloxacin resistance in Salmonella mediated by a novel type of conjugative helper plasmids.

Ciprofloxacin resistance in Salmonella has been increasingly reported due to the emergence and dissemination of multiple Plasmid-Mediated Quinolone Resistance (PMQR) determinants, which are mainly located in non-conjugative plasmids or chromosome. In this study, we aimed to depict the molecular mechanisms underlying the rare phenomenon of horizontal transfer of ciprofloxacin resistance phenotype in Salmonella by conjugation experiments, S1-PFGE and complete plasmid sequencing. Two types of non-conjugative plasmids, namely an IncX1 type carrying a qnrS1 gene, and an IncH1 plasmid carrying the oqxAB-qnrS gene, both ciprofloxacin resistance determinants in Salmonella, were recovered from two Salmonella strains. Importantly, these non-conjugative plasmids could be fused with a novel Incl1 type conjugative helper plasmid, which could target insertion sequence (IS) elements located in the non-conjugative, ciprofloxacin-resistance-encoding plasmid through replicative transcription, eventually forming a hybrid conjugative plasmid transmissible among members of Enterobacteriaceae. Since our data showed that such conjugative helper plasmids are commonly detectable among clinical Salmonella strains, particularly S. Typhimurium, fusion events leading to generation and enhanced dissemination of conjugative ciprofloxacin resistance-encoding plasmids in Salmonella are expected to result in a sharp increase in the incidence of resistance to fluoroquinolone, the key choice for treating life-threatening Salmonella infections, thereby posing a serious public health threat.


April 21, 2020

µLAS technology for DNA isolation coupled to Cas9-assisted targeting for sequencing and assembly of a 30 kb region in plant genome.

Cas9-assisted targeting of DNA fragments in complex genomes is viewed as an essential strategy to obtain high-quality and continuous sequence data. However, the purity of target loci selected by pulsed-field gel electrophoresis (PFGE) has so far been insufficient to assemble the sequence in one contig. Here, we describe the µLAS technology to capture and purify high molecular weight DNA. First, the technology is optimized to perform high sensitivity DNA profiling with a limit of detection of 20 fg/µl for 50 kb fragments and an analytical time of 50 min. Then, µLAS is operated to isolate a 31.5 kb locus cleaved by Cas9 in the genome of the plant Medicago truncatula. Target purification is validated on a Bacterial Artificial Chromosome plasmid, and subsequently carried out in whole genome with µLAS, PFGE or by combining these techniques. PacBio sequencing shows an enrichment factor of the target sequence of 84 with PFGE alone versus 892 by association of PFGE with µLAS. These performances allow us to sequence and assemble one contig of 29 441 bp with 99% sequence identity to the reference sequence. © The Author(s) 2019. Published by Oxford University Press on behalf of Nucleic Acids Research.


April 21, 2020

FadR1, a pathway-specific activator of fidaxomicin biosynthesis in Actinoplanes deccanensis Yp-1.

Fidaxomicin, an 18-membered macrolide antibiotic, is highly active against Clostridium difficile, the most common cause of diarrhea in hospitalized patients. Though the biosynthetic mechanism of fidaxomicin has been well studied, little is known about its regulatory mechanism. Here, we reported that FadR1, a LAL family transcriptional regulator in the fidaxomicin cluster of Actinoplanes deccanensis Yp-1, acts as an activator for fidaxomicin biosynthesis. The disruption of fadR1 abolished the ability to synthesize fidaxomicin, and production could be restored by reintegrating a single copy of fadR1. Overexpression of fadR1 resulted in an approximately 400 % improvement in fidaxomicin production. Electrophoretic mobility shift assays indicated that fidaxomicin biosynthesis is under the control of FadR1 through its binding to the promoter regions of fadM, fadA1-fadP2, fadS2-fadC, and fadE-fadF, respectively. And the conserved binding sites of FadR1 within the four promoter regions were determined by footprinting experiment. All results indicated that fadR1 encodes a pathway-specific positive regulator of fidaxomicin biosynthesis and upregulates the transcription levels of most of genes by binding to the four above intergenic regions. In summary, we not only clearly elucidate the regulatory mechanism of FadR1 but also provide strategies for the construction of industrial high-yield strain of fidaxomicin.


April 21, 2020

Genetic variation in the conjugative plasmidome of a hospital effluent multidrug resistant Escherichia coli strain.

Bacteria harboring conjugative plasmids have the potential for spreading antibiotic resistance through horizontal gene transfer. It is described that the selection and dissemination of antibiotic resistance is enhanced by stressors, like metals or antibiotics, which can occur as environmental contaminants. This study aimed at unveiling the composition of the conjugative plasmidome of a hospital effluent multidrug resistant Escherichia coli strain (H1FC54) under different mating conditions. To meet this objective, plasmid pulsed field gel electrophoresis, optical mapping analyses and DNA sequencing were used in combination with phenotype analysis. Strain H1FC54 was observed to harbor five plasmids, three of which were conjugative and two of these, pH1FC54_330 and pH1FC54_140, contained metal and antibiotic resistance genes. Transconjugants obtained in the absence or presence of tellurite (0.5?µM or 5?µM), arsenite (0.5?µM, 5?µM or 15?µM) or ceftazidime (10?mg/L) and selected in the presence of sodium azide (100?mg/L) and tetracycline (16?mg/L) presented distinct phenotypes, associated with the acquisition of different plasmid combinations, including two co-integrate plasmids, of 310 kbp and 517 kbp. The variable composition of the conjugative plasmidome, the formation of co-integrates during conjugation, as well as the transfer of non-transferable plasmids via co-integration, and the possible association between antibiotic, arsenite and tellurite tolerance was demonstrated. These evidences bring interesting insights into the comprehension of the molecular and physiological mechanisms that underlie antibiotic resistance propagation in the environment. Copyright © 2019 Elsevier Ltd. All rights reserved.


April 21, 2020

Single-Molecule Sequencing: Towards Clinical Applications.

In the past several years, single-molecule sequencing platforms, such as those by Pacific Biosciences and Oxford Nanopore Technologies, have become available to researchers and are currently being tested for clinical applications. They offer exceptionally long reads that permit direct sequencing through regions of the genome inaccessible or difficult to analyze by short-read platforms. This includes disease-causing long repetitive elements, extreme GC content regions, and complex gene loci. Similarly, these platforms enable structural variation characterization at previously unparalleled resolution and direct detection of epigenetic marks in native DNA. Here, we review how these technologies are opening up new clinical avenues that are being applied to pathogenic microorganisms and viruses, constitutional disorders, pharmacogenomics, cancer, and more.Copyright © 2018 Elsevier Ltd. All rights reserved.


April 21, 2020

The Genome of Armadillidium vulgare (Crustacea, Isopoda) Provides Insights into Sex Chromosome Evolution in the Context of Cytoplasmic Sex Determination.

The terrestrial isopod Armadillidium vulgare is an original model to study the evolution of sex determination and symbiosis in animals. Its sex can be determined by ZW sex chromosomes, or by feminizing Wolbachia bacterial endosymbionts. Here, we report the sequence and analysis of the ZW female genome of A. vulgare. A distinguishing feature of the 1.72 gigabase assembly is the abundance of repeats (68% of the genome). We show that the Z and W sex chromosomes are essentially undifferentiated at the molecular level and the W-specific region is extremely small (at most several hundreds of kilobases). Our results suggest that recombination suppression has not spread very far from the sex-determining locus, if at all. This is consistent with A. vulgare possessing evolutionarily young sex chromosomes. We characterized multiple Wolbachia nuclear inserts in the A. vulgare genome, none of which is associated with the W-specific region. We also identified several candidate genes that may be involved in the sex determination or sexual differentiation pathways. The A. vulgare genome serves as a resource for studying the biology and evolution of crustaceans, one of the most speciose and emblematic metazoan groups. © The Author(s) 2019. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.


April 21, 2020

Genome of Crucihimalaya himalaica, a close relative of Arabidopsis, shows ecological adaptation to high altitude.

Crucihimalaya himalaica, a close relative of Arabidopsis and Capsella, grows on the Qinghai-Tibet Plateau (QTP) about 4,000 m above sea level and represents an attractive model system for studying speciation and ecological adaptation in extreme environments. We assembled a draft genome sequence of 234.72 Mb encoding 27,019 genes and investigated its origin and adaptive evolutionary mechanisms. Phylogenomic analyses based on 4,586 single-copy genes revealed that C. himalaica is most closely related to Capsella (estimated divergence 8.8 to 12.2 Mya), whereas both species form a sister clade to Arabidopsis thaliana and Arabidopsis lyrata, from which they diverged between 12.7 and 17.2 Mya. LTR retrotransposons in C. himalaica proliferated shortly after the dramatic uplift and climatic change of the Himalayas from the Late Pliocene to Pleistocene. Compared with closely related species, C. himalaica showed significant contraction and pseudogenization in gene families associated with disease resistance and also significant expansion in gene families associated with ubiquitin-mediated proteolysis and DNA repair. We identified hundreds of genes involved in DNA repair, ubiquitin-mediated proteolysis, and reproductive processes with signs of positive selection. Gene families showing dramatic changes in size and genes showing signs of positive selection are likely candidates for C. himalaica’s adaptation to intense radiation, low temperature, and pathogen-depauperate environments in the QTP. Loss of function at the S-locus, the reason for the transition to self-fertilization of C. himalaica, might have enabled its QTP occupation. Overall, the genome sequence of C. himalaica provides insights into the mechanisms of plant adaptation to extreme environments.Copyright © 2019 the Author(s). Published by PNAS.


April 21, 2020

Complete Genome Sequence of the Wolbachia wAlbB Endosymbiont of Aedes albopictus.

Wolbachia, an alpha-proteobacterium closely related to Rickettsia, is a maternally transmitted, intracellular symbiont of arthropods and nematodes. Aedes albopictus mosquitoes are naturally infected with Wolbachia strains wAlbA and wAlbB. Cell line Aa23 established from Ae. albopictus embryos retains only wAlbB and is a key model to study host-endosymbiont interactions. We have assembled the complete circular genome of wAlbB from the Aa23 cell line using long-read PacBio sequencing at 500× median coverage. The assembled circular chromosome is 1.48 megabases in size, an increase of more than 300 kb over the published draft wAlbB genome. The annotation of the genome identified 1,205 protein coding genes, 34 tRNA, 3 rRNA, 1 tmRNA, and 3 other ncRNA loci. The long reads enabled sequencing over complex repeat regions which are difficult to resolve with short-read sequencing. Thirteen percent of the genome comprised insertion sequence elements distributed throughout the genome, some of which cause pseudogenization. Prophage WO genes encoding some essential components of phage particle assembly are missing, while the remainder are found in five prophage regions/WO-like islands or scattered around the genome. Orthology analysis identified a core proteome of 535 orthogroups across all completed Wolbachia genomes. The majority of proteins could be annotated using Pfam and eggNOG analyses, including ankyrins and components of the Type IV secretion system. KEGG analysis revealed the absence of five genes in wAlbB which are present in other Wolbachia. The availability of a complete circular chromosome from wAlbB will enable further biochemical, molecular, and genetic analyses on this strain and related Wolbachia. © The Author(s) 2019. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.


April 21, 2020

Tools and Strategies for Long-Read Sequencing and De Novo Assembly of Plant Genomes.

The commercial release of third-generation sequencing technologies (TGSTs), giving long and ultra-long sequencing reads, has stimulated the development of new tools for assembling highly contiguous genome sequences with unprecedented accuracy across complex repeat regions. We survey here a wide range of emerging sequencing platforms and analytical tools for de novo assembly, provide background information for each of their steps, and discuss the spectrum of available options. Our decision tree recommends workflows for the generation of a high-quality genome assembly when used in combination with the specific needs and resources of a project.Copyright © 2019 Elsevier Ltd. All rights reserved.


April 21, 2020

Characterizing the major structural variant alleles of the human genome.

In order to provide a comprehensive resource for human structural variants (SVs), we generated long-read sequence data and analyzed SVs for fifteen human genomes. We sequence resolved 99,604 insertions, deletions, and inversions including 2,238 (1.6 Mbp) that are shared among all discovery genomes with an additional 13,053 (6.9 Mbp) present in the majority, indicating minor alleles or errors in the reference. Genotyping in 440 additional genomes confirms the most common SVs in unique euchromatin are now sequence resolved. We report a ninefold SV bias toward the last 5 Mbp of human chromosomes with nearly 55% of all VNTRs (variable number of tandem repeats) mapping to this portion of the genome. We identify SVs affecting coding and noncoding regulatory loci improving annotation and interpretation of functional variation. These data provide the framework to construct a canonical human reference and a resource for developing advanced representations capable of capturing allelic diversity. Copyright © 2018 Elsevier Inc. All rights reserved.


April 21, 2020

Inter-chromosomal coupling between vision and pigmentation genes during genomic divergence.

Recombination between loci underlying mate choice and ecological traits is a major evolutionary force acting against speciation with gene flow. The evolution of linkage disequilibrium between such loci is therefore a fundamental step in the origin of species. Here, we show that this process can take place in the absence of physical linkage in hamlets-a group of closely related reef fishes from the wider Caribbean that differ essentially in colour pattern and are reproductively isolated through strong visually-based assortative mating. Using full-genome analysis, we identify four narrow genomic intervals that are consistently differentiated among sympatric species in a backdrop of extremely low genomic divergence. These four intervals include genes involved in pigmentation (sox10), axial patterning (hoxc13a), photoreceptor development (casz1) and visual sensitivity (SWS and LWS opsins) that develop islands of long-distance and inter-chromosomal linkage disequilibrium as species diverge. The relatively simple genomic architecture of species differences facilitates the evolution of linkage disequilibrium in the presence of gene flow.


Talk with an expert

If you have a question, need to check the status of an order, or are interested in purchasing an instrument, we're here to help.