Menu
April 21, 2020

A High-Quality Grapevine Downy Mildew Genome Assembly Reveals Rapidly Evolving and Lineage-Specific Putative Host Adaptation Genes.

Downy mildews are obligate biotrophic oomycete pathogens that cause devastating plant diseases on economically important crops. Plasmopara viticola is the causal agent of grapevine downy mildew, a major disease in vineyards worldwide. We sequenced the genome of Pl. viticola with PacBio long reads and obtained a new 92.94?Mb assembly with high contiguity (359 scaffolds for a N50 of 706.5?kb) due to a better resolution of repeat regions. This assembly presented a high level of gene completeness, recovering 1,592 genes encoding secreted proteins involved in plant-pathogen interactions. Plasmopara viticola had a two-speed genome architecture, with secreted protein-encoding genes preferentially located in gene-sparse, repeat-rich regions and evolving rapidly, as indicated by pairwise dN/dS values. We also used short reads to assemble the genome of Plasmopara muralis, a closely related species infecting grape ivy (Parthenocissus tricuspidata). The lineage-specific proteins identified by comparative genomics analysis included a large proportion of RxLR cytoplasmic effectors and, more generally, genes with high dN/dS values. We identified 270 candidate genes under positive selection, including several genes encoding transporters and components of the RNA machinery potentially involved in host specialization. Finally, the Pl. viticola genome assembly generated here will allow the development of robust population genomics approaches for investigating the mechanisms involved in adaptation to biotic and abiotic selective pressures in this species. © The Author(s) 2019. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.


April 21, 2020

Human contamination in bacterial genomes has created thousands of spurious proteins.

Contaminant sequences that appear in published genomes can cause numerous problems for downstream analyses, particularly for evolutionary studies and metagenomics projects. Our large-scale scan of complete and draft bacterial and archaeal genomes in the NCBI RefSeq database reveals that 2250 genomes are contaminated by human sequence. The contaminant sequences derive primarily from high-copy human repeat regions, which themselves are not adequately represented in the current human reference genome, GRCh38. The absence of the sequences from the human assembly offers a likely explanation for their presence in bacterial assemblies. In some cases, the contaminating contigs have been erroneously annotated as containing protein-coding sequences, which over time have propagated to create spurious protein “families” across multiple prokaryotic and eukaryotic genomes. As a result, 3437 spurious protein entries are currently present in the widely used nr and TrEMBL protein databases. We report here an extensive list of contaminant sequences in bacterial genome assemblies and the proteins associated with them. We found that nearly all contaminants occurred in small contigs in draft genomes, which suggests that filtering out small contigs from draft genome assemblies may mitigate the issue of contamination while still keeping nearly all of the genuine genomic sequences. © 2019 Breitwieser et al.; Published by Cold Spring Harbor Laboratory Press.


April 21, 2020

Complete Genome Sequence of Actinosynnema pretiosum X47, An Industrial Strain that Produces the Antibiotic Ansamitocin AP-3.

Ansamitocins are extraordinarily potent antitumor agents. Ansamitocin P-3 (AP-3), which is produced by Actinosynnema pretiosum, has been developed as a cytotoxic drug for breast cancer. Despite its importance, AP-3 is of limited applicability because of the low production yield. A. pretiosum strain X47 was developed from A. pretiosum ATCC 31565 by mutation breeding and shows a relatively high AP-3 yield. Here, we analyzed the A. pretiosum X47 genome, which is ~8.13 Mb in length with 6693 coding sequences, 58 tRNA genes, and 15 rRNA genes. The DNA sequence of the ansamitocin biosynthetic gene cluster is highly similar to that of the corresponding cluster in A. pretiosum ATCC 31565, with 99.9% identity. However, RT-qPCR analysis showed that the expression levels of ansamitocin biosynthetic genes were significantly increased in X47 compared with the levels in the wild-type strain, consistent with the higher yield of AP-3 in X47. The annotated complete genome sequence of this strain will facilitate understanding the molecular mechanisms of ansamitocin biosynthesis and regulation in A. pretiosum and help further genetic engineering studies to enhance the production of AP-3.


April 21, 2020

Mycobiome diversity: high-throughput sequencing and identification of fungi.

Fungi are major ecological players in both terrestrial and aquatic environments by cycling organic matter and channelling nutrients across trophic levels. High-throughput sequencing (HTS) studies of fungal communities are redrawing the map of the fungal kingdom by hinting at its enormous – and largely uncharted – taxonomic and functional diversity. However, HTS approaches come with a range of pitfalls and potential biases, cautioning against unwary application and interpretation of HTS technologies and results. In this Review, we provide an overview and practical recommendations for aspects of HTS studies ranging from sampling and laboratory practices to data processing and analysis. We also discuss upcoming trends and techniques in the field and summarize recent and noteworthy results from HTS studies targeting fungal communities and guilds. Our Review highlights the need for reproducibility and public data availability in the study of fungal communities. If the associated challenges and conceptual barriers are overcome, HTS offers immense possibilities in mycology and elsewhere.


April 21, 2020

The Draft Genome of an Octocoral, Dendronephthya gigantea.

Coral reefs composed of stony corals are threatened by global marine environmental changes. However, soft coral communities of octocorallian species, appear more resilient. The genomes of several cnidarians species have been published, including from stony corals, sea anemones, and hydra. To fill the phylogenetic gap for octocoral species of cnidarians, we sequenced the octocoral, Dendronephthya gigantea, a nonsymbiotic soft coral, commonly known as the carnation coral. The D. gigantea genome size is ~276?Mb. A high-quality genome assembly was constructed from PacBio long reads (29.85 Gb with 108× coverage) and Illumina short paired-end reads (35.54 Gb with 128× coverage) resulting in the highest N50 value (1.4?Mb) reported thus far among cnidarian genomes. About 12% of the genome is repetitive elements and contained 28,879 predicted protein-coding genes. This gene set is composed of 94% complete BUSCO ortholog benchmark genes, which is the second highest value among the cnidarians, indicating high quality. Based on molecular phylogenetic analysis, octocoral and hexacoral divergence times were estimated at 544 MYA. There is a clear difference in Hox gene composition between these species: unlike hexacorals, the Antp superclass Evx gene was absent in D. gigantea. Here, we present the first genome assembly of a nonsymbiotic octocoral, D. gigantea to aid in the comparative genomic analysis of cnidarians, including stony and soft corals, both symbiotic and nonsymbiotic. The D. gigantea genome may also provide clues to mechanisms of differential coping between the soft and stony corals in response to scenarios of global warming. © The Author(s) 2019. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.


April 21, 2020

The Reference Genome Sequence of Scutellaria baicalensis Provides Insights into the Evolution of Wogonin Biosynthesis.

Scutellaria baicalensis Georgi is important in Chinese traditional medicine where preparations of dried roots, “Huang Qin,” are used for liver and lung complaints and as complementary cancer treatments. We report a high-quality reference genome sequence for S. baicalensis where 93% of the 408.14-Mb genome has been assembled into nine pseudochromosomes with a super-N50 of 33.2 Mb. Comparison of this sequence with those of closely related species in the order Lamiales, Sesamum indicum and Salvia splendens, revealed that a specialized metabolic pathway for the synthesis of 4′-deoxyflavone bioactives evolved in the genus Scutellaria. We found that the gene encoding a specific cinnamate coenzyme A ligase likely obtained its new function following recent mutations, and that four genes encoding enzymes in the 4′-deoxyflavone pathway are present as tandem repeats in the genome of S. baicalensis. Further analyses revealed that gene duplications, segmental duplication, gene amplification, and point mutations coupled to gene neo- and subfunctionalizations were involved in the evolution of 4′-deoxyflavone synthesis in the genus Scutellaria. Our study not only provides significant insight into the evolution of specific flavone biosynthetic pathways in the mint family, Lamiaceae, but also will facilitate the development of tools for enhancing bioactive productivity by metabolic engineering in microbes or by molecular breeding in plants. The reference genome of S. baicalensis is also useful for improving the genome assemblies for other members of the mint family and offers an important foundation for decoding the synthetic pathways of bioactive compounds in medicinal plants.Copyright © 2019 The Authors. Published by Elsevier Inc. All rights reserved.


April 21, 2020

Distribution and Genetic Diversity of Genes Involved in Quorum Sensing and Prodigiosin Biosynthesis in the Complete Genome Sequences of Serratia marcescens.

Quorum sensing is a cell density-dependent regulation of gene expression. N-acyl-l-homoserine lactone (AHL) is a major quorum-sensing signaling molecule in gram-negative bacteria and synthesized by the LuxI family protein. The genus Serratia is known as a producer of the red pigment, prodigiosin, whose biosynthesis is dependent on the pig gene cluster. Some Serratia strains regulate prodigiosin production via AHL-mediated quorum sensing, whereas there is red-pigmented Serratia strains without quorum-sensing system. In addition, nonpigmented Serratia marcescens, which does not produce prodigiosin, has also been isolated from natural and clinical environments. In this study, we aim to reveal the distribution and genetic diversity of quorum-sensing genes and pig gene cluster in the complete genome sequences of S. marcescens. We previously demonstrated that S. marcescens AS-1 regulates the production of prodigiosin via AHL-mediated quorum sensing. We sequenced the genomes of AS-1 and compared with the complete genomes of AS-1 and the other 34 strains of S. marcescens. The luxI homolog was present on 25 complete genome sequences. The deduced amino acid sequences of the luxI homolog were divided into three phylogenetic classes. In contrast, the pig gene cluster was present in the genome of seven S. marcescens strains and only two strains, AS-1 and N4-5 contained both the luxI homolog and pig gene cluster in their genome. It is therefore assumed that prodigiosin production and its regulation by quorum sensing are not essential for the life cycle of S. marcescens. © The Author(s) 2019. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.


April 21, 2020

Genome Analysis of Carbaryl-Degrading Strain Pseudomonas putida XWY-1.

Carbaryl was a widely used pesticide in the agriculture industry. The toxicity against non-target organisms and the environmental pollution it caused became the focus of public concern. However, the microbial mechanism of carbaryl degradation was not fully investigated. In the study, we reported the complete genome of the carbaryl-degrading Pseudomonas putida strain XWY-1, which consists of a chromosome (5.9 Mbp) and a plasmid (0.4 Mbp). The carbaryl degradation genes are located on the plasmid. The study on the genome will facilitate to further elucidate the carbaryl degradation and advance the potential biotechnological applications of P. putida strain XWY-1.


April 21, 2020

Porous Zero-Mode Waveguides for Picogram-Level DNA Capture.

We have recently shown that nanopore zero-mode waveguides are effective tools for capturing picogram levels of long DNA fragments for single-molecule DNA sequencing. Despite these key advantages, the manufacturing of large arrays is not practical due to the need for serial nanopore fabrication. To overcome this challenge, we have developed an approach for the wafer-scale fabrication of waveguide arrays on low-cost porous membranes, which are deposited using molecular-layer deposition. The membrane at each waveguide base contains a network of serpentine pores that allows for efficient electrophoretic DNA capture at picogram levels while eliminating the need for prohibitive serial pore milling. Here, we show that the loading efficiency of these porous waveguides is up to 2 orders of magnitude greater than their nanopore predecessors. This new device facilitates the scaling-up of the process, greatly reducing the cost and effort of manufacturing. Furthermore, the porous zero-mode waveguides can be used for applications that benefit from low-input single-molecule real-time sequencing.


April 21, 2020

Biphasic cellular adaptations and ecological implications of Alteromonas macleodii degrading a mixture of algal polysaccharides.

Algal polysaccharides are an important bacterial nutrient source and central component of marine food webs. However, cellular and ecological aspects concerning the bacterial degradation of polysaccharide mixtures, as presumably abundant in natural habitats, are poorly understood. Here, we contextualize marine polysaccharide mixtures and their bacterial utilization in several ways using the model bacterium Alteromonas macleodii 83-1, which can degrade multiple algal polysaccharides and contributes to polysaccharide degradation in the oceans. Transcriptomic, proteomic and exometabolomic profiling revealed cellular adaptations of A. macleodii 83-1 when degrading a mix of laminarin, alginate and pectin. Strain 83-1 exhibited substrate prioritization driven by catabolite repression, with initial laminarin utilization followed by simultaneous alginate/pectin utilization. This biphasic phenotype coincided with pronounced shifts in gene expression, protein abundance and metabolite secretion, mainly involving CAZymes/polysaccharide utilization loci but also other functional traits. Distinct temporal changes in exometabolome composition, including the alginate/pectin-specific secretion of pyrroloquinoline quinone, suggest that substrate-dependent adaptations influence chemical interactions within the community. The ecological relevance of cellular adaptations was underlined by molecular evidence that common marine macroalgae, in particular Saccharina and Fucus, release mixtures of alginate and pectin-like rhamnogalacturonan. Moreover, CAZyme microdiversity and the genomic predisposition towards polysaccharide mixtures among Alteromonas spp. suggest polysaccharide-related traits as an ecophysiological factor, potentially relating to distinct ‘carbohydrate utilization types’ with different ecological strategies. Considering the substantial primary productivity of algae on global scales, these insights contribute to the understanding of bacteria-algae interactions and the remineralization of chemically diverse polysaccharide pools, a key step in marine carbon cycling.


April 21, 2020

Sequencing of Cultivated Peanut, Arachis hypogaea, Yields Insights into Genome Evolution and Oil Improvement.

Cultivated peanut (Arachis hypogaea) is an allotetraploid crop planted in Asia, Africa, and America for edible oil and protein. To explore the origins and consequences of tetraploidy, we sequenced the allotetraploid A. hypogaea genome and compared it with the related diploid Arachis duranensis and Arachis ipaensis genomes. We annotated 39 888 A-subgenome genes and 41 526 B-subgenome genes in allotetraploid peanut. The A. hypogaea subgenomes have evolved asymmetrically, with the B subgenome resembling the ancestral state and the A subgenome undergoing more gene disruption, loss, conversion, and transposable element proliferation, and having reduced gene expression during seed development despite lacking genome-wide expression dominance. Genomic and transcriptomic analyses identified more than 2 500 oil metabolism-related genes and revealed that most of them show altered expression early in seed development while their expression ceases during desiccation, presenting a comprehensive map of peanut lipid biosynthesis. The availability of these genomic resources will facilitate a better understanding of the complex genome architecture, agronomically and economically important genes, and genetic improvement of peanut.Copyright © 2019 The Authors. Published by Elsevier Inc. All rights reserved.


April 21, 2020

Diversification and collapse of a telomere elongation mechanism.

In most eukaryotes, telomerase counteracts chromosome erosion by adding repetitive sequence to terminal ends. Drosophila melanogaster instead relies on specialized retrotransposons that insert exclusively at telomeres. This exchange of goods between host and mobile element-wherein the mobile element provides an essential genome service and the host provides a hospitable niche for mobile element propagation-has been called a “genomic symbiosis.” However, these telomere-specialized, jockey family retrotransposons may actually evolve to “selfishly” overreplicate in the genomes that they ostensibly serve. Under this model, we expect rapid diversification of telomere-specialized retrotransposon lineages and, possibly, the breakdown of this ostensibly symbiotic relationship. Here we report data consistent with both predictions. Searching the raw reads of the 15-Myr-old melanogaster species group, we generated de novo jockey retrotransposon consensus sequences and used phylogenetic tree-building to delineate four distinct telomere-associated lineages. Recurrent gains, losses, and replacements account for this retrotransposon lineage diversity. In Drosophila biarmipes, telomere-specialized elements have disappeared completely. De novo assembly of long reads and cytogenetics confirmed this species-specific collapse of retrotransposon-dependent telomere elongation. Instead, telomere-restricted satellite DNA and DNA transposon fragments occupy its terminal ends. We infer that D. biarmipes relies instead on a recombination-based mechanism conserved from yeast to flies to humans. Telomeric retrotransposon diversification and disappearance suggest that persistently “selfish” machinery shapes telomere elongation across Drosophila rather than completely domesticated, symbiotic mobile elements. © 2019 Saint-Leandre et al.; Published by Cold Spring Harbor Laboratory Press.


April 21, 2020

Emergence of a ST2570 Klebsiella pneumoniae isolate carrying mcr-1 and blaCTX-M-14 recovered from a bloodstream infection in China.

The worldwide emergence of the plasmid-borne colistin resistance mediated by mcr-1 gene not only extended our knowledge on colistin resistance, but also poses a serious threat to clinical and public health [1, 2]. Since its first discovery, mcr-1-carrying Enterobacteriaceae from human, animal, food, and environmental origins have been widely identified, but few mcr-1-positive clinical strains of Klebsiella pneumoniae have been reported so far, especially when associated with community-acquired infections [3, 4]. Here, we report the emergence of a colistin-resistant K. pneumoniae isolate, which belonged to a rare sporadic clone, co-carrying mcr-1 and blaCTX-M-14 genes simultaneous recovered from a community-acquired bloodstream infection in China. Whole-genome sequencing and microbiological analysis were performed to elucidate its antimicrobial resistance mechanisms.


April 21, 2020

Enrichment of fetal and maternal long cell-free DNA fragments from maternal plasma following DNA repair.

Cell-free DNA (cfDNA) fragments in maternal plasma contain DNA damage and may negatively impact the sensitivity of noninvasive prenatal testing (NIPT). However, some of these DNA damages are potentially reparable. We aimed to recover these damaged cfDNA molecules using PreCR DNA repair mix.cfDNA was extracted from 20 maternal plasma samples and was repaired and sequenced by the Illumina platform. Size profiles and fetal DNA fraction changes of repaired samples were characterized. Targeted sequencing of chromosome Y sequences was used to enrich fetal cfDNA molecules following repair. Single-molecule real-time (SMRT) sequencing platform was employed to characterize long (>250 bp) cfDNA molecules. NIPT of five trisomy 21 samples was performed.Size profiles of repaired libraries were altered, with significantly increased long (>250 bp) cfDNA molecules. Single nucleotide polymorphism (SNP)-based analyses showed that both fetal- and maternal-derived cfDNA molecules were enriched by the repair. Fetal DNA fractions in maternal plasma showed a small but consistent (4.8%) increase, which were contributed by a higher increment of long fetal cfDNA molecules. z-score values were improved in NIPT of all trisomy 21 samples.Plasma DNA repair recovers and enriches long cfDNA molecules of both fetal and maternal origins in maternal plasma. © 2018 John Wiley & Sons, Ltd.


April 21, 2020

Long-read sequence and assembly of segmental duplications.

We have developed a computational method based on polyploid phasing of long sequence reads to resolve collapsed regions of segmental duplications within genome assemblies. Segmental Duplication Assembler (SDA; https://github.com/mvollger/SDA ) constructs graphs in which paralogous sequence variants define the nodes and long-read sequences provide attraction and repulsion edges, enabling the partition and assembly of long reads corresponding to distinct paralogs. We apply it to single-molecule, real-time sequence data from three human genomes and recover 33-79 megabase pairs (Mb) of duplications in which approximately half of the loci are diverged (<99.8%) compared to the reference genome. We show that the corresponding sequence is highly accurate (>99.9%) and that the diverged sequence corresponds to copy-number-variable paralogs that are absent from the human reference genome. Our method can be applied to other complex genomes to resolve the last gene-rich gaps, improve duplicate gene annotation, and better understand copy-number-variant genetic diversity at the base-pair level.


Talk with an expert

If you have a question, need to check the status of an order, or are interested in purchasing an instrument, we're here to help.