July 7, 2019

De novo assembly of Dekkera bruxellensis: a multi technology approach using short and long-read sequencing and optical mapping.

It remains a challenge to perform de novo assembly using next-generation sequencing (NGS). Despite the availability of multiple sequencing technologies and tools (e.g., assemblers) it is still difficult to assemble new genomes at chromosome resolution (i.e., one sequence per chromosome). Obtaining high quality draft assemblies is extremely important in the case of yeast genomes to better characterise major events in their evolutionary history. The aim of this work is two-fold: on the one hand we want to show how combining different and somewhat complementary technologies is key to improving assembly quality and correctness, and on the other hand we present a de novo assembly pipeline we believe to be beneficial to core facility bioinformaticians. To demonstrate both the effectiveness of combining technologies and the simplicity of the pipeline, here we present the results obtained using the Dekkera bruxellensis genome. In this work we used short-read Illumina data and long-read PacBio data combined with the extreme long-range information from OpGen optical maps in the task of de novo genome assembly and finishing. Moreover, we developed NouGAT, a semi-automated pipeline for read-preprocessing, de novo assembly and assembly evaluation, which was instrumental for this work. We obtained a high quality draft assembly of a yeast genome, resolved on a chromosomal level. Furthermore, this assembly was corrected for mis-assembly errors as demonstrated by resolving a large collapsed repeat and by receiving higher scores by assembly evaluation tools. With the inclusion of PacBio data we were able to fill about 5 % of the optical mapped genome not covered by the Illumina data.


July 7, 2019

Evaluation and validation of assembling corrected PacBio long reads for microbial genome completion via hybrid approaches.

Despite the ever-increasing output of next-generation sequencing data along with developing assemblers, dozens to hundreds of gaps still exist in de novo microbial assemblies due to uneven coverage and large genomic repeats. Third-generation single-molecule, real-time (SMRT) sequencing technology avoids amplification artifacts and generates kilobase-long reads with the potential to complete microbial genome assembly. However, due to the low accuracy (~85%) of third-generation sequences, a considerable number of long reads (>50X) are required for self-correction and for subsequent de novo assembly. Recently-developed hybrid approaches, using next-generation sequencing data and as few as 5X long reads, have been proposed to improve the completeness of microbial assembly. In this study we have evaluated the contemporary hybrid approaches and demonstrated that assembling corrected long reads (by runCA) produced the best assembly compared to long-read scaffolding (e.g., AHA, Cerulean and SSPACE-LongRead) and gap-filling (SPAdes). For generating corrected long reads, we further examined long-read correction tools, such as ECTools, LSC, LoRDEC, PBcR pipeline and proovread. We have demonstrated that three microbial genomes including Escherichia coli K12 MG1655, Meiothermus ruber DSM1279 and Pedobacter heparinus DSM2366 were successfully hybrid assembled by runCA into near-perfect assemblies using ECTools-corrected long reads. In addition, we developed a tool, Patch, which takes corrected long reads and pre-assembled contigs as inputs, to enhance microbial genome assemblies. With the additional 20X long reads, short reads of S. cerevisiae W303 were hybrid assembled into 115 contigs using the verified strategy, ECTools + runCA. Patch was subsequently applied to upgrade the assembly to a 35-contig draft genome.
Our evaluation of the hybrid approaches shows that assembling the ECTools-corrected long reads via runCA generates near complete microbial genomes, suggesting that genome assembly could benefit from re-analyzing the available hybrid datasets that were not assembled in an optimal fashion.


July 7, 2019

Bovine NK-lysin: Copy number variation and functional diversification.

NK-lysin is an antimicrobial peptide and effector protein in the host innate immune system. It is coded by a single gene in humans and most other mammalian species. In this study, we provide evidence for the existence of four NK-lysin genes in a repetitive region on cattle chromosome 11. The NK2A, NK2B, and NK2C genes are tandemly arrayed as three copies in ~30-35-kb segments, located 41.8 kb upstream of NK1. All four genes are functional, albeit with differential tissue expression. NK1, NK2A, and NK2B exhibited the highest expression in intestine Peyer’s patch, whereas NK2C was expressed almost exclusively in lung. The four peptide products were synthesized ex vivo, and their antimicrobial effects against both Gram-positive and Gram-negative bacteria were confirmed with a bacteria-killing assay. Transmission electron microscopy indicated that bovine NK-lysins exhibited their antimicrobial activities by lytic action in the cell membranes. In summary, the single NK-lysin gene in other mammals has expanded to a four-member gene family by tandem duplications in cattle; all four genes are transcribed, and the synthetic peptides corresponding to the core regions are biologically active and likely contribute to innate immunity in ruminants.


July 7, 2019

Exploring the genomic traits of fungus-feeding bacterial genus Collimonas.

Collimonas is a genus belonging to the class of Betaproteobacteria and consists mostly of soil bacteria with the ability to exploit living fungi as a food source (mycophagy). Collimonas strains differ in a range of activities, including swimming motility, quorum sensing, extracellular protease activity, siderophore production, and antimicrobial activities. In order to reveal ecological traits possibly related to Collimonas lifestyle and secondary metabolite production, we performed a comparative genomics analysis based on whole-genome sequencing of six strains representing three recognized species. The analysis revealed that the core genome represents 43.1 to 52.7 % of the genomes of the six individual strains. These include genes coding for extracellular enzymes (chitinase, peptidase, phospholipase), iron acquisition and type II secretion systems. In the variable genome, differences were found in genes coding for secondary metabolites (e.g. tripropeptin A and volatile terpenes), several unknown orphan polyketide synthase-nonribosomal peptide synthetase (PKS-NRPS), nonribosomal peptide synthetase (NRPS) gene clusters, a new lipopeptide and type III and type VI secretion systems. Potential roles of the latter genes in the interaction with other organisms were investigated. Mutation of a gene involved in tripropeptin A biosynthesis strongly reduced the antibacterial activity against Staphylococcus aureus, while disruption of a gene involved in the biosynthesis of the new lipopeptide had a large effect on the antifungal/oomycetal activities. Overall our results indicated that Collimonas genomes harbour many genes encoding novel enzymes and secondary metabolites (including terpenes) important for interactions with other organisms and revealed genomic plasticity, which reflects the behaviour, antimicrobial activity and lifestyles of Collimonas spp.
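
As a rough illustration of how a per-strain core-genome share such as the range quoted above can be computed, the following minimal Python sketch treats each strain as a set of gene-family identifiers and takes the core genome to be the families present in every strain. The strain names, gene identifiers and the function name `core_genome_fraction` are hypothetical; the abstract does not describe how the study clustered orthologs, so this is a sketch under that assumption, not the authors' method.

```python
from typing import Dict, Set


def core_genome_fraction(strains: Dict[str, Set[str]]) -> Dict[str, float]:
    """Return, per strain, the share of its gene families found in every strain."""
    if not strains:
        return {}
    # Core genome: gene families shared by all strains (assumed definition).
    core = set.intersection(*strains.values())
    return {
        name: (len(core) / len(genes) if genes else 0.0)
        for name, genes in strains.items()
    }


# Toy data; all identifiers are hypothetical and not taken from the study.
toy_strains = {
    "strain_A": {"g1", "g2", "g3", "g4"},
    "strain_B": {"g1", "g2", "g5"},
    "strain_C": {"g1", "g2", "g3", "g6", "g7"},
}
fractions = core_genome_fraction(toy_strains)
```

Because the same core set is divided by each strain's own gene count, strains with larger genomes show a smaller core share, which is why such a result is reported as a range rather than a single value.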


July 7, 2019

Botrytis, the good, the bad and the ugly

Botrytis spp. are efficient pathogens, causing devastating diseases and significant crop losses in a wide variety of plant species. Here we outline our review of these pathogens and highlight the major advances of the past 10 years in studying Botrytis in interaction with its hosts. Progress in molecular genetics, and the development of relevant phylogenetic markers in particular, has resulted in the characterisation of approximately 30 species. The host range of Botrytis spp. includes plant species that are members of 170 families of cultivated plants.


July 7, 2019

Current overview on the study of bacteria in the rhizosphere by modern molecular techniques: a mini–review

The rhizosphere (soil zone influenced by roots) is a complex environment that harbors diverse bacterial populations, which have an important role in biogeochemical cycling of organic matter and mineral nutrients. Nevertheless, our knowledge of the ecology and role of these bacteria in the rhizosphere is very limited, particularly regarding how indigenous bacteria are able to communicate, colonize root environments, and compete along the rhizosphere microsites. In recent decades, the development and improvement of molecular techniques have provided more accurate knowledge of bacteria in their natural environment, refining microbial ecology and generating new questions about the roles and functions of bacteria in the rhizosphere. Recently, advances in soil post-genomic techniques (metagenomics, metaproteomics and metatranscriptomics) are being applied to improve our understanding of the microbial communities at a higher resolution. Moreover, advantages and limitations of classical and post-genomic techniques must be considered when studying bacteria in the rhizosphere. This review provides an overview of the current knowledge on the study of bacterial community in the rhizosphere by using modern molecular techniques, describing the bias of classical molecular techniques, next generation sequencing platforms and post-genomic techniques.


July 7, 2019

Quorum sensing activity of Aeromonas caviae strain YL12, a bacterium isolated from compost.

Quorum sensing is a well-studied cell-to-cell communication method that involves cell-density-dependent regulation of gene expression mediated by signalling molecules. In this study, a bacterium isolated from a plant material compost pile was found to possess quorum sensing activity based on bioassay screening. Isolate YL12 was identified using matrix-assisted laser desorption ionization time-of-flight (MALDI-TOF) mass spectrometry and molecular typing using the rpoD gene, which identified the isolate as Aeromonas caviae. High resolution tandem mass spectrometry was subsequently employed to identify the N-acyl homoserine lactone profile of Aeromonas caviae YL12 and confirmed that this isolate produced two short chain N-acyl homoserine lactones, namely C4-HSL and C6-HSL, and the production was observed to be cell density-dependent. Using the thin layer chromatography (TLC) bioassay, both AHLs were found to activate C. violaceum CV026, whereas only C6-HSL was revealed to induce bioluminescence expression of E. coli [pSB401]. The data presented in this study are first steps toward understanding the role of quorum sensing in Aeromonas caviae strain YL12.


July 7, 2019

Draft genome sequence of Kluyveromyces marxianus strain DMB1, isolated from sugarcane bagasse hydrolysate.

We determined the genome sequence of a thermotolerant yeast, Kluyveromyces marxianus strain DMB1, isolated from sugarcane bagasse hydrolysate, and the sequence provides further insights into the genomic differences between this strain and other reported K. marxianus strains. The genome described here is composed of 11,165,408 bases and has 4,943 protein-coding genes. Copyright © 2014 Suzuki et al.


July 7, 2019

Draft genome sequence of Kitasatospora cheerisanensis KCTC 2395, which produces plecomacrolide against phytopathogenic fungi.

Kitasatospora cheerisanensis KCTC 2395, which produces antifungal metabolites with bafilomycin derivatives, including bafilomycin C1-amide, was isolated from a soil sample at Mt. Jiri, South Korea. Here, we report its draft genome sequence, which contains 8.04 Mb with 73.6% G+C content and 7,810 protein-coding genes. Copyright © 2014 Hwang et al.


July 7, 2019

The oxygen-independent metabolism of cyclic monoterpenes in Castellaniella defragrans 65Phen.

The facultatively anaerobic betaproteobacterium Castellaniella defragrans 65Phen utilizes acyclic, monocyclic and bicyclic monoterpenes as sole carbon source under oxic as well as anoxic conditions. A biotransformation pathway of the acyclic β-myrcene required linalool dehydratase-isomerase as initial enzyme acting on the hydrocarbon. An in-frame deletion mutant did not use myrcene, but was able to grow on monocyclic monoterpenes. The genome sequence and a comparative proteome analysis together with a random transposon mutagenesis were conducted to identify genes involved in the monocyclic monoterpene metabolism. Metabolites accumulating in cultures of transposon and in-frame deletion mutants disclosed the degradation pathway. Castellaniella defragrans 65Phen oxidizes the monocyclic monoterpene limonene at the primary methyl group forming perillyl alcohol. The genome of 3.95 Mb contained a 70 kb genome island coding for over 50 proteins involved in the monoterpene metabolism. This island showed higher homology to genes of another monoterpene-mineralizing betaproteobacterium, Thauera terpenica 58EuT, than to genomes of the family Alcaligenaceae, which harbors the genus Castellaniella. A collection of 72 transposon mutants unable to grow on limonene contained 17 inactivated genes, with 46 mutants located in the two genes ctmAB (cyclic terpene metabolism). CtmA and ctmB were annotated as FAD-dependent oxidoreductases and clustered together with ctmE, a 2Fe-2S ferredoxin gene, and ctmF, coding for a NADH:ferredoxin oxidoreductase. Transposon mutants of ctmA, B or E did not grow aerobically or anaerobically on limonene, but did grow on perillyl alcohol. The next steps in the pathway are catalyzed by the geraniol dehydrogenase GeoA and the geranial dehydrogenase GeoB, yielding perillic acid. Two transposon mutants had inactivated genes of the monoterpene ring cleavage (mrc) pathway.
2-Methylcitrate synthase and 2-methylcitrate dehydratase were also essential for the monoterpene metabolism but not for growth on acetate. The genome of Castellaniella defragrans 65Phen is related to other genomes of Alcaligenaceae, but contains a genomic island with genes of the monoterpene metabolism. Castellaniella defragrans 65Phen degrades limonene via a limonene dehydrogenase and the oxidation of perillyl alcohol. The initial oxidation at the primary methyl group is independent of molecular oxygen.


July 7, 2019

Organellar genomes of the four-toothed moss, Tetraphis pellucida.

Mosses are the largest of the three extant clades of gametophyte-dominant land plants and remain poorly studied using comparative genomic methods. Major monophyletic moss lineages are characterised by different types of a spore dehiscence apparatus called the peristome, and the most important unsolved problem in higher-level moss systematics is the branching order of these peristomate clades. Organellar genome sequencing offers the potential to resolve this issue through the provision of both genomic structural characters and a greatly increased quantity of nucleotide substitution characters, as well as to elucidate organellar evolution in mosses. We publish and describe the chloroplast and mitochondrial genomes of Tetraphis pellucida, representative of the most phylogenetically intractable and morphologically isolated peristomate lineage. Assembly of reads from Illumina SBS and Pacific Biosciences RS sequencing reveals that the Tetraphis chloroplast genome comprises 127,489 bp and the mitochondrial genome 107,730 bp. Although genomic structures are similar to those of the small number of other known moss organellar genomes, the chloroplast lacks the petN gene (in common with Tortula ruralis) and the mitochondrion has only a non-functional pseudogenised remnant of nad7 (uniquely amongst known moss chondromes). Structural genomic features exist with the potential to be informative for phylogenetic relationships amongst the peristomate moss lineages, and thus organellar genome sequences are urgently required for exemplars from other clades. The unique genomic and morphological features of Tetraphis confirm its importance for resolving one of the major questions in land plant phylogeny and for understanding the evolution of the peristome, a likely key innovation underlying the diversity of mosses.
The functional loss of nad7 from the chondrome is now shown to have occurred independently in all three bryophyte clades as well as in the early-diverging tracheophyte Huperzia squarrosa.


July 7, 2019

Genome sequence of Pseudomonas sp. strain P482, a tomato rhizosphere isolate with broad-spectrum antimicrobial activity.

The tomato rhizosphere isolate Pseudomonas sp. strain P482 is a member of a diverse group of fluorescent pseudomonads. P482 produces one or more as yet unidentified broad-spectrum antimicrobial compounds that are active against, among others, Dickeya spp. Here, we present a nearly complete genome of P482 obtained by a hybrid assembly of Illumina and PacBio sequencing data. Copyright © 2014 Krzyzanowska et al.


July 7, 2019

Genome Sequence of Pseudomonas brassicacearum DF41.

Pseudomonas brassicacearum DF41, a Gram-negative soil bacterium, is able to suppress the fungal pathogen Sclerotinia sclerotiorum through a process known as biological control. Here, we present a 6.8-Mb assembly of its genome, which is the second fully assembled genome of a P. brassicacearum strain.


July 7, 2019

Complete genome sequence of the sugar cane endophyte Pseudomonas aurantiaca PB-St2, a disease-suppressive bacterium with antifungal activity toward the plant pathogen Colletotrichum falcatum.

The endophytic bacterium Pseudomonas aurantiaca PB-St2 exhibits antifungal activity and represents a biocontrol agent to suppress red rot disease of sugar cane. Here, we report the completely sequenced 6.6-Mb genome of P. aurantiaca PB-St2. The sequence contains a repertoire of biosynthetic genes for secondary metabolites that putatively contribute to its antagonistic activity and its plant-microbe interactions.


July 7, 2019

Whole-genome analysis of Exserohilum rostratum from an outbreak of fungal meningitis and other infections.

Exserohilum rostratum was the cause of most cases of fungal meningitis and other infections associated with the injection of contaminated methylprednisolone acetate produced by the New England Compounding Center (NECC). Until this outbreak, very few human cases of Exserohilum infection had been reported, and very little was known about this dematiaceous fungus, which usually infects plants. Here, we report using whole-genome sequencing (WGS) for the detection of single nucleotide polymorphisms (SNPs) and phylogenetic analysis to investigate the molecular origin of the outbreak using 22 isolates of E. rostratum retrieved from 19 case patients with meningitis or epidural/spinal abscesses, 6 isolates from contaminated NECC vials, and 7 isolates unrelated to the outbreak. Our analysis indicates that all 28 isolates associated with the outbreak had nearly identical genomes of 33.8 Mb. A total of 8 SNPs were detected among the outbreak genomes, with no more than 2 SNPs separating any 2 of the 28 genomes. The outbreak genomes were separated from the next most closely related control strain by ~136,000 SNPs. We also observed significant genomic variability among strains unrelated to the outbreak, which may suggest the possibility of cryptic speciation in E. rostratum. Copyright © 2014, American Society for Microbiology. All Rights Reserved.
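
To make the statement "no more than 2 SNPs separating any 2 of the 28 genomes" concrete, the minimal Python sketch below counts differing positions between every pair of isolates and takes the maximum. It assumes each isolate has already been reduced to a string of base calls at the same variable sites; the isolate names, sequences and the function name `pairwise_snp_distances` are hypothetical and do not reproduce the study's actual SNP-calling pipeline.

```python
from itertools import combinations
from typing import Dict, Tuple


def pairwise_snp_distances(genotypes: Dict[str, str]) -> Dict[Tuple[str, str], int]:
    """Count differing positions between every pair of equal-length genotype strings."""
    lengths = {len(calls) for calls in genotypes.values()}
    if len(lengths) > 1:
        raise ValueError("all genotype strings must have the same length")
    return {
        (a, b): sum(x != y for x, y in zip(genotypes[a], genotypes[b]))
        for a, b in combinations(sorted(genotypes), 2)
    }


# Toy base calls at shared variable sites; hypothetical, not data from the study.
toy_genotypes = {
    "isolate_1": "ACGTACGT",
    "isolate_2": "ACGTACGA",
    "isolate_3": "ACCTACGA",
}
distances = pairwise_snp_distances(toy_genotypes)
max_distance = max(distances.values())  # largest separation between any two isolates
```

In this toy set the maximum pairwise distance is 2, so the isolates would be described in the same way as the outbreak genomes above: tightly clustered, with at most 2 SNPs between any pair.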

