Menu
April 21, 2020

Whole genome sequencing of a novel, dichloromethane-fermenting Peptococcaceae from an enrichment culture

Bacteria capable of dechlorinating the toxic environmental contaminant dichloromethane (DCM, CHt2Cl2) are of great interest for potential bioremediation applications. A novel, strictly anaerobic, DCM-fermenting bacterium, “DCMF”, was enriched from organochlorine-contaminated groundwater near Botany Bay, Australia. The enrichment culture was maintained in minimal, mineral salt medium amended with dichloromethane as the sole energy source. PacBio whole genome SMRTtextsuperscriptTM sequencing of DCMF allowed textitde novo, gap-free assembly despite the presence of cohabiting organisms in the culture. Illumina sequencing reads were utilised to correct minor indels. The single, circularised 6.44 Mb chromosome was annotated with the IMG pipeline and contains 5,773 predicted protein-coding genes. Based on 16S rRNA gene and predicted proteome phylogeny, the organism appears to be a novel member of the textitPeptococcaceae family. The DCMF genome is large in comparison to known DCM-fermenting bacteria and includes 96 predicted methylamine methyltransferases, which may provide clues to the basis of its DCM metabolism. Full annotation has been provided in a custom genome browser and search tool, in addition to multiple sequence alignments and phylogenetic trees for every predicted protein, available at http://www.slimsuite.unsw.edu.au/research/dcmf/.


April 21, 2020

Complete genome sequence of Helicobacter pylori B128 7.13 and a single-step method for the generation of unmarked mutations.

Helicobacter pylori represents an interesting model of bacterial pathogenesis given that most infections are asymptomatic, while a minority of infections cause severe gastric disease. H pylori strain B128 7.13 is used extensively to understand H pylori pathophysiology. Due to extensive restriction-modification systems, the fact that only some H pylori strains are naturally transformable, the inability of common plasmid and transposon vectors to replicate in this bacterium, as well as the limited number of antibiotic cassettes that are functional in H pylori, there are relatively few genetic tools for the mutagenesis of this bacterium.Here, we use PacBio and Illumina sequencing to reveal the complete genome sequence of H pylori B128 7.13. Furthermore, we describe a system to generate markerless and scarless mutations on the H pylori chromosome using the counter-selection marker, galactokinase from Escherichia coli.We show that this mutagenesis strategy can be used to generate in-frame insertions, gene deletions, and multiple independent mutations in B128 7.13. Using the closed genome as a reference, we also report the absence of second site chromosomal mutations and/or rearrangements in our mutagenized strains. We compare the genome sequence of H pylori B128 7.13 with a closely related strain, H pylori B8, and reveal one notable region of difference, which is a 1430 bp insertion encoding a H pylori-specific DUF874 family protein of unknown function.This article reports the closed genome of the important H pylori B128 7.13 strain and a mutagenesis method that can be adopted by researchers as an alternative strategy to generate isogenic mutants of H pylori in order to further our understanding of this bacterium. © 2019. The Authors. Helicobacter Published by John Wiley & Sons Ltd.


April 21, 2020

Long-read amplicon denoising.

Long-read next-generation amplicon sequencing shows promise for studying complete genes or genomes from complex and diverse populations. Current long-read sequencing technologies have challenging error profiles, hindering data processing and incorporation into downstream analyses. Here we consider the problem of how to reconstruct, free of sequencing error, the true sequence variants and their associated frequencies from PacBio reads. Called ‘amplicon denoising’, this problem has been extensively studied for short-read sequencing technologies, but current solutions do not always successfully generalize to long reads with high indel error rates. We introduce two methods: one that runs nearly instantly and is very accurate for medium length reads and high template coverage, and another, slower method that is more robust when reads are very long or coverage is lower. On two Mock Virus Community datasets with ground truth, each sequenced on a different PacBio instrument, and on a number of simulated datasets, we compare our two approaches to each other and to existing algorithms. We outperform all tested methods in accuracy, with competitive run times even for our slower method, successfully discriminating templates that differ by a just single nucleotide. Julia implementations of Fast Amplicon Denoising (FAD) and Robust Amplicon Denoising (RAD), and a webserver interface, are freely available. © The Author(s) 2019. Published by Oxford University Press on behalf of Nucleic Acids Research.


April 21, 2020

High-throughput amplicon sequencing of the full-length 16S rRNA gene with single-nucleotide resolution.

Targeted PCR amplification and high-throughput sequencing (amplicon sequencing) of 16S rRNA gene fragments is widely used to profile microbial communities. New long-read sequencing technologies can sequence the entire 16S rRNA gene, but higher error rates have limited their attractiveness when accuracy is important. Here we present a high-throughput amplicon sequencing methodology based on PacBio circular consensus sequencing and the DADA2 sample inference method that measures the full-length 16S rRNA gene with single-nucleotide resolution and a near-zero error rate. In two artificial communities of known composition, our method recovered the full complement of full-length 16S sequence variants from expected community members without residual errors. The measured abundances of intra-genomic sequence variants were in the integral ratios expected from the genuine allelic variants within a genome. The full-length 16S gene sequences recovered by our approach allowed Escherichia coli strains to be correctly classified to the O157:H7 and K12 sub-species clades. In human fecal samples, our method showed strong technical replication and was able to recover the full complement of 16S rRNA alleles in several E. coli strains. There are likely many applications beyond microbial profiling for which high-throughput amplicon sequencing of complete genes with single-nucleotide resolution will be of use. © The Author(s) 2019. Published by Oxford University Press on behalf of Nucleic Acids Research.


April 21, 2020

Intercellular communication is required for trap formation in the nematode-trapping fungus Duddingtonia flagrans.

Nematode-trapping fungi (NTF) are a large and diverse group of fungi, which may switch from a saprotrophic to a predatory lifestyle if nematodes are present. Different fungi have developed different trapping devices, ranging from adhesive cells to constricting rings. After trapping, fungal hyphae penetrate the worm, secrete lytic enzymes and form a hyphal network inside the body. We sequenced the genome of Duddingtonia flagrans, a biotechnologically important NTF used to control nematode populations in fields. The 36.64 Mb genome encodes 9,927 putative proteins, among which are more than 638 predicted secreted proteins. Most secreted proteins are lytic enzymes, but more than 200 were classified as small secreted proteins (< 300 amino acids). 117 putative effector proteins were predicted, suggesting interkingdom communication during the colonization. As a first step to analyze the function of such proteins or other phenomena at the molecular level, we developed a transformation system, established the fluorescent proteins GFP and mCherry, adapted an assay to monitor protein secretion, and established gene-deletion protocols using homologous recombination or CRISPR/Cas9. One putative virulence effector protein, PefB, was transcriptionally induced during the interaction. We show that the mature protein is able to be imported into nuclei in Caenorhabditis elegans cells. In addition, we studied trap formation and show that cell-to-cell communication is required for ring closure. The availability of the genome sequence and the establishment of many molecular tools will open new avenues to studying this biotechnologically relevant nematode-trapping fungus.


April 21, 2020

Chromulinavorax destructans, a pathogen of microzooplankton that provides a window into the enigmatic candidate phylum Dependentiae.

Members of the major candidate phylum Dependentiae (a.k.a. TM6) are widespread across diverse environments from showerheads to peat bogs; yet, with the exception of two isolates infecting amoebae, they are only known from metagenomic data. The limited knowledge of their biology indicates that they have a long evolutionary history of parasitism. Here, we present Chromulinavorax destructans (Strain SeV1) the first isolate of this phylum to infect a representative from a widespread and ecologically significant group of heterotrophic flagellates, the microzooplankter Spumella elongata (Strain CCAP 955/1). Chromulinavorax destructans has a reduced 1.2 Mb genome that is so specialized for infection that it shows no evidence of complete metabolic pathways, but encodes an extensive transporter system for importing nutrients and energy in the form of ATP from the host. Its replication causes extensive reorganization and expansion of the mitochondrion, effectively surrounding the pathogen, consistent with its dependency on the host for energy. Nearly half (44%) of the inferred proteins contain signal sequences for secretion, including many without recognizable similarity to proteins of known function, as well as 98 copies of proteins with an ankyrin-repeat domain; ankyrin-repeats are known effectors of host modulation, suggesting the presence of an extensive host-manipulation apparatus. These observations help to cement members of this phylum as widespread and diverse parasites infecting a broad range of eukaryotic microbes.


April 21, 2020

Modern technologies and algorithms for scaffolding assembled genomes.

The computational reconstruction of genome sequences from shotgun sequencing data has been greatly simplified by the advent of sequencing technologies that generate long reads. In the case of relatively small genomes (e.g., bacterial or viral), complete genome sequences can frequently be reconstructed computationally without the need for further experiments. However, large and complex genomes, such as those of most animals and plants, continue to pose significant challenges. In such genomes, assembly software produces incomplete and fragmented reconstructions that require additional experimentally derived information and manual intervention in order to reconstruct individual chromosome arms. Recent technologies originally designed to capture chromatin structure have been shown to effectively complement sequencing data, leading to much more contiguous reconstructions of genomes than previously possible. Here, we survey these technologies and the algorithms used to assemble and analyze large eukaryotic genomes, placed within the historical context of genome scaffolding technologies that have been in existence since the dawn of the genomic era.


April 21, 2020

Genome-Wide Screening for Enteric Colonization Factors in Carbapenem-Resistant ST258 Klebsiella pneumoniae.

A diverse, antibiotic-naive microbiota prevents highly antibiotic-resistant microbes, including carbapenem-resistant Klebsiella pneumoniae (CR-Kp), from achieving dense colonization of the intestinal lumen. Antibiotic-mediated destruction of the microbiota leads to expansion of CR-Kp in the gut, markedly increasing the risk of bacteremia in vulnerable patients. While preventing dense colonization represents a rational approach to reduce intra- and interpatient dissemination of CR-Kp, little is known about pathogen-associated factors that enable dense growth and persistence in the intestinal lumen. To identify genetic factors essential for dense colonization of the gut by CR-Kp, we constructed a highly saturated transposon mutant library with >150,000 unique mutations in an ST258 strain of CR-Kp and screened for in vitro growth and in vivo intestinal colonization in antibiotic-treated mice. Stochastic and partially reversible fluctuations in the representation of different mutations during dense colonization revealed the dynamic nature of intestinal microbial populations. We identified genes that are crucial for early and late stages of dense gut colonization and confirmed their role by testing isogenic mutants in in vivo competition assays with wild-type CR-Kp Screening of the transposon library also identified mutations that enhanced in vivo CR-Kp growth. These newly identified colonization factors may provide novel therapeutic opportunities to reduce intestinal colonization by CR-KpIMPORTANCEKlebsiella pneumoniae is a common cause of bloodstream infections in immunocompromised and hospitalized patients, and over the last 2 decades, some strains have acquired resistance to nearly all available antibiotics, including broad-spectrum carbapenems. The U.S. Centers for Disease Control and Prevention has listed carbapenem-resistant K. pneumoniae (CR-Kp) as an urgent public health threat. Dense colonization of the intestine by CR-Kp and other antibiotic-resistant bacteria is associated with an increased risk of bacteremia. Reducing the density of gut colonization by CR-Kp is likely to reduce their transmission from patient to patient in health care facilities as well as systemic infections. How CR-Kp expands and persists in the gut lumen, however, is poorly understood. Herein, we generated a highly saturated mutant library in a multidrug-resistant K. pneumoniae strain and identified genetic factors that are associated with dense gut colonization by K. pneumoniae This study sheds light on host colonization by K. pneumoniae and identifies potential colonization factors that contribute to high-density persistence of K. pneumoniae in the intestine. Copyright © 2019 Jung et al.


April 21, 2020

Trophic specialization results in genomic reduction in free-living marine idiomarina bacteria.

The streamlining hypothesis is generally used to explain the genomic reduction events related to the small genome size of free-living bacteria like marine bacteria SAR11. However, our current understanding of the correlation between bacterial genome size and environmental adaptation relies on too few species. It is still unclear whether there are other paths leading to genomic reduction in free-living bacteria. The genome size of marine free-living bacteria of the genus Idiomarina belonging to the order Alteromonadales (Gammaproteobacteria) is much smaller than the size of related genomes from bacteria in the same order. Comparative genomic and physiological analyses showed that the genomic reduction pattern in this genus is different from that of the classical SAR11 lineage. Genomic reduction reconstruction and substrate utilization profile showed that Idiomarina spp. lost a large number of genes related to carbohydrate utilization, and instead they specialized on using proteinaceous resources. Here we propose a new hypothesis to explain genomic reduction in this genus; we propose that trophic specialization increasing the metabolic efficiency for using one kind of substrate but reducing the substrate utilization spectrum could result in bacterial genomic reduction, which would be not uncommon in nature. This hypothesis was further tested in another free-living genus, Kangiella, which also shows dramatic genomic reduction. These findings highlight that trophic specialization is potentially an important path leading to genomic reduction in some marine free-living bacteria, which is distinct from the classical lineages like SAR11.IMPORTANCE The streamlining hypothesis is usually used to explain the genomic reduction events in free-living bacteria like SAR11. However, we find that the genomic reduction phenomenon in the bacterial genus Idiomarina is different from that in SAR11. Therefore, we propose a new hypothesis to explain genomic reduction in this genus based on trophic specialization that could result in genomic reduction, which would be not uncommon in nature. Not only can the trophic specialization hypothesis explain the genomic reduction in the genus Idiomarina, but it also sheds new light on our understanding of the genomic reduction processes in other free-living bacterial lineages. Copyright © 2019 Qin et al.


April 21, 2020

A chromosome-level sequence assembly reveals the structure of the Arabidopsis thaliana Nd-1 genome and its gene set.

In addition to the BAC-based reference sequence of the accession Columbia-0 from the year 2000, several short read assemblies of THE plant model organism Arabidopsis thaliana were published during the last years. Also, a SMRT-based assembly of Landsberg erecta has been generated that identified translocation and inversion polymorphisms between two genotypes of the species. Here we provide a chromosome-arm level assembly of the A. thaliana accession Niederzenz-1 (AthNd-1_v2c) based on SMRT sequencing data. The best assembly comprises 69 nucleome sequences and displays a contig length of up to 16 Mbp. Compared to an earlier Illumina short read-based NGS assembly (AthNd-1_v1), a 75 fold increase in contiguity was observed for AthNd-1_v2c. To assign contig locations independent from the Col-0 gold standard reference sequence, we used genetic anchoring to generate a de novo assembly. In addition, we assembled the chondrome and plastome sequences. Detailed analyses of AthNd-1_v2c allowed reliable identification of large genomic rearrangements between A. thaliana accessions contributing to differences in the gene sets that distinguish the genotypes. One of the differences detected identified a gene that is lacking from the Col-0 gold standard sequence. This de novo assembly extends the known proportion of the A. thaliana pan-genome.


April 21, 2020

Genomic characterization of Kerstersia gyiorum SWMUKG01, an isolate from a patient with respiratory infection in China.

The Gram-negative bacterium Kerstersia gyiorum, a potential etiological agent of clinical infections, was isolated from several human patients presenting clinical symptoms. Its significance as a possible pathogen has been previously overlooked as no disease has thus far been definitively associated with this bacterium. To better understand how the organism contributes to the infectious disease, we determined the complete genomic sequence of K. gyiorum SWMUKG01, the first clinical isolate from southwest China.The genomic data obtained displayed a single circular chromosome of 3, 945, 801 base pairs in length, which contains 3, 441 protein-coding genes, 55 tRNA genes and 9 rRNA genes. Analysis on the full spectrum of protein coding genes for cellular structures, two-component regulatory systems and iron uptake pathways that may be important for the success of the bacterial survival, colonization and establishment in the host conferred new insights into the virulence characteristics of K. gyiorum. Phylogenomic comparisons with Alcaligenaceae species indicated that K. gyiorum SWMUKG01 had a close evolutionary relationships with Alcaligenes aquatilis and Alcaligenes faecalis.The comprehensive analysis presented in this work determinates for the first time a complete genome sequence of K. gyiorum, which is expected to provide useful information for subsequent studies on pathogenesis of this species.


April 21, 2020

Comparative genomic analysis of eight novel haloalkaliphilic bacteriophages from Lake Elmenteita, Kenya.

We report complete genome sequences of eight bacteriophages isolated from Haloalkaline Lake Elmenteita found on the floor of Kenyan Rift Valley. The bacteriophages were sequenced, annotated and a comparative genomic analysis using various Bioinformatics tools carried out to determine relatedness of the bacteriophages to each other, and to those in public databases. Basic genome properties like genome size, percentage coding density, number of open reading frames, percentage GC content and gene organizations revealed the bacteriophages had no relationship to each other. Comparison to other nucleotide sequences in GenBank database showed no significant similarities hence novel. At the amino acid level, phages of our study revealed mosaicism to genes with conserved domains to already described phages. Phylogenetic analyses of large terminase gene responsible for DNA packaging and DNA polymerase gene for replication further showed diversity among the bacteriophages. Our results give insight into diversity of bacteriophages in Lake Elmenteita and provide information on their evolution. By providing primary sequence information, this study not only provides novel sequences for biotechnological exploitation, but also sets stage for future studies aimed at better understanding of virus diversity and genomes from haloalkaline lakes in the Rift Valley.


April 21, 2020

Information about variations in multiple copies of bacterial 16S rRNA genes may aid in species identification.

Variable region analysis of 16S rRNA gene sequences is the most common tool in bacterial taxonomic studies. Although used for distinguishing bacterial species, its use remains limited due to the presence of variable copy numbers with sequence variation in the genomes. In this study, 16S rRNA gene sequences, obtained from completely assembled whole genome and Sanger electrophoresis sequencing of cloned PCR products from Serratia fonticola GS2, were compared. Sanger sequencing produced a combination of sequences from multiple copies of 16S rRNA genes. To determine whether the variant copies of 16S rRNA genes affected Sanger sequencing, two ratios (5:5 and 8:2) with different concentrations of cloned 16S rRNA genes were used; it was observed that the greater the number of copies with similar sequences the higher its chance of amplification. Effect of multiple copies for taxonomic classification of 16S rRNA gene sequences was investigated using the strain GS2 as a model. 16S rRNA copies with the maximum variation had 99.42% minimum pairwise similarity and this did not have an effect on species identification. Thus, PCR products from genomes containing variable 16S rRNA gene copies can provide sufficient information for species identification except from species which have high similarity of sequences in their 16S rRNA gene copies like the case of Bacillus thuringiensis and Bacillus cereus. In silico analysis of 1,616 bacterial genomes from long-read sequencing was also done. The average minimum pairwise similarity for each phylum was reported with their average genome size and average “unique copies” of 16S rRNA genes and we found that the phyla Proteobacteria and Firmicutes showed the highest amount of variation in their copies of their 16S rRNA genes. Overall, our results shed light on how the variations in the multiple copies of the 16S rRNA genes of bacteria can aid in appropriate species identification.


Talk with an expert

If you have a question, need to check the status of an order, or are interested in purchasing an instrument, we're here to help.