Development of high-throughput sequencing techniques has greatly benefited our understanding of microbial ecology; yet the methods producing short reads suffer from limited species-level resolution and uncertainty of identification. Here we optimize PacBio-based metabarcoding protocols covering the Internal Transcribed Spacer (ITS region) and partial Small Subunit (SSU) of the rRNA gene for species-level identification of all eukaryotes, with a specific focus on Fungi (including Glomeromycota) and Stramenopila (particularly Oomycota). Based on tests on composite soil samples and mock communities, we propose the most suitable degenerate primers, ITS9munngs + ITS4ngsUni, for eukaryotes and selected groups therein, and discuss the pros and cons of long read-based identification of eukaryotes. This article is protected by copyright. All rights reserved.
Relative Performance of MinION (Oxford Nanopore Technologies) versus Sequel (Pacific Biosciences) Third-Generation Sequencing Instruments in Identification of Agricultural and Forest Fungal Pathogens.
Culture-based molecular identification methods have revolutionized detection of pathogens, yet these methods are slow and may yield inconclusive results from environmental materials. The second-generation sequencing tools have much-improved precision and sensitivity of detection, but these analyses are costly and may take several days to months. Of the third-generation sequencing techniques, the portable MinION device (Oxford Nanopore Technologies) has received much attention because of its small size and possibility of rapid analysis at reasonable cost. Here, we compare the relative performances of two third-generation sequencing instruments, MinION and Sequel (Pacific Biosciences), in identification and diagnostics of fungal and oomycete pathogens from conifer (Pinaceae) needles and potato (Solanum tuberosum) leaves and tubers. We demonstrate that the Sequel instrument is efficient for metabarcoding of complex samples, whereas MinION is not suited for this purpose due to a high error rate and multiple biases. However, we find that MinION can be utilized for rapid and accurate identification of dominant pathogenic organisms and other associated organisms from plant tissues following both amplicon-based and PCR-free metagenomics approaches. Using the metagenomics approach with shortened DNA extraction and incubation times, we performed the entire MinION workflow, from sample preparation through DNA extraction, sequencing, bioinformatics, and interpretation, in 2.5 h. We advocate the use of MinION for rapid diagnostics of pathogens and potentially other organisms, but care needs to be taken to control or account for multiple potential technical biases. IMPORTANCE: Microbial pathogens cause enormous losses to agriculture and forestry, but current combined culturing- and molecular identification-based detection methods are too slow for rapid identification and application of countermeasures.
Here, we develop new and rapid protocols for Oxford Nanopore MinION-based third-generation diagnostics of plant pathogens that greatly improve the speed of diagnostics. However, due to high error rate and technical biases in MinION, the Pacific Biosciences Sequel platform is more useful for in-depth amplicon-based biodiversity monitoring (metabarcoding) from complex environmental samples. Copyright © 2019 American Society for Microbiology.
Identification of Initial Colonizing Bacteria in Dental Plaques from Young Adults Using Full-Length 16S rRNA Gene Sequencing.
Development of dental plaque begins with the adhesion of salivary bacteria to the acquired pellicle covering the tooth surface. In this study, we collected in vivo dental plaque formed on hydroxyapatite disks for 6 h from 74 young adults and identified initial colonizing taxa based on full-length 16S rRNA gene sequences. A long-read, single-molecule sequencer, PacBio Sequel, provided 100,109 high-quality full-length 16S rRNA gene sequence reads from the early plaque microbiota, which were assigned to 90 oral bacterial taxa. The microbiota obtained from every individual mostly comprised the 21 predominant taxa with the maximum relative abundance of over 10% (95.8 ± 6.2%, mean ± SD), which included Streptococcus species as well as nonstreptococcal species. A hierarchical cluster analysis of their relative abundance distribution suggested three major patterns of microbiota compositions: a Streptococcus mitis/Streptococcus sp. HMT-423-dominant profile, a Neisseria sicca/Neisseria flava/Neisseria mucosa-dominant profile, and a complex profile with high diversity. No notable variations in the community structures were associated with the dental caries status, although the total bacterial amounts were larger in the subjects with a high number of caries-experienced teeth (≥8) than in those with no or a low number of caries-experienced teeth. Our results revealed the bacterial taxa primarily involved in early plaque formation on hydroxyapatite disks in young adults. IMPORTANCE: Selective attachment of salivary bacteria to the tooth surface is an initial and repetitive phase in dental plaque development. We employed full-length 16S rRNA gene sequence analysis with a high taxonomic resolution using a third-generation sequencer, PacBio Sequel, to determine the bacterial composition during early plaque formation in 74 young adults accurately and in detail.
The results revealed 21 bacterial taxa primarily involved in early plaque formation on hydroxyapatite disks in young adults, which include several streptococcal species as well as nonstreptococcal species, such as Neisseria sicca/N. flava/N. mucosa and Rothia dentocariosa. Given that no notable variations in the microbiota composition were associated with the dental caries status, the maturation process, rather than the specific bacterial species that are the initial colonizers, is likely to play an important role in the development of dysbiotic microbiota associated with dental caries. Copyright © 2019 Ihara et al.
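The selection rule described in this abstract (taxa whose maximum relative abundance across individuals exceeds 10%) can be illustrated with a minimal Python sketch. The function names and the toy read counts are assumptions made for illustration only; they are not the authors' pipeline or data.

```python
from typing import Dict, List


def relative_abundance(counts: Dict[str, int]) -> Dict[str, float]:
    """Convert raw read counts for one sample into fractions of the total."""
    total = sum(counts.values())
    if total == 0:
        return {taxon: 0.0 for taxon in counts}
    return {taxon: n / total for taxon, n in counts.items()}


def predominant_taxa(samples: List[Dict[str, int]], threshold: float = 0.10) -> List[str]:
    """Return taxa whose maximum relative abundance over all samples exceeds threshold."""
    max_abundance: Dict[str, float] = {}
    for counts in samples:
        for taxon, fraction in relative_abundance(counts).items():
            max_abundance[taxon] = max(max_abundance.get(taxon, 0.0), fraction)
    return sorted(t for t, m in max_abundance.items() if m > threshold)


# Toy read counts for two hypothetical individuals (not data from the study).
samples = [
    {"Streptococcus mitis": 80, "Neisseria sicca": 15, "Rothia dentocariosa": 5},
    {"Streptococcus mitis": 30, "Neisseria sicca": 65, "Rothia dentocariosa": 5},
]
print(predominant_taxa(samples))
```

In this toy input, Rothia dentocariosa never exceeds 5% in any sample, so it is excluded, while the other two taxa pass the 10% cutoff in at least one individual.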
Harnessing long-read amplicon sequencing to uncover NRPS and Type I PKS gene sequence diversity in polar desert soils.
The severity of environmental conditions at Earth’s frigid zones presents attractive opportunities for microbial biomining due to their heightened potential as reservoirs for novel secondary metabolites. Arid soil microbiomes within the Antarctic and Arctic circles are remarkably rich in Actinobacteria and Proteobacteria, bacterial phyla known to be prolific producers of natural products. Yet the diversity of secondary metabolite genes within these cold, extreme environments remains largely unknown. Here, we employed amplicon sequencing using PacBio RS II, a third generation long-read platform, to survey over 200 soils spanning twelve east Antarctic and high Arctic sites for natural product-encoding genes, specifically targeting non-ribosomal peptides (NRPS) and Type I polyketides (PKS). NRPS-encoding genes were more widespread across the Antarctic, whereas PKS genes were only recoverable from a handful of sites. Many recovered sequences were deemed novel due to their low amino acid sequence similarity to known protein sequences, particularly throughout the east Antarctic sites. Phylogenetic analysis revealed that a high proportion were most similar to antifungal and biosurfactant-type clusters. Multivariate analysis showed that soil fertility factors of carbon, nitrogen and moisture displayed significant negative relationships with natural product gene richness. Our combined results suggest that secondary metabolite production is likely to be an important physiological component of survival for microorganisms inhabiting arid, nutrient-starved soils. © FEMS 2019.
Full-length 16S rRNA gene classification of Atlantic salmon bacteria and effects of using different 16S variable regions on community structure analysis.
Understanding fish-microbial relationships may be of great value for fish producers as fish growth, development and welfare are influenced by the microbial community associated with the rearing systems and fish surfaces. Accurate methods to generate and analyze these microbial communities would be an important tool to help improve understanding of microbial effects in the industry. In this study, we performed taxonomic classification and determination of operational taxonomic units on Atlantic salmon microbiota by taking advantage of full-length 16S rRNA gene sequences. Skin mucus was dominated by the genera Flavobacterium and Psychrobacter. Intestinal samples were dominated by the genera Carnobacterium, Aeromonas, Mycoplasma and by sequences assigned to the order Clostridiales. Applying Sanger sequencing on the full-length bacterial 16S rRNA gene from the pool of 46 isolates obtained in this study showed a clear assignment of the PacBio full-length bacterial 16S rRNA gene sequences down to the genus level. One of the bottlenecks in comparing microbial profiles is that different studies use different 16S rRNA gene regions. Comparisons of sequence assignments between full-length and in silico derived variable 16S rRNA gene regions showed different microbial profiles with variable effects between phylogenetic groups and taxonomic ranks. © 2019 The Authors. MicrobiologyOpen published by John Wiley & Sons Ltd.
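Deriving a variable region in silico from a full-length 16S rRNA gene sequence amounts to locating a primer pair and keeping the sequence between the two sites. The Python sketch below shows one way to do this with degenerate (IUPAC) primers; the primers and the sequence are toy values chosen for illustration, and the abstract does not specify which primers or tools the authors used.

```python
import re
from typing import Optional

# IUPAC nucleotide codes expanded to regular-expression character classes.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "[AG]", "Y": "[CT]", "S": "[CG]", "W": "[AT]",
    "K": "[GT]", "M": "[AC]", "B": "[CGT]", "D": "[AGT]",
    "H": "[ACT]", "V": "[ACG]", "N": "[ACGT]",
}
COMPLEMENT = str.maketrans("ACGTRYSWKMBDHVN", "TGCAYRSWMKVHDBN")


def reverse_complement(seq: str) -> str:
    """Reverse-complement a sequence, including degenerate bases."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def primer_regex(primer: str) -> str:
    """Build a regex that matches every sequence a degenerate primer encodes."""
    return "".join(IUPAC[base] for base in primer.upper())


def extract_region(full_length: str, forward: str, reverse: str) -> Optional[str]:
    """Return the sequence between the primer sites, or None if either is missing."""
    seq = full_length.upper()
    fwd = re.search(primer_regex(forward), seq)
    if fwd is None:
        return None
    downstream = seq[fwd.end():]
    # The reverse primer anneals to the opposite strand, so search for its
    # reverse complement on the strand being scanned.
    rev = re.search(primer_regex(reverse_complement(reverse)), downstream)
    if rev is None:
        return None
    return downstream[:rev.start()]


# Hypothetical toy primers and sequence, not real 16S primers.
print(extract_region("GGACGTCTTTTGGGGCGCAACC", "ACGTY", "TTGCR"))
```

The sketch requires exact matches to the degenerate primers; real in silico PCR tools usually tolerate a small number of mismatches, which is omitted here for brevity.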
Long-read next-generation amplicon sequencing shows promise for studying complete genes or genomes from complex and diverse populations. Current long-read sequencing technologies have challenging error profiles, hindering data processing and incorporation into downstream analyses. Here we consider the problem of how to reconstruct, free of sequencing error, the true sequence variants and their associated frequencies from PacBio reads. Called ‘amplicon denoising’, this problem has been extensively studied for short-read sequencing technologies, but current solutions do not always successfully generalize to long reads with high indel error rates. We introduce two methods: one that runs nearly instantly and is very accurate for medium length reads and high template coverage, and another, slower method that is more robust when reads are very long or coverage is lower. On two Mock Virus Community datasets with ground truth, each sequenced on a different PacBio instrument, and on a number of simulated datasets, we compare our two approaches to each other and to existing algorithms. We outperform all tested methods in accuracy, with competitive run times even for our slower method, successfully discriminating templates that differ by just a single nucleotide. Julia implementations of Fast Amplicon Denoising (FAD) and Robust Amplicon Denoising (RAD), and a webserver interface, are freely available. © The Author(s) 2019. Published by Oxford University Press on behalf of Nucleic Acids Research.
High-throughput amplicon sequencing of the full-length 16S rRNA gene with single-nucleotide resolution.
Targeted PCR amplification and high-throughput sequencing (amplicon sequencing) of 16S rRNA gene fragments is widely used to profile microbial communities. New long-read sequencing technologies can sequence the entire 16S rRNA gene, but higher error rates have limited their attractiveness when accuracy is important. Here we present a high-throughput amplicon sequencing methodology based on PacBio circular consensus sequencing and the DADA2 sample inference method that measures the full-length 16S rRNA gene with single-nucleotide resolution and a near-zero error rate. In two artificial communities of known composition, our method recovered the full complement of full-length 16S sequence variants from expected community members without residual errors. The measured abundances of intra-genomic sequence variants were in the integral ratios expected from the genuine allelic variants within a genome. The full-length 16S gene sequences recovered by our approach allowed Escherichia coli strains to be correctly classified to the O157:H7 and K12 sub-species clades. In human fecal samples, our method showed strong technical replication and was able to recover the full complement of 16S rRNA alleles in several E. coli strains. There are likely many applications beyond microbial profiling for which high-throughput amplicon sequencing of complete genes with single-nucleotide resolution will be of use. © The Author(s) 2019. Published by Oxford University Press on behalf of Nucleic Acids Research.
Information about variations in multiple copies of bacterial 16S rRNA genes may aid in species identification.
Variable region analysis of 16S rRNA gene sequences is the most common tool in bacterial taxonomic studies. Although used for distinguishing bacterial species, its use remains limited due to the presence of variable copy numbers with sequence variation in the genomes. In this study, 16S rRNA gene sequences, obtained from completely assembled whole genome and Sanger electrophoresis sequencing of cloned PCR products from Serratia fonticola GS2, were compared. Sanger sequencing produced a combination of sequences from multiple copies of 16S rRNA genes. To determine whether the variant copies of 16S rRNA genes affected Sanger sequencing, two ratios (5:5 and 8:2) with different concentrations of cloned 16S rRNA genes were used; it was observed that the greater the number of copies with similar sequences the higher its chance of amplification. Effect of multiple copies for taxonomic classification of 16S rRNA gene sequences was investigated using the strain GS2 as a model. 16S rRNA copies with the maximum variation had 99.42% minimum pairwise similarity and this did not have an effect on species identification. Thus, PCR products from genomes containing variable 16S rRNA gene copies can provide sufficient information for species identification except from species which have high similarity of sequences in their 16S rRNA gene copies like the case of Bacillus thuringiensis and Bacillus cereus. In silico analysis of 1,616 bacterial genomes from long-read sequencing was also done. The average minimum pairwise similarity for each phylum was reported with their average genome size and average “unique copies” of 16S rRNA genes and we found that the phyla Proteobacteria and Firmicutes showed the highest amount of variation in their copies of their 16S rRNA genes. Overall, our results shed light on how the variations in the multiple copies of the 16S rRNA genes of bacteria can aid in appropriate species identification.
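The "minimum pairwise similarity" reported for the 16S rRNA gene copies of a genome can be illustrated as the lowest percent identity over all pairs of copies. The Python sketch below assumes the copies have already been aligned to equal length (with "-" for gaps) and counts a gap opposite a base as a mismatch; both choices are assumptions, since the abstract does not state how similarity was computed, and the sequences are toy values.

```python
from itertools import combinations
from typing import Sequence


def pairwise_identity(a: str, b: str) -> float:
    """Percent identity between two aligned sequences of equal length."""
    if len(a) != len(b):
        raise ValueError("sequences must be aligned to the same length")
    compared = 0
    matches = 0
    for x, y in zip(a.upper(), b.upper()):
        if x == "-" and y == "-":
            continue  # skip columns that are gaps in both sequences
        compared += 1
        if x == y:
            matches += 1
    return 100.0 * matches / compared if compared else 0.0


def min_pairwise_similarity(copies: Sequence[str]) -> float:
    """Lowest percent identity among all pairs of aligned gene copies."""
    if len(copies) < 2:
        raise ValueError("need at least two copies to compare")
    return min(pairwise_identity(a, b) for a, b in combinations(copies, 2))


# Toy aligned 10-bp "copies" (hypothetical, far shorter than a real 16S gene).
copies = ["ACGTACGTAC", "ACGTACGTAC", "ACGTTCGTAC", "ACGAACGTAC"]
print(min_pairwise_similarity(copies))
```

Here the third and fourth copies differ at two of ten positions, so the minimum pairwise similarity is 80%; for a real genome the same calculation would be run on aligned full-length copies of roughly 1,500 bp.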
Conventional culture methods with commercially available media unveil the presence of novel culturable bacteria.
Recent metagenomic analysis has revealed that our gut microbiota plays an important role in not only the maintenance of our health but also various diseases such as obesity, diabetes, inflammatory bowel disease, and allergy. However, most intestinal bacteria are considered ‘unculturable’ bacteria, and their functions remain unknown. Although culture-independent genomic approaches have enabled us to gain insight into their potential roles, culture-based approaches are still required to understand their characteristic features and phenotypes. To date, various culturing methods have been attempted to obtain these ‘unculturable’ bacteria, but most such methods require advanced techniques. Here, we have tried to isolate possible unculturable bacteria from a healthy Japanese individual by using commercially available media. A 16S rRNA (ribosomal RNA) gene metagenomic analysis revealed that each culture medium showed bacterial growth depending on its selective features and a possibility of the presence of novel bacterial species. Whole genome sequencing of these candidate strains suggested the isolation of 8 novel bacterial species classified in the Actinobacteria and Firmicutes phyla. Our approach indicates that a number of intestinal bacteria hitherto considered unculturable are potentially culturable and can be cultured on commercially available media. We have obtained novel gut bacteria from a healthy Japanese individual using a combination of comprehensive genomics and conventional culturing methods. We would expect that the discovery of such novel bacteria could illuminate pivotal roles for the gut microbiota in association with human health.
Newly designed 16S rRNA metabarcoding primers amplify diverse and novel archaeal taxa from the environment.
High-throughput studies of microbial communities suggest that Archaea are a widespread component of microbial diversity in various ecosystems. However, proper quantification of archaeal diversity and community ecology remains limited, as sequence coverage of Archaea is usually low owing to the inability of available prokaryotic primers to efficiently amplify archaeal compared to bacterial rRNA genes. To improve identification and quantification of Archaea, we designed and validated the utility of several primer pairs to efficiently amplify archaeal 16S rRNA genes based on up-to-date reference genes. We demonstrate that several of these primer pairs amplify phylogenetically diverse Archaea with high sequencing coverage, outperforming commonly used primers. Based on comparing the resulting long 16S rRNA gene fragments with public databases from all habitats, we found several novel family- to phylum-level archaeal taxa from topsoil and surface water. Our results suggest that archaeal diversity has been largely overlooked due to the limitations of available primers, and that improved primer pairs enable more accurate estimation of archaeal diversity. © 2018 The Authors. Environmental Microbiology Reports published by Society for Applied Microbiology and John Wiley & Sons Ltd.
Investigating the bacterial microbiota of traditional fermented dairy products using propidium monoazide with single-molecule real-time sequencing.
Traditional fermented dairy foods have been the major components of the Mongolian diet for millennia. In this study, we used propidium monoazide (PMA; binds to DNA of nonviable cells so that only viable cells are enumerated) and single-molecule real-time sequencing (SMRT) technology to investigate the total and viable bacterial compositions of 19 traditional fermented dairy foods, including koumiss from Inner Mongolia (KIM), koumiss from Mongolia (KM), and fermented cow milk from Mongolia (CM); sample groups treated with PMA were designated PKIM, PKM, and PCM. Full-length 16S rRNA sequencing identified 195 bacterial species in 121 genera and 13 phyla in PMA-treated and untreated samples. The PMA-treated and untreated samples differed significantly in their bacterial community composition and α-diversity values. The predominant species in KM, KIM, and CM were Lactobacillus helveticus, Streptococcus parauberis, and Lactobacillus delbrueckii, whereas the predominant species in PKM, PKIM, and PCM were Enterobacter xiangfangensis, Lactobacillus helveticus, and E. xiangfangensis, respectively. Weighted and unweighted principal coordinate analyses showed a clear clustering pattern with good separation and only minor overlapping. In addition, a pure culture method was performed to obtain lactic acid bacteria resources in dairy samples according to the results of SMRT sequencing. A total of 102 LAB strains were identified and Lb. helveticus (68.63%) was the most abundant, in agreement with SMRT sequencing results. Our results revealed that the bacterial communities of traditional dairy foods are complex and vary by type of fermented dairy product. The PMA treatment induced significant changes in bacterial community structure. Copyright © 2019 American Dairy Science Association. Published by Elsevier Inc. All rights reserved.
Confident phylogenetic identification of uncultured prokaryotes through long read amplicon sequencing of the 16S-ITS-23S rRNA operon.
Amplicon sequencing of the 16S rRNA gene is the predominant method to quantify microbial compositions and to discover novel lineages. However, traditional short amplicons often do not contain enough information to confidently resolve their phylogeny. Here we present a cost-effective protocol that amplifies a large part of the rRNA operon and sequences the amplicons with PacBio technology. We tested our method on a mock community and developed a read-curation pipeline that reduces the overall read error rate to 0.18%. Applying our method on four environmental samples, we captured near full-length rRNA operon amplicons from a large diversity of prokaryotes. The method operated at moderately high-throughput (22,286-37,850 raw ccs reads) and generated a large amount of putative novel archaeal 23S rRNA gene sequences compared to the archaeal SILVA database. These long amplicons allowed for higher resolution during taxonomic classification by means of long (~1000 bp) 16S rRNA gene fragments and for substantially more confident phylogenies by means of combined near full-length 16S and 23S rRNA gene sequences, compared to shorter traditional amplicons (250 bp of the 16S rRNA gene). We recommend our method to those who wish to cost-effectively and confidently estimate the phylogenetic diversity of prokaryotes in environmental samples at high throughput. © 2019 The Authors. Environmental Microbiology published by Society for Applied Microbiology and John Wiley & Sons Ltd.
We investigated inflow of a wastewater treatment plant and sediment of an urban lake for the presence of Clostridioides difficile by cultivation and PCR. Among seven colonies we sequenced the complete genomes of three: two non-toxigenic isolates from wastewater and one toxigenic isolate from the urban lake. For all obtained isolates, a close genomic relationship with human-derived isolates was observed. Copyright © 2019 Elsevier Ltd. All rights reserved.
PacBio sequencing reveals bacterial community diversity in cheeses collected from different regions.
Cheese is a fermented dairy product that is popular for its unique flavor and nutritional value. Recent studies have shown that microorganisms in cheese play an important role in the fermentation process and determine the quality of the cheese. We collected 12 cheese samples from different regions and studied the composition of their bacterial communities using PacBio single-molecule real-time sequencing (Pacific Biosciences, Menlo Park, CA). Our data revealed 144 bacterial genera (including Lactobacillus, Streptococcus, Lactococcus, and Staphylococcus) and 217 bacterial species (including Lactococcus lactis, Streptococcus thermophilus, Staphylococcus equorum, and Streptococcus uberis). We investigated the flavor quality of the cheese samples using an electronic nose system and we found differences in flavor-quality indices among samples from different regions. We found a clustering tendency based on flavor quality using principal component analysis. We found correlations between lactic acid bacteria and the flavor quality of the cheese samples. Biodegradation and metabolism of xenobiotics, and lipid-metabolism-related pathways, were predicted to contribute to differences in cheese flavor using Phylogenetic Investigation of Communities by Reconstruction of Unobserved States (PICRUSt). This preliminary study explored the bacterial communities in cheeses collected from different regions and their potential genome functions from the perspective of flavor quality. Copyright © 2020 American Dairy Science Association. Published by Elsevier Inc. All rights reserved.
Effect of sulfur-iron modified biochar on the available cadmium and bacterial community structure in contaminated soils.
Cadmium contamination in paddy soils has aroused increasing concern around the world, and biochar has many properties that favor heavy metal immobilization in soils, such as a large specific surface area and a microporous structure. However, there are few studies on sulfur-iron modified biochar or on its microbiological effects. The purpose of this study was to evaluate the Cd immobilization effects of sulfur or sulfur-iron modified biochar and its related microbial community changes in Cd-contaminated soils. SEM-EDX analysis confirmed that sulfur and iron were loaded on the raw biochar successfully. Sulfur-modified biochar (S-BC) and sulfur-iron modified biochar (SF-BC) addition increased pH value and the content of soil organic matter, and also decreased DTPA-extractable Cd. There was a significant negative correlation between organic matter content and the available Cd (P < 0.05). During a 45-d incubation period, Cd was mainly present in the exchangeable (25.16-35.79%) and carbonate (22.01-25.10%) fractions. Compared with the control, the concentrations of exchangeable Cd in soil were significantly (P < 0.05) decreased by 12.54%, 29.71%, and 18.53% under the treatments of BC, S-BC, and SF-BC, respectively. The S-BC and SF-BC treatments significantly (P < 0.05) increased Chao1, observed, Shannon and Simpson diversity indices compared with the control and biochar treatments. Meanwhile, the relative abundance of Proteobacteria, Bacteroidetes, and Actinobacteria increased, whereas the abundance of Acidobacteria and Gemmatimonadetes decreased. Capsule: Sulfur-modified and sulfur-iron modified biochar applications decreased the available Cd and changed the microbial community. Copyright © 2018 Elsevier B.V. All rights reserved.
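The four diversity indices named in this abstract (observed richness, Chao1, Shannon and Simpson) can be computed from a vector of per-taxon read counts. The Python sketch below uses the common textbook definitions, with Simpson expressed as 1 minus the sum of squared proportions and Shannon using the natural logarithm; the abstract does not state which variants or software the authors used, so these choices and the toy counts are assumptions.

```python
import math
from typing import Sequence


def observed_richness(counts: Sequence[int]) -> int:
    """Number of taxa with at least one read."""
    return sum(1 for c in counts if c > 0)


def shannon(counts: Sequence[int]) -> float:
    """Shannon index H' = -sum(p * ln p) over taxa with non-zero counts."""
    total = sum(counts)
    return -sum((c / total) * math.log(c / total) for c in counts if c > 0)


def simpson(counts: Sequence[int]) -> float:
    """Simpson diversity expressed as 1 - sum(p^2)."""
    total = sum(counts)
    return 1.0 - sum((c / total) ** 2 for c in counts)


def chao1(counts: Sequence[int]) -> float:
    """Chao1 richness estimate from singleton (F1) and doubleton (F2) counts."""
    s_obs = observed_richness(counts)
    f1 = sum(1 for c in counts if c == 1)
    f2 = sum(1 for c in counts if c == 2)
    if f2 > 0:
        return s_obs + f1 * f1 / (2.0 * f2)
    # Bias-corrected form used when no doubletons are present.
    return s_obs + f1 * (f1 - 1) / 2.0


# Toy counts for one hypothetical sample (not data from the study).
counts = [10, 10, 1, 1, 2, 0]
print(observed_richness(counts), chao1(counts), shannon(counts), simpson(counts))
```

With two singletons and one doubleton among five observed taxa, Chao1 for the toy sample is 5 + 2²/(2·1) = 7, meaning the estimator predicts two unseen taxa.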