Menu
June 1, 2021  |  

Comparison of sequencing approaches applied to complex soil metagenomes to resolve proteins of interest

Background: Long-read sequencing presents several potential advantages for providing more complete gene profiling of metagenomic samples. Long reads can capture multiple genes in a single read, and longer reads typically result in assemblies with better contiguity, especially for higher abundance organisms. However, a major challenge with using long reads has been the higher cost per base, which may lead to insufficient coverage of low-abundance species. Additionally, lower single-pass accuracy can make gene discovery for low-abundance organisms difficult. Methods: To evaluate the pros and cons of long reads for metagenomics, we directly compared PacBio and Illumina sequencing on a soil-derived sample, which included spike-in controls of known concentrations of pure referenced samples. For PacBio sequencing, a 10 kb library was sequenced on the Sequel System with 3.0 chemistry. Highly accurate long reads (HiFi reads) with Q20 and higher were generated for downstream analyses using PacBio Circular Consensus Sequencing (CCS) mode. Results were assessed according to the following criteria: DNA extraction capacity, bioinformatics pipeline status, % of proteins with ambiguous AA’s, total unique error-free genes/$1000, total proteins observed in spike-ins/$1000, proteins of interest/$1000, median length of contigs with proteins, and assembly requirements. Results: Both methods had areas of superior performance. DNA extraction capacity was higher for Illumina, the bioinformatics pipeline is well-tested, and there was a lower proportion of proteins with ambiguous AA’s. On the other hand, with PacBio, twice as many unique error-free genes, twice as many total proteins from spike-ins, and ~6 times more proteins of interest were found per $1000 cost. PacBio data produced on average 5 times longer contigs capturing proteins of interest. Additionally, assembly was not required for gene or protein finding, as was the case with Illumina data. Conclusions: In this comparison of PacBio Sequel System with Illumina NextSeq on a complex microbiome, we conclude that the sequencing system of choice may vary, depending on the goals and resources for the project. PacBio sequencing requires a longer DNA extraction method, and the bioinformatics pipeline may require development. On the other hand, the Sequel System generates hundreds of thousands of long HiFi reads per SMRT Cell, producing more genes, more proteins, and longer contigs, thereby offering more information about the metagenomic samples for a lower cost.


September 22, 2019  |  

Complete genome sequence of Paenibacillus polymyxa YC0136, a plant growth–promoting rhizobacterium isolated from tobacco rhizosphere.

Paenibacillus polymyxa strain YC0136 is a plant growth-promoting rhizobacterium with antimicrobial activity, which was isolated from tobacco rhizosphere. Here, we report the complete genome sequence of P. polymyxa YC0136. Several genes with antifungal and antibacterial activity were discovered. Copyright © 2017 Liu et al.


September 22, 2019  |  

Long-read, Single Molecule, Real-Time (SMRT) DNA Sequencing for metagenomic applications

In this chapter, we describe applications of single molecule, real-time (SMRT) DNA sequencing toward metagenomic research. The long sequence reads, combined with a lack of bias with respect to DNA sequence context or GC content, facilitate a more comprehensive analysis of the genomic constitution of microbial communities. Full-length 16S RNA gene sequencing at high (>99%) accuracy allows for species-level characterization of community members concomitant with the determination of community structure. The application of SMRT sequencing to whole-community shotgun microbial metagenomics has also been discussed.


September 22, 2019  |  

Soil microbial communities are shaped by plant-driven changes in resource availability during secondary succession.

Although we understand the ecological processes eliciting changes in plant community composition during secondary succession, we do not understand whether co-occurring changes in plant detritus shape saprotrophic microbial communities in soil. In this study, we investigated soil microbial composition and function across an old-field chronosequence ranging from 16 to 86 years following agricultural abandonment, as well as three forests representing potential late-successional ecosystems. Fungal and bacterial community composition was quantified from ribosomal DNA, and insight into the functional potential of the microbial community to decay plant litter was gained from shotgun metagenomics and extracellular enzyme assays. Accumulation of soil organic matter across the chronosequence exerted a positive and significant effect on fungal phylogenetic ß-diversity and the activity of extracellular enzymes with lignocellulolytic activity. In addition, the increasing abundance of lignin-rich C4 grasses was positively related to the composition of fungal genes with lignocellulolytic function, thereby linking plant community composition, litter biochemistry, and microbial community function. However, edaphic properties were the primary agent shaping bacterial communities, as bacterial ß-diversity and variation in functional gene composition displayed a significant and positive relationship to soil pH across the chronosequence. The late-successional forests were compositionally distinct from the oldest old fields, indicating that substantial changes occur in soil microbial communities as old fields give way to forests. Taken together, our observations demonstrate that plants govern the turnover of soil fungal communities and functional characteristics during secondary succession, due to the continual input of detritus and differences in litter biochemistry among plant species.


September 22, 2019  |  

Atmospheric N deposition increases bacterial laccase-like multicopper oxidases: implications for organic matter decay.

Anthropogenic release of biologically available nitrogen (N) has increased dramatically over the last 150 years, which can alter the processes controlling carbon (C) storage in terrestrial ecosystems. In a northern hardwood forest ecosystem located in Michigan in the United States, nearly 20 years of experimentally increased atmospheric N deposition has reduced forest floor decay and increased soil C storage. This change occurred concomitantly with compositional changes in Basidiomycete fungi and in Actinobacteria, as well as the downregulation of fungal lignocelluloytic genes. Recently, laccase-like multicopper oxidases (LMCOs) have been discovered among bacteria which can oxidize ß-O-4 linkages in phenolic compounds (e.g., lignin and humic compounds), resulting in the production of dissolved organic carbon (DOC). Here, we examined how nearly 2 decades of experimental N deposition has affected the abundance and composition of saprotrophic bacteria possessing LMCO genes. In our experiment, LMCO genes were more abundant in the forest floor under experimental N deposition whereas the abundances of bacteria and fungi were unchanged. Experimental N deposition also led to less-diverse, significantly altered bacterial and LMCO gene assemblages, with taxa implicated in organic matter decay (i.e., Actinobacteria, Proteobacteria) accounting for the majority of compositional changes. These results suggest that experimental N deposition favors bacteria in the forest floor that harbor the LMCO gene and represents a plausible mechanism by which anthropogenic N deposition has reduced decomposition, increased soil C storage, and accelerated phenolic DOC production in our field experiment. Our observations suggest that future rates of atmospheric N deposition could fundamentally alter the physiological potential of soil microbial communities. Copyright © 2014, American Society for Microbiology. All Rights Reserved.


September 22, 2019  |  

Soil bacterial communities are shaped by temporal and environmental filtering: evidence from a long-term chronosequence.

Soil microbial communities are abundant, hyper-diverse and mediate global biogeochemical cycles, but we do not yet understand the processes mediating their assembly. Current hypothetical frameworks suggest temporal (e.g. dispersal limitation) and environmental (e.g. soil pH) filters shape microbial community composition; however, there is limited empirical evidence supporting this framework in the hyper-diverse soil environment, particularly at large spatial (i.e. regional to continental) and temporal (i.e. 100 to 1000 years) scales. Here, we present evidence from a long-term chronosequence (4000 years) that temporal and environmental filters do indeed shape soil bacterial community composition. Furthermore, nearly 20 years of environmental monitoring allowed us to control for potentially confounding environmental variation. Soil bacterial communities were phylogenetically distinct across the chronosequence. We determined that temporal and environmental factors accounted for significant portions of bacterial phylogenetic structure using distance-based linear models. Environmental factors together accounted for the majority of phylogenetic structure, namely, soil temperature (19%), pH (17%) and litter carbon:nitrogen (C:N; 17%). However, of all individual factors, time since deglaciation accounted for the greatest proportion of bacterial phylogenetic structure (20%). Taken together, our results provide empirical evidence that temporal and environmental filters act together to structure soil bacterial communities across large spatial and long-term temporal scales. © 2015 Society for Applied Microbiology and John Wiley & Sons Ltd.


September 22, 2019  |  

Genome sequence determination and metagenomic characterization of a Dehalococcoides mixed culture grown on cis-1,2-dichloroethene.

A Dehalococcoides-containing bacterial consortium that performed dechlorination of 0.20 mM cis-1,2-dichloroethene to ethene in 14 days was obtained from the sediment mud of the lotus field. To obtain detailed information of the consortium, the metagenome was analyzed using the short-read next-generation sequencer SOLiD 3. Matching the obtained sequence tags with the reference genome sequences indicated that the Dehalococcoides sp. in the consortium was highly homologous to Dehalococcoides mccartyi CBDB1 and BAV1. Sequence comparison with the reference sequence constructed from 16S rRNA gene sequences in a public database showed the presence of Sedimentibacter, Sulfurospirillum, Clostridium, Desulfovibrio, Parabacteroides, Alistipes, Eubacterium, Peptostreptococcus and Proteocatella in addition to Dehalococcoides sp. After further enrichment, the members of the consortium were narrowed down to almost three species. Finally, the full-length circular genome sequence of the Dehalococcoides sp. in the consortium, D. mccartyi IBARAKI, was determined by analyzing the metagenome with the single-molecule DNA sequencer PacBio RS. The accuracy of the sequence was confirmed by matching it to the tag sequences obtained by SOLiD 3. The genome is 1,451,062 nt and the number of CDS is 1566, which includes 3 rRNA genes and 47 tRNA genes. There exist twenty-eight RDase genes that are accompanied by the genes for anchor proteins. The genome exhibits significant sequence identity with other Dehalococcoides spp. throughout the genome, but there exists significant difference in the distribution RDase genes. The combination of a short-read next-generation DNA sequencer and a long-read single-molecule DNA sequencer gives detailed information of a bacterial consortium. Copyright © 2014 The Society for Biotechnology, Japan. Published by Elsevier B.V. All rights reserved.


July 7, 2019  |  

Finished genome sequence of Collimonas arenae Cal35.

We announce the finished genome sequence of soil forest isolate Collimonas arenae Cal35, which comprises a 5.6-Mbp chromosome and 41-kb plasmid. The Cal35 genome is the second one published for the bacterial genus Collimonas and represents the first opportunity for high-resolution comparison of genome content and synteny among collimonads. Copyright © 2015 Wu et al.


July 7, 2019  |  

Complete genome sequence of Paenibacillus polymyxa strain Sb3-1, a soilborne bacterium with antagonistic activity toward plant pathogens.

The genome of Paenibacillus polymyxa Sb3-1, a strain that shows antagonistic activities against pathogenic fungi and bacteria, consists of one 5.6-Mb circular chromosome and two plasmids of 223 kb and 8 kb. The genome reveals several genes that potentially contribute to its antagonistic and plant growth promotion activity. Copyright © 2015 Rybakova et al.


July 7, 2019  |  

Genome sequence of Kosakonia radicincitans strain YD4, a plant growth-promoting rhizobacterium isolated from yerba mate (Ilex paraguariensis St. Hill.).

Kosakonia radicincitans strain YD4 is a rhizospheric isolate from yerba mate (Ilex paraguariensis St. Hill.) with plant growth-promoting effects on this crop. Genes involved in different plant growth-promoting activities are present in this genome, suggesting its potential as a bioinoculant for yerba mate. Copyright © 2015 Bergottini et al.


July 7, 2019  |  

Complete genome sequence of oxalate-degrading bacterium Pandoraea vervacti DSM 23571(T).

Pandoraea vervacti DSM 23571(T) is an oxalate metabolizing bacterium isolated from an uncultivated field soil in Mugla, Turkey. Here, we present the first complete genome sequence of P. vervacti DSM 23571(T). A complete pathway for degradation of oxalate was revealed from the genome analysis. These data are important to path new opportunities for genetic engineering in the field of biotechnology. Copyright © 2015 Elsevier B.V. All rights reserved.


July 7, 2019  |  

Complete genome sequence of Paenibacillus beijingensis 7188(T) (=DSM 24997(T)), a novel rhizobacterium from jujube garden soil.

We present here the complete genome sequence of a novel species Paenibacillus beijingensis 7188(T) (=DSM 24997(T)) from jujube rhizosphere soil that consists of one circular chromosome of 5,749,967bp with a GC content of 52.5%. On the significance of first genome information in this species, the genome sequence of strain 7188(T) will provide a better comprehension of Paenibacillus species for the practical uses as a biofertilizer in agriculture. Copyright © 2015 Elsevier B.V. All rights reserved.


July 7, 2019  |  

Complete genome sequence of Corynebacterium glutamicum B253, a Chinese lysine-producing strain.

We disclosed the complete genome sequence of Corynebacterium glutamicum B253, an important lysine-producing strain in China. The genome consists a circular chromosome (3,159,203bp) and a plasmid (24,775bp), encoding 2767 protein coding genes in total. The genome contains all genes for lysine biosynthesis, and some mutations potentially relevant to lysine production were detected in comparison with sequence of other C. glutamicum. Copyright © 2015 Elsevier B.V. All rights reserved.


July 7, 2019  |  

Draft genome sequence of Streptacidiphilus oryzae TH49T, an acidophilic actinobacterium isolated from soil.

The draft genome sequence of Streptacidiphilus oryzae strain TH49(T), an acidophilic actinobacterium, was obtained. The draft is composed of six scaffolds totaling 7.8 Mbp, and it contains 6,829 protein-coding genes and 91 RNA genes. Genes related to respiratory nitrate reduction, siderophore production, and biosynthesis of other secondary metabolites were identified. Copyright © 2015 Kim et al.


Talk with an expert

If you have a question, need to check the status of an order, or are interested in purchasing an instrument, we're here to help.