Menu
September 22, 2019

Genomic insights into the acid adaptation of novel methanotrophs enriched from acidic forest soils.

Soil acidification is accelerated by anthropogenic and agricultural activities, which could significantly affect global methane cycles. However, detailed knowledge of the genomic properties of methanotrophs adapted to acidic soils remains scarce. Using metagenomic approaches, we analyzed methane-utilizing communities enriched from acidic forest soils with pH 3 and 4, and recovered near-complete genomes of proteobacterial methanotrophs. Novel methanotroph genomes designated KS32 and KS41, belonging to two representative clades of methanotrophs (Methylocystis of Alphaproteobacteria and Methylobacter of Gammaproteobacteria), were dominant. Comparative genomic analysis revealed diverse systems of membrane transporters for ensuring pH homeostasis and defense against toxic chemicals. Various potassium transporter systems, sodium/proton antiporters, and two copies of proton-translocating F1F0-type ATP synthase genes were identified, which might participate in the key pH homeostasis mechanisms in KS32. In addition, the V-type ATP synthase and urea assimilation genes might be used for pH homeostasis in KS41. Genes involved in the modification of membranes by incorporation of cyclopropane fatty acids and hopanoid lipids might be used for reducing proton influx into cells. The two methanotroph genomes possess genes for elaborate heavy metal efflux pumping systems, possibly owing to increased heavy metal toxicity in acidic conditions. Phylogenies of key genes involved in acid adaptation, methane oxidation, and antiviral defense in KS41 were incongruent with that of 16S rRNA. Thus, the detailed analysis of the genome sequences provides new insights into the ecology of methanotrophs responding to soil acidification.


September 22, 2019

Accurate determination of bacterial abundances in human metagenomes using full-length 16S sequencing reads

DNA sequencing of PCR-amplified marker genes, especially but not limited to the 16S rRNA gene, is perhaps the most common approach for profiling microbial communities. Due to technological constraints of commonly available DNA sequencing, these approaches usually take the form of short reads sequenced from a narrow, targeted variable region, with a corresponding loss of taxonomic resolution relative to the full length marker gene. We use Pacific Biosciences single-molecule, real-time circular consensus sequencing to sequence amplicons spanning the entire length of the 16S rRNA gene. However, this sequencing technology suffers from high sequencing error rate that needs to be addressed in order to take full advantage of the longer sequence. Here, we present a method to model the sequencing error process using a generalized pair hidden Markov chain model and estimate bacterial abundances in microbial samples. We demonstrate, with simulated and real data, that our model and its associated estimation procedure are able to give accurate estimates at the species (or subspecies) level, and is more flexible than existing methods like SImple Non-Bayesian TAXonomy (SINTAX).


September 22, 2019

Young genes have distinct gene structure, epigenetic profiles, and transcriptional regulation.

Species-specific, new, or “orphan” genes account for 10%-30% of eukaryotic genomes. Although initially considered to have limited function, an increasing number of orphan genes have been shown to provide important phenotypic innovation. How new genes acquire regulatory sequences for proper temporal and spatial expression is unknown. Orphan gene regulation may rely in part on origination in open chromatin adjacent to preexisting promoters, although this has not yet been assessed by genome-wide analysis of chromatin states. Here, we combine taxon-rich nematode phylogenies with Iso-Seq, RNA-seq, ChIP-seq, and ATAC-seq to identify the gene structure and epigenetic signature of orphan genes in the satellite model nematode Pristionchus pacificus Consistent with previous findings, we find young genes are shorter, contain fewer exons, and are on average less strongly expressed than older genes. However, the subset of orphan genes that are expressed exhibit distinct chromatin states from similarly expressed conserved genes. Orphan gene transcription is determined by a lack of repressive histone modifications, confirming long-held hypotheses that open chromatin is important for new gene formation. Yet orphan gene start sites more closely resemble enhancers defined by H3K4me1, H3K27ac, and ATAC-seq peaks, in contrast to conserved genes that exhibit traditional promoters defined by H3K4me3 and H3K27ac. Although the majority of orphan genes are located on chromosome arms that contain high recombination rates and repressive histone marks, strongly expressed orphan genes are more randomly distributed. Our results support a model of new gene origination by rare integration into open chromatin near enhancers.© 2018 Werner et al.; Published by Cold Spring Harbor Laboratory Press.


September 22, 2019

A community-based culture collection for targeting novel plant growth-promoting bacteria from the sugarcane microbiome.

The soil-plant ecosystem harbors an immense microbial diversity that challenges investigative approaches to study traits underlying plant-microbe association. Studies solely based on culture-dependent techniques have overlooked most microbial diversity. Here we describe the concomitant use of culture-dependent and -independent techniques to target plant-beneficial microbial groups from the sugarcane microbiome. The community-based culture collection (CBC) approach was used to access microbes from roots and stalks. The CBC recovered 399 unique bacteria representing 15.9% of the rhizosphere core microbiome and 61.6-65.3% of the endophytic core microbiomes of stalks. By cross-referencing the CBC (culture-dependent) with the sugarcane microbiome profile (culture-independent), we designed a synthetic community comprised of naturally occurring highly abundant bacterial groups from roots and stalks, most of which has been poorly explored so far. We then used maize as a model to probe the abundance-based synthetic inoculant. We show that when inoculated in maize plants, members of the synthetic community efficiently colonize plant organs, displace the natural microbiota and dominate at 53.9% of the rhizosphere microbial abundance. As a result, inoculated plants increased biomass by 3.4-fold as compared to uninoculated plants. The results demonstrate that abundance-based synthetic inoculants can be successfully applied to recover beneficial plant microbes from plant microbiota.


September 22, 2019

Metabolism of toxic sugars by strains of the bee gut symbiont Gilliamella apicola.

Social bees collect carbohydrate-rich food to support their colonies, and yet, certain carbohydrates present in their diet or produced through the breakdown of pollen are toxic to bees. The gut microbiota of social bees is dominated by a few core bacterial species, including the Gram-negative species Gilliamella apicola We isolated 42 strains of G. apicola from guts of honey bees and bumble bees and sequenced their genomes. All of the G. apicola strains share high 16S rRNA gene similarity, but they vary extensively in gene repertoires related to carbohydrate metabolism. Predicted abilities to utilize different sugars were verified experimentally. Some strains can utilize mannose, arabinose, xylose, or rhamnose (monosaccharides that can cause toxicity in bees) as their sole carbon and energy source. All of the G. apicola strains possess a manO-associated mannose family phosphotransferase system; phylogenetic analyses suggest that this was acquired from Firmicutes through horizontal gene transfer. The metabolism of mannose is specifically dependent on the presence of mannose-6-phosphate isomerase (MPI). Neither growth rates nor the utilization of glucose and fructose are affected in the presence of mannose when the gene encoding MPI is absent from the genome, suggesting that mannose is not taken up by G. apicola strains which harbor the phosphotransferase system but do not encode the MPI. Given their ability to simultaneously utilize glucose, fructose, and mannose, as well as the ability of many strains to break down other potentially toxic carbohydrates, G. apicola bacteria may have key roles in improving dietary tolerances and maintaining the health of their bee hosts.Bees are important pollinators of agricultural plants. Our study documents the ability of Gilliamella apicola, a dominant gut bacterium in honey bees and bumble bees, to utilize several sugars that are harmful to bee hosts. Using genome sequencing and growth assays, we found that the ability to metabolize certain toxic carbohydrates is directly correlated with the presence of their respective degradation pathways, indicating that metabolic potential can be accurately predicted from genomic data in these gut symbionts. Strains vary considerably in their range of utilizable carbohydrates, which likely reflects historical horizontal gene transfer and gene deletion events. Unlike their bee hosts, G. apicola bacteria are not detrimentally affected by growth on mannose-containing medium, even in strains that cannot metabolize this sugar. These results suggest that G. apicola may be an important player in modulating nutrition in the bee gut, with ultimate effects on host health. Copyright © 2016 Zheng et al.


September 22, 2019

Transcriptional fates of human-specific segmental duplications in brain.

Despite the importance of duplicate genes for evolutionary adaptation, accurate gene annotation is often incomplete, incorrect, or lacking in regions of segmental duplication. We developed an approach combining long-read sequencing and hybridization capture to yield full-length transcript information and confidently distinguish between nearly identical genes/paralogs. We used biotinylated probes to enrich for full-length cDNA from duplicated regions, which were then amplified, size-fractionated, and sequenced using single-molecule, long-read sequencing technology, permitting us to distinguish between highly identical genes by virtue of multiple paralogous sequence variants. We examined 19 gene families as expressed in developing and adult human brain, selected for their high sequence identity (average >99%) and overlap with human-specific segmental duplications (SDs). We characterized the transcriptional differences between related paralogs to better understand the birth-death process of duplicate genes and particularly how the process leads to gene innovation. In 48% of the cases, we find that the expressed duplicates have changed substantially from their ancestral models due to novel sites of transcription initiation, splicing, and polyadenylation, as well as fusion transcripts that connect duplication-derived exons with neighboring genes. We detect unannotated open reading frames in genes currently annotated as pseudogenes, while relegating other duplicates to nonfunctional status. Our method significantly improves gene annotation, specifically defining full-length transcripts, isoforms, and open reading frames for new genes in highly identical SDs. The approach will be more broadly applicable to genes in structurally complex regions of other genomes where the duplication process creates novel genes important for adaptive traits.© 2018 Dougherty et al.; Published by Cold Spring Harbor Laboratory Press.


September 22, 2019

Full-length transcriptome of Misgurnus anguillicaudatus provides insights into evolution of genus Misgurnus.

Reconstruction and annotation of transcripts, particularly for a species without reference genome, plays a critical role in gene discovery, investigation of genomic signatures, and genome annotation in the pre-genomic era. This study generated 33,330 full-length transcripts of diploid M. anguillicaudatus using PacBio SMRT Sequencing. A total of 6,918 gene families were identified with two or more isoforms, and 26,683 complete ORFs with an average length of 1,497?bp were detected. Totally, 1,208 high-confidence lncRNAs were identified, and most of these appeared to be precursor transcripts of miRNAs or snoRNAs. Phylogenetic tree of the Misgurnus species was inferred based on the 1,905 single copy orthologous genes. The tetraploid and diploid M. anguillicaudatus grouped into a clade, and M. bipartitus showed a closer relationship with the M. anguillicaudatus. The overall evolutionary rates of tetraploid M. anguillicaudatus were significantly higher than those of other Misgurnus species. Meanwhile, 28 positively selected genes were identified in M. anguillicaudatus clade. These positively selected genes may play critical roles in the adaptation to various habitat environments for M. anguillicaudatus. This study could facilitate further exploration of the genomic signatures of M. anguillicaudatus and provide potential insights into unveiling the evolutionary history of tetraploid loach.


September 22, 2019

Localized electron transfer rates and microelectrode-based enrichment of microbial communities within a phototrophic microbial mat.

Phototrophic microbial mats frequently exhibit sharp, light-dependent redox gradients that regulate microbial respiration on specific electron acceptors as a function of depth. In this work, a benthic phototrophic microbial mat from Hot Lake, a hypersaline, epsomitic lake located near Oroville in north-central Washington, was used to develop a microscale electrochemical method to study local electron transfer processes within the mat. To characterize the physicochemical variables influencing electron transfer, we initially quantified redox potential, pH, and dissolved oxygen gradients by depth in the mat under photic and aphotic conditions. We further demonstrated that power output of a mat fuel cell was light-dependent. To study local electron transfer processes, we deployed a microscale electrode (microelectrode) with tip size ~20 µm. To enrich a subset of microorganisms capable of interacting with the microelectrode, we anodically polarized the microelectrode at depth in the mat. Subsequently, to characterize the microelectrode-associated community and compare it to the neighboring mat community, we performed amplicon sequencing of the V1-V3 region of the 16S gene. Differences in Bray-Curtis beta diversity, illustrated by large changes in relative abundance at the phylum level, suggested successful enrichment of specific mat community members on the microelectrode surface. The microelectrode-associated community exhibited substantially reduced alpha diversity and elevated relative abundances of Prosthecochloris, Loktanella, Catellibacterium, other unclassified members of Rhodobacteraceae, Thiomicrospira, and Limnobacter, compared with the community at an equivalent depth in the mat. Our results suggest that local electron transfer to an anodically polarized microelectrode selected for a specific microbial population, with substantially more abundance and diversity of sulfur-oxidizing phylotypes compared with the neighboring mat community.


September 22, 2019

MEGAN-LR: new algorithms allow accurate binning and easy interactive exploration of metagenomic long reads and contigs.

There are numerous computational tools for taxonomic or functional analysis of microbiome samples, optimized to run on hundreds of millions of short, high quality sequencing reads. Programs such as MEGAN allow the user to interactively navigate these large datasets. Long read sequencing technologies continue to improve and produce increasing numbers of longer reads (of varying lengths in the range of 10k-1M bps, say), but of low quality. There is an increasing interest in using long reads in microbiome sequencing, and there is a need to adapt short read tools to long read datasets.We describe a new LCA-based algorithm for taxonomic binning, and an interval-tree based algorithm for functional binning, that are explicitly designed for long reads and assembled contigs. We provide a new interactive tool for investigating the alignment of long reads against reference sequences. For taxonomic and functional binning, we propose to use LAST to compare long reads against the NCBI-nr protein reference database so as to obtain frame-shift aware alignments, and then to process the results using our new methods.All presented methods are implemented in the open source edition of MEGAN, and we refer to this new extension as MEGAN-LR (MEGAN long read). We evaluate the LAST+MEGAN-LR approach in a simulation study, and on a number of mock community datasets consisting of Nanopore reads, PacBio reads and assembled PacBio reads. We also illustrate the practical application on a Nanopore dataset that we sequenced from an anammox bio-rector community.This article was reviewed by Nicola Segata together with Moreno Zolfo, Pete James Lockhart and Serghei Mangul.This work extends the applicability of the widely-used metagenomic analysis software MEGAN to long reads. Our study suggests that the presented LAST+MEGAN-LR pipeline is sufficiently fast and accurate.


September 22, 2019

Interaction between the microbiome and TP53 in human lung cancer.

Lung cancer is the leading cancer diagnosis worldwide and the number one cause of cancer deaths. Exposure to cigarette smoke, the primary risk factor in lung cancer, reduces epithelial barrier integrity and increases susceptibility to infections. Herein, we hypothesize that somatic mutations together with cigarette smoke generate a dysbiotic microbiota that is associated with lung carcinogenesis. Using lung tissue from 33 controls and 143 cancer cases, we conduct 16S ribosomal RNA (rRNA) bacterial gene sequencing, with RNA-sequencing data from lung cancer cases in The Cancer Genome Atlas serving as the validation cohort.Overall, we demonstrate a lower alpha diversity in normal lung as compared to non-tumor adjacent or tumor tissue. In squamous cell carcinoma specifically, a separate group of taxa are identified, in which Acidovorax is enriched in smokers. Acidovorax temporans is identified within tumor sections by fluorescent in situ hybridization and confirmed by two separate 16S rRNA strategies. Further, these taxa, including Acidovorax, exhibit higher abundance among the subset of squamous cell carcinoma cases with TP53 mutations, an association not seen in adenocarcinomas.The results of this comprehensive study show both microbiome-gene and microbiome-exposure interactions in squamous cell carcinoma lung cancer tissue. Specifically, tumors harboring TP53 mutations, which can impair epithelial function, have a unique bacterial consortium that is higher in relative abundance in smoking-associated tumors of this type. Given the significant need for clinical diagnostic tools in lung cancer, this study may provide novel biomarkers for early detection.


September 22, 2019

Transcriptome analysis of distinct cold tolerance strategies in the rubber tree (Hevea brasiliensis)

Natural rubber is an indispensable commodity used in approximately 40,000 products and is fundamental to the tire industry. Among the species that produce latex, the rubber tree [Hevea brasiliensis (Willd. ex Adr. de Juss.) Muell-Arg.], a species native to the Amazon rainforest, is the major producer of latex used worldwide. The Amazon Basin presents optimal conditions for rubber tree growth, but the occurrence of South American leaf blight, which is caused by the fungus Microcyclus ulei (P. Henn) v. Arx, limits rubber tree production. Currently, rubber tree plantations are located in scape regions that exhibit suboptimal conditions such as high winds and cold temperatures. Rubber tree breeding programs aim to identify clones that are adapted to these stress conditions. However, rubber tree breeding is time-consuming, taking more than 20 years to develop a new variety. It is also expensive and requires large field areas. Thus, genetic studies could optimize field evaluations, thereby reducing the time and area required for these experiments. Transcriptome sequencing using next-generation sequencing (RNA-seq) is a powerful tool to identify a full set of transcripts and for evaluating gene expression in model and non-model species. In this study, we constructed a comprehensive transcriptome to evaluate the cold response strategies of the RRIM600 (cold-resistant) and GT1 (cold-tolerant) genotypes. Furthermore, we identified putative microsatellite (SSR) and single-nucleotide polymorphism (SNP) markers. Alternative splicing, which is an important mechanism for plant adaptation under abiotic stress, was further identified, providing an important database for further studies of cold tolerance.


September 22, 2019

Interpreting microbial biosynthesis in the genomic age: Biological and practical considerations.

Genome mining has become an increasingly powerful, scalable, and economically accessible tool for the study of natural product biosynthesis and drug discovery. However, there remain important biological and practical problems that can complicate or obscure biosynthetic analysis in genomic and metagenomic sequencing projects. Here, we focus on limitations of available technology as well as computational and experimental strategies to overcome them. We review the unique challenges and approaches in the study of symbiotic and uncultured systems, as well as those associated with biosynthetic gene cluster (BGC) assembly and product prediction. Finally, to explore sequencing parameters that affect the recovery and contiguity of large and repetitive BGCs assembled de novo, we simulate Illumina and PacBio sequencing of the Salinispora tropica genome focusing on assembly of the salinilactam (slm) BGC.


September 22, 2019

Plant 24-nt reproductive phasiRNAs from intramolecular duplex mRNAs in diverse monocots.

In grasses, two pathways that generate diverse and numerous 21-nt (premeiotic) and 24-nt (meiotic) phased siRNAs are highly enriched in anthers, the male reproductive organs. These “phasiRNAs” are analogous to mammalian piRNAs, yet their functions and evolutionary origins remain largely unknown. The 24-nt meiotic phasiRNAs have only been described in grasses, wherein their biogenesis is dependent on a specialized Dicer (DCL5). To assess how evolution gave rise to this pathway, we examined reproductive phasiRNA pathways in nongrass monocots: garden asparagus, daylily, and lily. The common ancestors of these species diverged approximately 115-117 million years ago (MYA). We found that premeiotic 21-nt and meiotic 24-nt phasiRNAs were abundant in all three species and displayed spatial localization and temporal dynamics similar to grasses. The miR2275-triggered pathway was also present, yielding 24-nt reproductive phasiRNAs, and thus originated more than 117 MYA. In asparagus, unlike in grasses, these siRNAs are largely derived from inverted repeats (IRs); analyses in lily identified thousands of precursor loci, and many were also predicted to form foldback substrates for Dicer processing. Additionally, reproductive phasiRNAs were present in female reproductive organs and thus may function in both male and female germinal development. These data describe several distinct mechanisms of production for 24-nt meiotic phasiRNAs and provide new insights into the evolution of reproductive phasiRNA pathways in monocots.© 2018 Kakrana et al.; Published by Cold Spring Harbor Laboratory Press.


September 22, 2019

Transcriptome profiling of two ornamental and medicinal papaver herbs.

The Papaver spp. (Papaver rhoeas (Corn poppy) and Papaver nudicaule (Iceland poppy)) genera are ornamental and medicinal plants that are used for the isolation of alkaloid drugs. In this study, we generated 700 Mb of transcriptome sequences with the PacBio platform. They were assembled into 120,926 contigs, and 1185 (82.2%) of the benchmarking universal single-copy orthologs (BUSCO) core genes were completely present in our assembled transcriptome. Furthermore, using 128 Gb of Illumina sequences, the transcript expression was assessed at three stages of Papaver plant development (30, 60, and 90 days), from which we identified 137 differentially expressed transcripts. Furthermore, three co-occurrence heat maps are generated from 51 different plant genomes along with the Papaver transcriptome, i.e., secondary metabolite biosynthesis, isoquinoline alkaloid biosynthesis (BIA) pathway, and cytochrome. Sixty-nine transcripts in the BIA pathway along with 22 different alkaloids (quantified with LC-QTOF-MS/MS) were mapped into the BIA KEGG map (map00950). Finally, we identified 39 full-length cytochrome transcripts and compared them with other genomes. Collectively, this transcriptome data, along with the expression and quantitative metabolite profiles, provides an initial recording of secondary metabolites and their expression related to Papaver plant development. Moreover, these profiles could help to further detail the functional characterization of the various secondary metabolite biosynthesis and Papaver plant development associated problems.


September 22, 2019

Quantitative metaproteomics highlight the metabolic contributions of uncultured phylotypes in a thermophilic anaerobic digester.

In this study, we used multiple meta-omic approaches to characterize the microbial community and the active metabolic pathways of a stable industrial biogas reactor with food waste as the dominant feedstock, operating at thermophilic temperatures (60°C) and elevated levels of free ammonia (367 mg/liter NH3-N). The microbial community was strongly dominated (76% of all 16S rRNA amplicon sequences) by populations closely related to the proteolytic bacterium Coprothermobacter proteolyticus. Multiple Coprothermobacter-affiliated strains were detected, introducing an additional level of complexity seldom explored in biogas studies. Genome reconstructions provided metabolic insight into the microbes that performed biomass deconstruction and fermentation, including the deeply branching phyla Dictyoglomi and Planctomycetes and the candidate phylum “Atribacteria” These biomass degraders were complemented by a synergistic network of microorganisms that convert key fermentation intermediates (fatty acids) via syntrophic interactions with hydrogenotrophic methanogens to ultimately produce methane. Interpretation of the proteomics data also suggested activity of a Methanosaeta phylotype acclimatized to high ammonia levels. In particular, we report multiple novel phylotypes proposed as syntrophic acetate oxidizers, which also exert expression of enzymes needed for both the Wood-Ljungdahl pathway and ß-oxidation of fatty acids to acetyl coenzyme A. Such an arrangement differs from known syntrophic oxidizing bacteria and presents an interesting hypothesis for future studies. Collectively, these findings provide increased insight into active metabolic roles of uncultured phylotypes and presents new synergistic relationships, both of which may contribute to the stability of the biogas reactor.Biogas production through anaerobic digestion of organic waste provides an attractive source of renewable energy and a sustainable waste management strategy. A comprehensive understanding of the microbial community that drives anaerobic digesters is essential to ensure stable and efficient energy production. Here, we characterize the intricate microbial networks and metabolic pathways in a thermophilic biogas reactor. We discuss the impact of frequently encountered microbial populations as well as the metabolism of newly discovered novel phylotypes that seem to play distinct roles within key microbial stages of anaerobic digestion in this stable high-temperature system. In particular, we draft a metabolic scenario whereby multiple uncultured syntrophic acetate-oxidizing bacteria are capable of syntrophically oxidizing acetate as well as longer-chain fatty acids (via the ß-oxidation and Wood-Ljundahl pathways) to hydrogen and carbon dioxide, which methanogens subsequently convert to methane. Copyright © 2016 American Society for Microbiology.


Talk with an expert

If you have a question, need to check the status of an order, or are interested in purchasing an instrument, we're here to help.