September 22, 2019

Long-read DNA metabarcoding of ribosomal RNA in the analysis of fungi from aquatic environments.

DNA metabarcoding is widely used to study prokaryotic and eukaryotic microbial diversity. Technological constraints limit most studies to marker lengths below 600 base pairs (bp). Longer sequencing reads of several thousand bp are now possible with third-generation sequencing. Increased marker lengths provide greater taxonomic resolution and allow for phylogenetic methods of classification, but longer reads may be subject to higher rates of sequencing error and chimera formation. In addition, most bioinformatics tools for DNA metabarcoding were designed for short reads and are therefore unsuitable for long-read data. Here, we used Pacific Biosciences circular consensus sequencing (CCS) to DNA-metabarcode environmental samples using a ca. 4,500 bp marker that included most of the eukaryote SSU and LSU rRNA genes and the complete ITS region. We developed an analysis pipeline that reduced error rates to levels comparable to short-read platforms. Validation using a mock community indicated that our pipeline detected 98% of chimeras de novo. We recovered 947 OTUs from water and sediment samples from a natural lake, 848 of which could be classified to phylum, 397 to genus and 330 to species. By allowing for the simultaneous use of three databases (Unite, SILVA and RDP LSU), long-read DNA metabarcoding provided better taxonomic resolution than any single marker. We foresee the use of long reads enabling the cross-validation of reference sequences and the synthesis of rRNA gene databases. The universal nature of the rRNA operon and our recovery of >100 nonfungal OTUs indicate that long-read DNA metabarcoding holds promise for studies of eukaryotic diversity more broadly. © 2018 John Wiley & Sons Ltd.


September 22, 2019

Single-molecule real-time transcript sequencing facilitates common wheat genome annotation and grain transcriptome research.

The large and complex hexaploid genome has greatly hindered genomics studies of common wheat (Triticum aestivum, AABBDD). Here, we investigated transcripts in developing caryopses of common wheat using the emerging single-molecule real-time (SMRT) sequencing technology PacBio RSII, and assessed the resultant data for improving common wheat genome annotation and grain transcriptome research. We obtained 197,709 full-length non-chimeric (FLNC) reads, 74.6% of which were estimated to carry a complete open reading frame. A total of 91,881 high-quality FLNC reads were identified and mapped to 16,188 chromosomal loci, corresponding to 13,162 known genes and 3026 new genes not annotated previously. Although some FLNC reads could not be unambiguously mapped to the current draft genome sequence, many of them are likely useful for studying highly similar homoeologous or paralogous loci or for improving chromosomal contig assembly in further research. The 91,881 high-quality FLNC reads represented 22,768 unique transcripts, 9591 of which were newly discovered. We found 180 transcripts each spanning two or three previously annotated adjacent loci, suggesting that they should be merged to form correct gene models. Finally, our data facilitated the identification of 6030 genes differentially regulated during caryopsis development, and full-length transcripts for 72 transcribed gluten gene members that are important for the end-use quality control of common wheat. Our work demonstrated the value of PacBio transcript sequencing for improving common wheat genome annotation through uncovering the loci and full-length transcripts not discovered previously. The resource obtained may aid further structural genomics and grain transcriptome studies of common wheat.


September 22, 2019

Genome-wide analysis of complex wheat gliadins, the dominant carriers of celiac disease epitopes.

Gliadins, specified by six compound chromosomal loci (Gli-A1/B1/D1 and Gli-A2/B2/D2) in hexaploid bread wheat, are the dominant carriers of celiac disease (CD) epitopes. Because of their complexity, genome-wide characterization of gliadins is a major challenge. Here, we approached this challenge by combining transcriptomic, proteomic and bioinformatic investigations. Through third-generation RNA sequencing, full-length transcripts were identified for 52 gliadin genes in the bread wheat cultivar Xiaoyan 81. Of them, 42 were active and predicted to encode 25 α-, 11 γ-, one δ- and five ω-gliadins. Comparative proteomic analysis between Xiaoyan 81 and six newly developed mutants each lacking one Gli locus indicated the accumulation of 38 gliadins in the mature grains. A novel group of α-gliadins (the CSTT group) was recognized to contain very few or no CD epitopes. The δ-gliadins identified here or previously did not carry CD epitopes. Finally, the mutant lacking Gli-D2 showed significant reductions in the most celiac-toxic α-gliadins and derivative CD epitopes. The insights and resources generated here should aid further studies on gliadin functions in CD and the breeding of healthier wheat.


September 22, 2019

Improved metagenome assemblies and taxonomic binning using long-read circular consensus sequence data.

DNA assembly is a core methodological step in metagenomic pipelines used to study the structure and function within microbial communities. Here we investigate the utility of Pacific Biosciences long, high-accuracy circular consensus sequencing (CCS) reads for metagenomic projects. We compared the application and performance of both PacBio CCS and Illumina HiSeq data with assembly and taxonomic binning algorithms using metagenomic samples representing a complex microbial community. Eight SMRT cells produced approximately 94 Mb of CCS reads from a biogas reactor microbiome sample that averaged 1319 nt in length and 99.7% accuracy. CCS data assembly generated a number of large contigs (greater than 1 kb) comparable to that assembled from a ~190x larger HiSeq dataset (~18 Gb) produced from the same sample (i.e., approximately 62% of total contigs). Hybrid assemblies using PacBio CCS and HiSeq contigs produced improvements in assembly statistics, including an increase in the average contig length and number of large contigs. The incorporation of CCS data produced significant enhancements in taxonomic binning and genome reconstruction of two dominant phylotypes, which assembled and binned poorly using HiSeq data alone. Collectively these results illustrate the value of PacBio CCS reads in certain metagenomics applications.


September 22, 2019

Atmospheric N deposition alters connectance, but not functional potential among saprotrophic bacterial communities.

The use of co-occurrence patterns to investigate interactions between micro-organisms has provided novel insight into organismal interactions within microbial communities. However, anthropogenic impacts on microbial co-occurrence patterns and ecosystem function remain an important gap in our ecological knowledge. In a northern hardwood forest ecosystem located in Michigan, USA, 20 years of experimentally increased atmospheric N deposition has reduced forest floor decay and increased soil C storage. This ecosystem-level response occurred concomitantly with compositional changes in saprophytic fungi and bacteria. Here, we investigated the influence of experimental N deposition on biotic interactions among forest floor bacterial assemblages by employing phylogenetic and molecular ecological network analysis. When compared to the ambient treatment, the forest floor bacterial community under experimental N deposition was less rich, more phylogenetically dispersed and exhibited a more clustered co-occurrence network topology. Together, our observations reveal the presence of increased biotic interactions among saprotrophic bacterial assemblages under future rates of N deposition. Moreover, they support the hypothesis that nearly two decades of experimental N deposition can modify the organization of microbial communities and provide further insight into why anthropogenic N deposition has reduced decomposition, increased soil C storage and accelerated phenolic DOC production in our field experiment. © 2015 John Wiley & Sons Ltd.


September 22, 2019

Towards long-read metagenomics: complete assembly of three novel genomes from bacteria dependent on a diazotrophic cyanobacterium in a freshwater lake co-culture.

Here we report three complete bacterial genome assemblies from a PacBio shotgun metagenome of a co-culture from Upper Klamath Lake, OR. Genome annotations and culture conditions indicate these bacteria are dependent on carbon and nitrogen fixation from the cyanobacterium Aphanizomenon flos-aquae, whose genome was assembled to draft-quality. Due to their taxonomic novelty relative to previously sequenced bacteria, we have temporarily designated these bacteria as incertae sedis Hyphomonadaceae strain UKL13-1 (3,501,508 bp and 56.12% GC), incertae sedis Betaproteobacterium strain UKL13-2 (3,387,087 bp and 54.98% GC), and incertae sedis Bacteroidetes strain UKL13-3 (3,236,529 bp and 37.33% GC). Each genome consists of a single circular chromosome with no identified plasmids. When compared with binned Illumina assemblies of the same three genomes, there was ~7% discrepancy in total genome length. Gaps where Illumina assemblies broke were often due to repetitive elements. Within these missing sequences were essential genes and genes associated with a variety of functional categories. Annotated gene content reveals that both Proteobacteria are aerobic anoxygenic phototrophs, with Betaproteobacterium UKL13-2 potentially capable of phototrophic oxidation of sulfur compounds. Both proteobacterial genomes contain transporters suggesting they are scavenging fixed nitrogen from A. flos-aquae in the form of ammonium. Bacteroidetes UKL13-3 has few completely annotated biosynthetic pathways, and has a comparatively higher proportion of unannotated genes. The genomes were detected in only a few other freshwater metagenomes, suggesting that these bacteria are not ubiquitous in freshwater systems. Our results indicate that long-read sequencing is a viable method for sequencing dominant members from low-diversity microbial communities, and should be considered for environmental metagenomics when conditions meet these requirements.


September 22, 2019

Complete genome sequence of Sphingobium baderi DE-13, an alkyl-substituted aniline-mineralizing bacterium.

Alkyl-substituted anilines are important aniline derivatives that may be associated with serious environmental risks. Previously, Sphingobium baderi DE-13, a bacterium that can mineralize alkyl-substituted anilines such as 2,6-dimethylaniline, 2,6-diethylaniline, 2-methyl-6-ethylaniline, 2-methylaniline, and 2-ethylaniline, was isolated from activated sludge. Here, we report the complete genome sequence of strain DE-13. It contains one circular chromosome and eight circular plasmids with a total of 4,583,422 bp and a GC content of 62.41%. The reported and predicted genes involved in the catabolism of alkyl-substituted anilines are indicated. This study will provide insights into the bacterial catabolism of alkyl-substituted anilines.


September 22, 2019

Identification of the biosynthetic pathway for the antibiotic bicyclomycin.

Diketopiperazines (DKPs) make up a large group of natural products with diverse structures and biological activities. Bicyclomycin is a broad-spectrum DKP antibiotic with unique structure and function: it contains a highly oxidized bicyclic [4.2.2] ring and is the only known selective inhibitor of the bacterial transcription termination factor, Rho. Here, we identify the biosynthetic gene cluster for bicyclomycin containing six iron-dependent oxidases. We demonstrate that the DKP core is made by a tRNA-dependent cyclodipeptide synthase, and hydroxylations on two unactivated sp³ carbons are performed by two mononuclear iron, α-ketoglutarate-dependent hydroxylases. Using bioinformatics, we also identify a homologous gene cluster prevalent in the human pathogen Pseudomonas aeruginosa. We detect bicyclomycin by overexpressing this gene cluster and establish P. aeruginosa as a new producer of bicyclomycin. Our work uncovers the biosynthetic pathway for bicyclomycin and sheds light on the intriguing oxidation chemistry that converts a simple DKP into a powerful antibiotic.


September 22, 2019

Pangenome analyses of the wheat pathogen Zymoseptoria tritici reveal the structural basis of a highly plastic eukaryotic genome.

Structural variation contributes substantially to polymorphism within species. Chromosomal rearrangements that impact genes can lead to functional variation among individuals and influence the expression of phenotypic traits. Genomes of fungal pathogens show substantial chromosomal polymorphism that can drive virulence evolution on host plants. Assessing the adaptive significance of structural variation is challenging, because most studies rely on inferences based on a single reference genome sequence. We constructed and analyzed the pangenome of Zymoseptoria tritici, a major pathogen of wheat that evolved host specialization by chromosomal rearrangements and gene deletions. We used single-molecule real-time sequencing and high-density genetic maps to assemble multiple genomes. We annotated the gene space based on transcriptomics data that covered the infection life cycle of each strain. Based on a total of five telomere-to-telomere genomes, we constructed a pangenome for the species and identified a core set of 9149 genes. However, an additional 6600 genes were exclusive to a subset of the isolates. The substantial accessory genome encoded on average fewer expressed genes but a larger fraction of the candidate effector genes that may interact with the host during infection. We expanded our analyses of the pangenome to a worldwide collection of 123 isolates of the same species. We confirmed that accessory genes were indeed more likely to show deletion polymorphisms and loss-of-function mutations compared to core genes. The pangenome construction of a highly polymorphic eukaryotic pathogen showed that a single reference genome significantly underestimates the gene space of a species. The substantial accessory genome provides a cradle for adaptive evolution.


September 22, 2019

In situ analyses directly in diarrheal stool reveal large variations in bacterial load and active toxin expression of enterotoxigenic Escherichia coli and Vibrio cholerae.

The bacterial pathogens enterotoxigenic Escherichia coli (ETEC) and Vibrio cholerae are major causes of diarrhea. ETEC causes diarrhea by production of the heat-labile toxin (LT) and heat-stable toxins (STh and STp), while V. cholerae produces cholera toxin (CT). In this study, we determined the occurrence and bacterial doses of the two pathogens and their respective toxin expression levels directly in liquid diarrheal stools of patients in Dhaka, Bangladesh. By quantitative culture and real-time quantitative PCR (qPCR) detection of the toxin genes, the two pathogens were found to coexist in several of the patients, at concentrations between 10² and 10⁸ bacterial gene copies per ml. Even in culture-negative samples, gene copy numbers of 10² to 10⁴ of either ETEC or V. cholerae toxin genes were detected by qPCR. RNA was extracted directly from stool, and gene expression levels, quantified by reverse transcriptase qPCR (RT-qPCR), of the genes encoding CT, LT, STh, and STp showed expression of toxin genes. Toxin enzyme-linked immunosorbent assay (ELISA) confirmed active toxin secretion directly in the liquid diarrhea. Analysis of ETEC isolates by multiplex PCR, dot blot analysis, and genome sequencing suggested that there are genetic ETEC profiles that are more commonly found as dominating single pathogens and others that are coinfectants with lower bacterial loads. The ETEC genomes, including assembled genomes of dominating ETEC isolates expressing LT/STh/CS5/CS6 and LT/CS7, are provided. In addition, this study highlights an emerging important ETEC strain expressing LT/STp and the novel colonization factor CS27b. These findings have implications for investigations of pathogenesis as well as for vaccine development.
IMPORTANCE: The cause of diarrheal disease is usually determined by screening for several microorganisms by various methods, and sole detection is used to assign the agent as the cause of disease. However, it has become increasingly clear that many infections are caused by coinfections with several pathogens and that the dose of the infecting pathogen is important. We quantified the absolute numbers of enterotoxigenic E. coli (ETEC) and Vibrio cholerae directly in diarrheal fluid. We noted several events where both pathogens were found but also a large dose dependency. In three samples, we found ETEC as the only pathogen sought for. These isolates belonged to globally distributed ETEC clones and were the dominating species in stool with active toxin expression. This suggests that certain superior virulent ETEC lineages are able to outcompete the gut microbiota and be the sole cause of disease and hence need to be specifically monitored.


September 22, 2019

Comparative genomics of completely sequenced Lactobacillus helveticus genomes provides insights into strain-specific genes and resolves metagenomics data down to the strain level.

Although complete genome sequences hold particular value for an accurate description of core genomes, the identification of strain-specific genes, and as the optimal basis for functional genomics studies, they are still largely underrepresented in public repositories. Based on an assessment of the genome assembly complexity for all lactobacilli, we used Pacific Biosciences’ long read technology to sequence and de novo assemble the genomes of three Lactobacillus helveticus starter strains, raising the number of completely sequenced strains to 12. The first comparative genomics study for L. helveticus (to our knowledge) identified a core genome of 988 genes and sets of unique, strain-specific genes ranging from about 30 to more than 200 genes. Importantly, the comparison of MiSeq- and PacBio-based assemblies uncovered that not only accessory but also core genes can be missed in incomplete genome assemblies based on short reads. Analysis of the three genomes revealed that a large number of pseudogenes were enriched for functional Gene Ontology categories such as amino acid transmembrane transport and carbohydrate metabolism, which is in line with a reductive genome evolution in the rich natural habitat of L. helveticus. Notably, the functional Clusters of Orthologous Groups of proteins categories “cell wall/membrane biogenesis” and “defense mechanisms” were found to be enriched among the strain-specific genes. A genome mining effort uncovered examples where an experimentally observed phenotype could be linked to the underlying genotype, such as for cell envelope proteinase PrtH3 of strain FAM8627. Another possible link identified for peptidoglycan hydrolases will require further experiments. Of note, strain FAM22155 did not harbor a CRISPR/Cas system; its loss was also observed in other L. helveticus strains and Lactobacillus species, thus questioning the value of the CRISPR/Cas system for diagnostic purposes. Importantly, the complete genome sequences proved to be very useful for the analysis of natural whey starter cultures with metagenomics, as a larger percentage of the sequenced reads of these complex mixtures could be unambiguously assigned down to the strain level.


September 22, 2019

Characterization of β-glucan formation by Lactobacillus brevis TMW 1.2112 isolated from slimy spoiled beer.

Despite several hurdles that hinder bacterial growth in beer, certain bacteria are still able to spoil beer. One type of spoilage is characterized by an increased viscosity and slimy texture caused by exopolysaccharide (EPS) formation of lactic acid bacteria (LAB). In this study, we characterize for the first time EPS production in a beer-spoiling strain (TMW 1.2112) of Lactobacillus brevis, a species commonly involved in beer spoilage. The strain’s growth dynamics were assessed and we found an increased viscosity or ropiness in liquid or on solid media, respectively. Capsular polysaccharides (CPS) and released EPS from the cells or supernatant, respectively, were analyzed via NMR spectroscopy and methylation analysis. Both are identical β-(1→3)-glucans, which are ramified with β-glucose residues at position O2. Therefore, we assume that this EPS is mainly produced as CPS and partially released into the surrounding medium, causing viscosity of e.g. beer. CPS formation was confirmed via an agglutination test. A plasmid-located glycosyltransferase-2 was found to be responsible for excess β-glucan formation; chromosomal glucanases were proposed for its degradation. The glycosyltransferase-2 gene could also be specifically identified in beer-spoiling, slime-producing Lactobacillus rossiae and Lactobacillus parabuchneri strains, suggesting it as a promising marker gene for the early detection of β-glucan-producing lactobacilli in breweries. Copyright © 2017 Elsevier B.V. All rights reserved.


September 22, 2019

Comparative genomic analysis reveals the evolution and environmental adaptation strategies of vibrios.

Vibrios are among the most diverse and ecologically important marine bacteria, which have evolved many characteristics and lifestyles to occupy various niches. The relationship between genome features and environmental adaptation strategies is essential for understanding the ecological functions of vibrios in the marine system. The advent of complete genome sequencing technology has provided an important method of examining the genetic characteristics of vibrios on the genomic level. Two Vibrio genomes were sequenced and found to contain many unique orthologous families that were absent from the existing gene pool of complete Vibrio genomes. Comparative genomics analysis found that vibrios encompass a steady core-genome and a tremendous pan-genome, with substantial gene gain and horizontal gene transfer events in their evolutionary history. Evolutionary analysis based on the core-genome tree suggested that V. fischeri emerged ~385 million years ago, along with the occurrence of cephalopods and the flourishing of fish. The relatively large genomes, the high number of 16S rRNA gene copies, and the presence of R-M systems and CRISPR systems help vibrios live in various marine environments. Chitin-degrading related genes are carried in nearly all the Vibrio genomes. The number of chitinase genes in vibrios has expanded greatly compared with that in the most recent ancestor of the genus. The chitinase A genes were estimated to have evolved along with the genus, and have undergone significant purifying selection to conserve the ancestral state. Vibrios have experienced extreme genome expansion events during their evolutionary history, allowing them to develop various functions and to spread globally. Despite their close phylogenetic relationships, vibrios were found to have a tremendous pan-genome with a steady core-genome, which indicates the highly plastic genome of the genus. Additionally, the existence of various chitin-degrading related genes and the expansion of chitinase A in the genus demonstrate the importance of chitin utilization for vibrios. Defensive systems in the Vibrio genomes may protect them from the invasion of external DNA. The genomic features investigated here provide a better understanding of how the evolutionary process has forged Vibrio genomes to occupy various niches.


September 22, 2019

Functional genomics of lipid metabolism in the oleaginous yeast Rhodosporidium toruloides.

The basidiomycete yeast Rhodosporidium toruloides (also known as Rhodotorula toruloides) accumulates high concentrations of lipids and carotenoids from diverse carbon sources. It has great potential as a model for the cellular biology of lipid droplets and for sustainable chemical production. We developed a method for high-throughput genetics (RB-TDNAseq), using sequence-barcoded Agrobacterium tumefaciens T-DNA insertions. We identified 1,337 putative essential genes with low T-DNA insertion rates. We functionally profiled genes required for fatty acid catabolism and lipid accumulation, validating results with 35 targeted deletion strains. We identified a high-confidence set of 150 genes affecting lipid accumulation, including genes with predicted function in signaling cascades, gene expression, protein modification and vesicular trafficking, autophagy, amino acid synthesis and tRNA modification, and genes of unknown function. These results greatly advance our understanding of lipid metabolism in this oleaginous species and demonstrate a general approach for barcoded mutagenesis that should enable functional genomics in diverse fungi.


September 22, 2019

Comparative genomics of the Baltic Sea toxic cyanobacteria Nodularia spumigena UHCC 0039 and its response to varying salinity.

Salinity is an important abiotic factor controlling the distribution and abundance of Nodularia spumigena, the dominating diazotrophic and toxic phototroph, in the brackish water cyanobacterial blooms of the Baltic Sea. To expand the available genomic information for brackish water cyanobacteria, we sequenced the isolate Nodularia spumigena UHCC 0039 using an Illumina-SMRT hybrid sequencing approach, revealing a chromosome of 5,294,286 base pairs (bp) and a single plasmid of 92,326 bp. Comparative genomics in Nostocales showed pronounced genetic similarity among Nodularia spumigena strains evidencing their short evolutionary history. The studied Baltic Sea strains share similar sets of CRISPR-Cas cassettes and a higher number of insertion sequence (IS) elements compared to Nodularia spumigena CENA596 isolated from a shrimp production pond in Brazil. Nodularia spumigena UHCC 0039 proliferated similarly at three tested salinities, whereas the lack of salt inhibited its growth and triggered transcriptome remodeling, including the up-regulation of five sigma factors and the down-regulation of two other sigma factors, one of which is specific for strain UHCC 0039. Down-regulated genes additionally included a large genetic region for the synthesis of two yet unidentified natural products. Our results indicate a remarkable plasticity of the Nodularia salinity acclimation, and thus salinity strongly impacts the intensity and distribution of cyanobacterial blooms in the Baltic Sea.

