Menu
September 22, 2019

Multi-platform assessment of transcriptome profiling using RNA-seq in the ABRF next-generation sequencing study.

High-throughput RNA sequencing (RNA-seq) greatly expands the potential for genomics discoveries, but the wide variety of platforms, protocols and performance capabilitites has created the need for comprehensive reference data. Here we describe the Association of Biomolecular Resource Facilities next-generation sequencing (ABRF-NGS) study on RNA-seq. We carried out replicate experiments across 15 laboratory sites using reference RNA standards to test four protocols (poly-A-selected, ribo-depleted, size-selected and degraded) on five sequencing platforms (Illumina HiSeq, Life Technologies PGM and Proton, Pacific Biosciences RS and Roche 454). The results show high intraplatform (Spearman rank R > 0.86) and inter-platform (R > 0.83) concordance for expression measures across the deep-count platforms, but highly variable efficiency and cost for splice junction and variant detection between all platforms. For intact RNA, gene expression profiles from rRNA-depletion and poly-A enrichment are similar. In addition, rRNA depletion enables effective analysis of degraded RNA samples. This study provides a broad foundation for cross-platform standardization, evaluation and improvement of RNA-seq.


September 22, 2019

Metagenomic approaches to assess bacteriophages in various environmental niches.

Bacteriophages are ubiquitous and numerous parasites of bacteria and play a critical evolutionary role in virtually every ecosystem, yet our understanding of the extent of the diversity and role of phages remains inadequate for many ecological niches, particularly in cases in which the host is unculturable. During the past 15 years, the emergence of the field of viral metagenomics has drastically enhanced our ability to analyse the so-called viral ‘dark matter’ of the biosphere. Here, we review the evolution of viral metagenomic methodologies, as well as providing an overview of some of the most significant applications and findings in this field of research.


September 22, 2019

Great differences in performance and outcome of high-throughput sequencing data analysis platforms for fungal metabarcoding.

Along with recent developments in high-throughput sequencing (HTS) technologies and thus fast accumulation of HTS data, there has been a growing need and interest for developing tools for HTS data processing and communication. In particular, a number of bioinformatics tools have been designed for analysing metabarcoding data, each with specific features, assumptions and outputs. To evaluate the potential effect of the application of different bioinformatics workflow on the results, we compared the performance of different analysis platforms on two contrasting high-throughput sequencing data sets. Our analysis revealed that the computation time, quality of error filtering and hence output of specific bioinformatics process largely depends on the platform used. Our results show that none of the bioinformatics workflows appears to perfectly filter out the accumulated errors and generate Operational Taxonomic Units, although PipeCraft, LotuS and PIPITS perform better than QIIME2 and Galaxy for the tested fungal amplicon dataset. We conclude that the output of each platform requires manual validation of the OTUs by examining the taxonomy assignment values.


September 22, 2019

A manganese superoxide dismutase (MnSOD) from red lip mullet, Liza haematocheila: Evaluation of molecular structure, immune response, and antioxidant function.

Manganese superoxide dismutase (MnSOD) is a nuclear-encoded antioxidant metalloenzyme. The main function of this enzyme is to dismutase the toxic superoxide anion (O2-) into less toxic hydrogen peroxide (H2O2) and oxygen (O2). Structural analysis of mullet MnSOD (MuMnSOD) was performed using different bioinformatics tools. Pairwise alignment revealed that the protein sequence matched to that derived from Larimichthys crocea with a 95.2% sequence identity. Phylogenetic tree analysis showed that the MuMnSOD was included in the category of teleosts. Multiple sequence alignment showed that a SOD Fe-N domain, SOD Fe-C domain, and Mn/Fe SOD signature were highly conserved among the other examined MnSOD orthologs. Quantitative real-time PCR showed that the highest MuMnSOD mRNA expression level was in blood cells. The highest expression level of MuMnSOD was observed in response to treatment with both Lactococcus garvieae and lipopolysaccharide (LPS) at 6?h post treatment in the head kidney and blood. Potential ROS-scavenging ability of the purified recombinant protein (rMuMnSOD) was examined by the xanthine oxidase assay (XOD assay). The optimum temperature and pH for XOD activity were found to be 25?°C and pH 7, respectively. Relative XOD activity was significantly increased with the dose of rMuMnSOD, revealing its dose dependency. Activity of rMuMnSOD was inhibited by potassium cyanide (KCN) and N-N’-diethyl-dithiocarbamate (DDC). Moreover, expression of MuMnSOD resulted in considerable growth retardation of both gram-positive and gram-negative bacteria. Results of the current study suggest that MuMnSOD acts as an antioxidant enzyme and participates in the immune response in mullet. Copyright © 2018 Elsevier Ltd. All rights reserved.


September 22, 2019

Discovery of enzymes for toluene synthesis from anoxic microbial communities.

Microbial toluene biosynthesis was reported in anoxic lake sediments more than three decades ago, but the enzyme catalyzing this biochemically challenging reaction has never been identified. Here we report the toluene-producing enzyme PhdB, a glycyl radical enzyme of bacterial origin that catalyzes phenylacetate decarboxylation, and its cognate activating enzyme PhdA, a radical S-adenosylmethionine enzyme, discovered in two distinct anoxic microbial communities that produce toluene. The unconventional process of enzyme discovery from a complex microbial community (>300,000 genes), rather than from a microbial isolate, involved metagenomics- and metaproteomics-enabled biochemistry, as well as in vitro confirmation of activity with recombinant enzymes. This work expands the known catalytic range of glycyl radical enzymes (only seven reaction types had been characterized previously) and aromatic-hydrocarbon-producing enzymes, and will enable first-time biochemical synthesis of an aromatic fuel hydrocarbon from renewable resources, such as lignocellulosic biomass, rather than from petroleum.


September 22, 2019

Transcriptome characterization of moso bamboo (Phyllostachys edulis) seedlings in response to exogenous gibberellin applications.

Moso bamboo (Phyllostachys edulis) is a well-known bamboo species of high economic value in the textile industry due to its rapid growth. Phytohormones, which are master regulators of growth and development, serve as important endogenous signals. However, the mechanisms through which phytohormones regulate growth in moso bamboo remain unknown to date.Here, we reported that exogenous gibberellins (GA) applications resulted in a significantly increased internode length and lignin condensation. Transcriptome sequencing revealed that photosynthesis-related genes were enriched in the GA-repressed gene class, which was consistent with the decrease in leaf chlorophyll concentrations and the lower rate of photosynthesis following GA treatment. Exogenous GA applications on seedlings are relatively easy to perform, thus we used 4-week-old whole seedlings of bamboo for GA- treatment followed by high throughput sequencing. In this study, we identified 932 cis-nature antisense transcripts (cis-NATs), and 22,196 alternative splicing (AS) events in total. Among them, 42 cis-nature antisense transcripts (cis-NATs) and 442 AS events were differentially expressed upon exposure to exogenous GA3, suggesting that post-transcriptional regulation might be also involved in the GA3 response. Targets of differential expression of cis-NATs included genes involved in hormone receptor, photosynthesis and cell wall biogenesis. For example, LAC4 and its corresponding cis-NATs were GA3-induced, and may be involved in the accumulation of lignin, thus affecting cell wall composition.This study provides novel insights illustrating how GA alters post-transcriptional regulation and will shed light on the underlying mechanism of growth modulated by GA in moso bamboo.


September 22, 2019

Revealing the transcriptomic complexity of switchgrass by PacBio long-read sequencing.

Switchgrass (Panicum virgatum L.) is an important bioenergy crop widely used for lignocellulosic research. While extensive transcriptomic analyses have been conducted on this species using short read-based sequencing techniques, very little has been reliably derived regarding alternatively spliced (AS) transcripts.We present an analysis of transcriptomes of six switchgrass tissue types pooled together, sequenced using Pacific Biosciences (PacBio) single-molecular long-read technology. Our analysis identified 105,419 unique transcripts covering 43,570 known genes and 8795 previously unknown genes. 45,168 are novel transcripts of known genes. A total of 60,096 AS transcripts are identified, 45,628 being novel. We have also predicted 1549 transcripts of genes involved in cell wall construction and remodeling, 639 being novel transcripts of known cell wall genes. Most of the predicted transcripts are validated against Illumina-based short reads. Specifically, 96% of the splice junction sites in all the unique transcripts are validated by at least five Illumina reads. Comparisons between genes derived from our identified transcripts and the current genome annotation revealed that among the gene set predicted by both analyses, 16,640 have different exon-intron structures.Overall, substantial amount of new information is derived from the PacBio RNA data regarding both the transcriptome and the genome of switchgrass.


September 22, 2019

Analysis of transcripts and splice isoforms in red clover (Trifolium pratense L.) by single-molecule long-read sequencing.

Red clover (Trifolium pratense L.) is an important cool-season legume plant, which is the most widely planted forage legume after alfalfa. Although a draft genome sequence was published already, the sequences and completed structure of mRNA transcripts remain unclear, which limit further explore on red clover.In this study, the red clover transcriptome was sequenced using single-molecule long-read sequencing to identify full-length splice isoforms, and 29,730 novel isoforms from known genes and 2194 novel isoforms from novel genes were identified. A total of 5492 alternative splicing events was identified and the majority of alter spliced events in red clover was corrected as intron retention. In addition, of the 15,229 genes detected by SMRT, 8719 including 186,517 transcripts have at least one poly(A) site. Furthermore, we identified 4333 long non-coding RNAs and 3762 fusion transcripts.We analyzed full-length transcriptome of red clover with PacBio SMRT. Those new findings provided important information for improving red clover draft genome annotation and fully characterization of red clover transcriptome.


September 22, 2019

Long-read sequencing and de novo assembly of a Chinese genome.

Short-read sequencing has enabled the de novo assembly of several individual human genomes, but with inherent limitations in characterizing repeat elements. Here we sequence a Chinese individual HX1 by single-molecule real-time (SMRT) long-read sequencing, construct a physical map by NanoChannel arrays and generate a de novo assembly of 2.93?Gb (contig N50: 8.3?Mb, scaffold N50: 22.0?Mb, including 39.3?Mb N-bases), together with 206?Mb of alternative haplotypes. The assembly fully or partially fills 274 (28.4%) N-gaps in the reference genome GRCh38. Comparison to GRCh38 reveals 12.8?Mb of HX1-specific sequences, including 4.1?Mb that are not present in previously reported Asian genomes. Furthermore, long-read sequencing of the transcriptome reveals novel spliced genes that are not annotated in GENCODE and are missed by short-read RNA-Seq. Our results imply that improved characterization of genome functional variation may require the use of a range of genomic technologies on diverse human populations.


September 22, 2019

Base modifications affecting RNA polymerase and reverse transcriptase fidelity.

Ribonucleic acid (RNA) is capable of hosting a variety of chemically diverse modifications, in both naturally-occurring post-transcriptional modifications and artificial chemical modifications used to expand the functionality of RNA. However, few studies have addressed how base modifications affect RNA polymerase and reverse transcriptase activity and fidelity. Here, we describe the fidelity of RNA synthesis and reverse transcription of modified ribonucleotides using an assay based on Pacific Biosciences Single Molecule Real-Time sequencing. Several modified bases, including methylated (m6A, m5C and m5U), hydroxymethylated (hm5U) and isomeric bases (pseudouridine), were examined. By comparing each modified base to the equivalent unmodified RNA base, we can determine how the modification affected cumulative RNA polymerase and reverse transcriptase fidelity. 5-hydroxymethyluridine and N6-methyladenosine both increased the combined error rate of T7 RNA polymerase and reverse transcriptases, while pseudouridine specifically increased the error rate of RNA synthesis by T7 RNA polymerase. In addition, we examined the frequency, mutational spectrum and sequence context of reverse transcription errors on DNA templates from an analysis of second strand DNA synthesis.


September 22, 2019

Deciphering highly similar multigene family transcripts from Iso-Seq data with IsoCon

A significant portion of genes in vertebrate genomes belongs to multigene families, with each family containing several gene copies whose presence/absence, as well as isoform structure, can be highly variable across individuals. Existing de novo techniques for assaying the sequences of such highly-similar gene families fall short of reconstructing end-to-end transcripts with nucleotide-level precision or assigning alternatively spliced transcripts to their respective gene copies. We present IsoCon, a high-precision method using long PacBio Iso-Seq reads to tackle this challenge. We apply IsoCon to nine Y chromosome ampliconic gene families and show that it outperforms existing methods on both experimental and simulated data. IsoCon has allowed us to detect an unprecedented number of novel isoforms and has opened the door for unraveling the structure of many multigene families and gaining a deeper understanding of genome evolution and human diseases.


September 22, 2019

Construction of a draft reference transcripts of onion (Allium cepa) using long-read sequencing

To obtain intact and full-length RNA transcripts of onion (Allium cepa), long-read sequencing technology was first applied. Total RNAs extracted from four tissues; flowers, leaves, bulbs and roots, of red–purple and yellow-colored onions (A. cepa) were sequenced using long-read sequencing (RSII platform, P4-C2 chemistry). The 99,247 polished high-quality isoforms were produced by sequence correction processes of consensus calling, quality filtering, orientation verification, misread-nucleotide correction and dot-matrix view. The dot-matrix view was subsequently used to remove artificial inverted repeats (IRs), and resultantly 421 IRs were removed. The remaining 98,826 isoforms were condensed to 35,505 through the removal process of redundant isoforms. To assess the completeness of the 35,505 isoforms, the ratio of full-length isoforms, short-read mapping to the isoforms, and differentially expressed genes among the four tissues were analyzed along with the gene ontology across the tissues. As a result, the 35,505 isoforms were verified as a collection of isoforms with high completeness, and designated as draft reference transcripts (DRTs, ver 1.0) constructed by long-read sequencing.


September 22, 2019

Bacterial diversity and community structure in Chongqing radish paocai brines revealed using PacBio single-molecule real-time sequencing technology.

Traditional Chongqing radish paocai fermented with aged brine is considered to have the most intense flavor and authentic taste. Eight ‘Yanzhi’ (red, RRPB group) and ‘Chunbulao’ (white, WRPB) radish paocai brine samples were collected from Chongqing peasant households, and the diversity and community structures of bacteria present in these brines were determined using PacBio single-molecule real-time sequencing of their full-length 16S rRNA genes.In total, 30 phyla, 218 genera, and 306 species were identified from the RRPB group, with 20 phyla, 261 genera, and 420 species present in the WRPB group. Obvious differences in bacterial profiles between the RRPB and WRPB groups were found, with the bacterial diversity of the WRPB group shown to be greater than that of the RRPB group. This study revealed several characteristics of the bacteria composition, including the predominance of heterofermentative lactic acid bacteria, the species diversity of genus Pseudomonas, and the presence of three opportunistic pathogenic species.This study provides detailed information on the bacterial diversity and community structure of Chongqing radish paocai brine samples, and suggests it may be necessary to analyze paocai brine for potential sources of bacterial contamination and take appropriate measures to exclude any pathogenic species. © 2018 Society of Chemical Industry.© 2018 Society of Chemical Industry.


September 22, 2019

Capturing single cell genomes of active polysaccharide degraders: an unexpected contribution of Verrucomicrobia.

Microbial hydrolysis of polysaccharides is critical to ecosystem functioning and is of great interest in diverse biotechnological applications, such as biofuel production and bioremediation. Here we demonstrate the use of a new, efficient approach to recover genomes of active polysaccharide degraders from natural, complex microbial assemblages, using a combination of fluorescently labeled substrates, fluorescence-activated cell sorting, and single cell genomics. We employed this approach to analyze freshwater and coastal bacterioplankton for degraders of laminarin and xylan, two of the most abundant storage and structural polysaccharides in nature. Our results suggest that a few phylotypes of Verrucomicrobia make a considerable contribution to polysaccharide degradation, although they constituted only a minor fraction of the total microbial community. Genomic sequencing of five cells, representing the most predominant, polysaccharide-active Verrucomicrobia phylotype, revealed significant enrichment in genes encoding a wide spectrum of glycoside hydrolases, sulfatases, peptidases, carbohydrate lyases and esterases, confirming that these organisms were well equipped for the hydrolysis of diverse polysaccharides. Remarkably, this enrichment was on average higher than in the sequenced representatives of Bacteroidetes, which are frequently regarded as highly efficient biopolymer degraders. These findings shed light on the ecological roles of uncultured Verrucomicrobia and suggest specific taxa as promising bioprospecting targets. The employed method offers a powerful tool to rapidly identify and recover discrete genomes of active players in polysaccharide degradation, without the need for cultivation.


September 22, 2019

Complete genome sequence of Elizabethkingia sp. BM10, a symbiotic bacterium of the wood-feeding termite Reticulitermes speratus KMT1.

Elizabethkingia sp. BM10 was isolated from the hindgut of the wood-feeding termite Reticulitermes speratus KMT1. It had cellobiohydrolase and ß-glucosidase activities but not endo-ß-glucanase activity. The complete sequence of its genome, which has a total size of 4,242,519 bases, is reported here. The genomic analysis identified six ß-glucosidase candidate genes and three ß-glucanase candidate genes. Copyright © 2015 Lee et al.


Talk with an expert

If you have a question, need to check the status of an order, or are interested in purchasing an instrument, we're here to help.