Menu
September 22, 2019

RNA sequencing (RNA-Seq) reveals extremely low levels of reticulocyte-derived globin gene transcripts in peripheral blood from horses (Equus caballus) and cattle (Bos taurus).

RNA-seq has emerged as an important technology for measuring gene expression in peripheral blood samples collected from humans and other vertebrate species. In particular, transcriptomics analyses of whole blood can be used to study immunobiology and develop novel biomarkers of infectious disease. However, an obstacle to these methods in many mammalian species is the presence of reticulocyte-derived globin mRNAs in large quantities, which can complicate RNA-seq library sequencing and impede detection of other mRNA transcripts. A range of supplementary procedures for targeted depletion of globin transcripts have, therefore, been developed to alleviate this problem. Here, we use comparative analyses of RNA-seq data sets generated from human, porcine, equine, and bovine peripheral blood to systematically assess the impact of globin mRNA on routine transcriptome profiling of whole blood in cattle and horses. The results of these analyses demonstrate that total RNA isolated from equine and bovine peripheral blood contains very low levels of globin mRNA transcripts, thereby negating the need for globin depletion and greatly simplifying blood-based transcriptomic studies in these two domestic species.


September 22, 2019

Differential responses of total and active soil microbial communities to long-term experimental N deposition

Abstract The relationship between total and metabolically active soil microbial communities can provide insight into how these communities are impacted by environmental change, which may impact the flow of energy and cycling of nutrients in the future. For example, the anthropogenic release of biologically available N has dramatically increased over the last 150 years, which can alter the processes controlling C storage in terrestrial ecosystems. In a northern hardwood forest ecosystem located in Michigan, USA, nearly 20 years of experimentally increased atmospheric N deposition has reduced forest floor decay and increased soil C storage. A microbial mechanism underlies this response, as compositional changes in the soil microbial community have been concomitantly documented with these biogeochemical changes. Here, we co-extracted DNA and RNA from decaying leaf litter to determine if experimental atmospheric N deposition has lowered the diversity and altered the composition of the whole communities of bacteria and fungi (i.e., DNA-based) and well as its active members (i.e., RNA-based). In our experiment, experimental N deposition did not affect the composition, diversity, or richness of the total forest floor fungal community, but did lower the diversity (-8%), as well as altered the composition of the active fungal community. In contrast, neither the total nor active forest floor bacterial community was significantly affected by experimental N deposition. Our results suggest that future rates of atmospheric N deposition can fundamentally alter the organization of the saprotrophic soil fungal community, key mediators of C cycling in terrestrial environments.


September 22, 2019

The Epstein-Barr virus miR-BHRF1 microRNAs regulate viral gene expression in cis.

The Epstein-Barr virus (EBV) miR-BHRF1 microRNA (miRNA) cluster has been shown to facilitate B-cell transformation and promote the rapid growth of the resultant lymphoblastoid cell lines (LCLs). However, we find that expression of physiological levels of the miR-BHRF1 miRNAs in LCLs transformed with a miR-BHRF1 null mutant (?123) fails to increase their growth rate. We demonstrate that the pri-miR-BHRF1-2 and 1-3 stem-loops are present in the 3’UTR of transcripts encoding EBNA-LP and that excision of pre-miR-BHRF1-2 and 1-3 by Drosha destabilizes these mRNAs and reduces expression of the encoded protein. Therefore, mutational inactivation of pri-miR-BHRF1-2 and 1-3 in the ?123 mutant upregulates the expression of not only EBNA-LP but also EBNA-LP-regulated mRNAs and proteins, including LMP1. We hypothesize that this overexpression causes the reduced transformation capacity of the ?123 EBV mutant. Thus, in addition to regulating cellular mRNAs in trans, miR-BHRF1-2 and 1-3 also regulate EBNA-LP mRNA expression in cis. Copyright © 2017 Elsevier Inc. All rights reserved.


September 22, 2019

BIGMAC : breaking inaccurate genomes and merging assembled contigs for long read metagenomic assembly.

The problem of de-novo assembly for metagenomes using only long reads is gaining attention. We study whether post-processing metagenomic assemblies with the original input long reads can result in quality improvement. Previous approaches have focused on pre-processing reads and optimizing assemblers. BIGMAC takes an alternative perspective to focus on the post-processing step.Using both the assembled contigs and original long reads as input, BIGMAC first breaks the contigs at potentially mis-assembled locations and subsequently scaffolds contigs. Our experiments on metagenomes assembled from long reads show that BIGMAC can improve assembly quality by reducing the number of mis-assemblies while maintaining or increasing N50 and N75. Moreover, BIGMAC shows the largest N75 to number of mis-assemblies ratio on all tested datasets when compared to other post-processing tools. BIGMAC demonstrates the effectiveness of the post-processing approach in improving the quality of metagenomic assemblies.


September 22, 2019

The maize W22 genome provides a foundation for functional genomics and transposon biology.

The maize W22 inbred has served as a platform for maize genetics since the mid twentieth century. To streamline maize genome analyses, we have sequenced and de novo assembled a W22 reference genome using short-read sequencing technologies. We show that significant structural heterogeneity exists in comparison to the B73 reference genome at multiple scales, from transposon composition and copy number variation to single-nucleotide polymorphisms. The generation of this reference genome enables accurate placement of thousands of Mutator (Mu) and Dissociation (Ds) transposable element insertions for reverse and forward genetics studies. Annotation of the genome has been achieved using RNA-seq analysis, differential nuclease sensitivity profiling and bisulfite sequencing to map open reading frames, open chromatin sites and DNA methylation profiles, respectively. Collectively, the resources developed here integrate W22 as a community reference genome for functional genomics and provide a foundation for the maize pan-genome.


September 22, 2019

SMRT-Cappable-seq reveals complex operon variants in bacteria.

Current methods for genome-wide analysis of gene expression require fragmentation of original transcripts into small fragments for short-read sequencing. In bacteria, the resulting fragmented information hides operon complexity. Additionally, in vivo processing of transcripts confounds the accurate identification of the 5′ and 3′ ends of operons. Here we develop a methodology called SMRT-Cappable-seq that combines the isolation of un-fragmented primary transcripts with single-molecule long read sequencing. Applied to E. coli, this technology results in an accurate definition of the transcriptome with 34% of known operons from RegulonDB being extended by at least one gene. Furthermore, 40% of transcription termination sites have read-through that alters the gene content of the operons. As a result, most of the bacterial genes are present in multiple operon variants reminiscent of eukaryotic splicing. By providing such granularity in the operon structure, this study represents an important resource for the study of prokaryotic gene network and regulation.


September 22, 2019

Carbohydrate staple food modulates gut microbiota of Mongolians in China.

Gut microbiota is a determining factor in human physiological functions and health. It is commonly accepted that diet has a major influence on the gut microbial community, however, the effects of diet is not fully understood. The typical Mongolian diet is characterized by high and frequent consumption of fermented dairy products and red meat, and low level of carbohydrates. In this study, the gut microbiota profile of 26 Mongolians whom consumed wheat, rice and oat as the sole carbohydrate staple food for a week each consecutively was determined. It was observed that changes in staple carbohydrate rapidly (within a week) altered gut microbial community structure and metabolic pathway of the subjects. Wheat and oat favored bifidobacteria (Bifidobacterium catenulatum, Bifodobacteriumbifidum, Bifidobacterium adolescentis); whereas rice suppressed bifidobacteria (Bifidobacterium longum, Bifidobacterium adolescentis) and wheat suppresses Lactobaciilus, Ruminococcus and Bacteroides. The study exhibited two gut microbial clustering patterns with the preference of fucosyllactose utilization linking to fucosidase genes (glycoside hydrolase family classifications: GH95 and GH29) encoded by Bifidobacterium, and xylan and arabinoxylan utilization linking to xylanase and arabinoxylanase genes encoded by Bacteroides. There was also a correlation between Lactobacillus ruminis and sialidase, as well as Butyrivibrio crossotus and xylanase/xylosidase. Meanwhile, a strong concordance was found between the gastrointestinal bacterial microbiome and the intestinal virome. Present research will contribute to understanding the impacts of the dietary carbohydrate on human gut microbiome, which will ultimately help understand relationships between dietary factor, microbial populations, and the health of global humans.


September 22, 2019

16S rRNA long-read sequencing of the granulation tissue from nonsmokers and smokers-severe chronic periodontitis patients

Smoking has been associated with increased risk of periodontitis. The aim of the present study was to compare the periodontal disease severity among smokers and nonsmokers which may help in better understanding of predisposition to this chronic inflammation mediated diseases. We selected deep-seated infected granulation tissue removed during periodontal flap surgery procedures for identification and differential abundance of residential bacterial species among smokers and nonsmokers through long-read sequencing technology targeting full-length 16S rRNA gene. A total of 8 phyla were identified among which Firmicutes and Bacteroidetes were most dominating. Differential abundance analysis of OTUs through PICRUST showed significant (p>0.05) abundance of Phyla-Fusobacteria (Streptobacillus moniliformis); Phyla-Firmicutes (Streptococcus equi), and Phyla Proteobacteria (Enhydrobacter aerosaccus) in nonsmokers compared to smokers. The differential abundance of oral metagenomes in smokers showed significant enrichment of host genes modulating pathways involving primary immunodeficiency, citrate cycle, streptomycin biosynthesis, vitamin B6 metabolism, butanoate metabolism, glycine, serine, and threonine metabolism pathways. While thiamine metabolism, amino acid metabolism, homologous recombination, epithelial cell signaling, aminoacyl-tRNA biosynthesis, phosphonate/phosphinate metabolism, polycyclic aromatic hydrocarbon degradation, synthesis and degradation of ketone bodies, translation factors, Ascorbate and aldarate metabolism, and DNA replication pathways were significantly enriched in nonsmokers, modulation of these pathways in oral cavities due to differential enrichment of metagenomes in smokers may lead to an increased susceptibility to infections and/or higher formation of DNA adducts, which may increase the risk of carcinogenesis.


September 22, 2019

Microbial phylogenetic profiling with the Pacific Biosciences sequencing platform.

High-throughput sequencing of 16S rRNA gene amplicons has revolutionized the capacity and depth of microbial community profiling. Several sequencing platforms are available, but most phylogenetic studies are performed on the 454-pyrosequencing platform because its longer reads can give finer phylogenetic resolution. The Pacific Biosciences (PacBio) sequencing platform is significantly less expensive per run, does not rely on amplification for library generation, and generates reads that are, on average, four times longer than those from 454 (C2 chemistry), but the resulting high error rates appear to preclude its use in phylogenetic profiling. Recently, however, the PacBio platform was used to characterize four electrosynthetic microbiomes to the genus-level for less than USD 1,000 through the use of PacBio’s circular consensus sequence technology. Here, we describe in greater detail: 1) the output from successful 16S rRNA gene amplicon profiling with PacBio, 2) how the analysis was contingent upon several alterations to standard bioinformatic quality control workflows, and 3) the advantages and disadvantages of using the PacBio platform for community profiling.


September 22, 2019

PRAPI: post-transcriptional regulation analysis pipeline for Iso-Seq.

The single-molecule real-time (SMRT) isoform sequencing (Iso-Seq) based on Pacific Bioscience (PacBio) platform has received increasing attention for its ability to explore full-length isoforms. Thus, comprehensive tools for Iso-Seq bioinformatics analysis are extremely useful. Here, we present a one-stop solution for Iso-Seq analysis, called PRAPI to analyze alternative transcription initiation (ATI), alternative splicing (AS), alternative cleavage and polyadenylation (APA), natural antisense transcripts (NAT), and circular RNAs (circRNAs) comprehensively. PRAPI is capable of combining Iso-Seq full-length isoforms with short read data, such as RNA-Seq or polyadenylation site sequencing (PAS-seq) for differential expression analysis of NAT, AS, APA and circRNAs. Furthermore, PRAPI can annotate new genes and correct mis-annotated genes when gene annotation is available. Finally, PRAPI generates high-quality vector graphics to visualize and highlight the Iso-Seq results.The Dockerfile of PRAPI is available at http://www.bioinfor.org/tool/PRAPI.lfgu@fafu.edu.cn.


September 22, 2019

Metagenomic binning and association of plasmids with bacterial host genomes using DNA methylation.

Shotgun metagenomics methods enable characterization of microbial communities in human microbiome and environmental samples. Assembly of metagenome sequences does not output whole genomes, so computational binning methods have been developed to cluster sequences into genome ‘bins’. These methods exploit sequence composition, species abundance, or chromosome organization but cannot fully distinguish closely related species and strains. We present a binning method that incorporates bacterial DNA methylation signatures, which are detected using single-molecule real-time sequencing. Our method takes advantage of these endogenous epigenetic barcodes to resolve individual reads and assembled contigs into species- and strain-level bins. We validate our method using synthetic and real microbiome sequences. In addition to genome binning, we show that our method links plasmids and other mobile genetic elements to their host species in a real microbiome sample. Incorporation of DNA methylation information into shotgun metagenomics analyses will complement existing methods to enable more accurate sequence binning.


September 22, 2019

Defining a personal, allele-specific, and single-molecule long-read transcriptome.

Personal transcriptomes in which all of an individual’s genetic variants (e.g., single nucleotide variants) and transcript isoforms (transcription start sites, splice sites, and polyA sites) are defined and quantified for full-length transcripts are expected to be important for understanding individual biology and disease, but have not been described previously. To obtain such transcriptomes, we sequenced the lymphoblastoid transcriptomes of three family members (GM12878 and the parents GM12891 and GM12892) by using a Pacific Biosciences long-read approach complemented with Illumina 101-bp sequencing and made the following observations. First, we found that reads representing all splice sites of a transcript are evident for most sufficiently expressed genes =3 kb and often for genes longer than that. Second, we added and quantified previously unidentified splicing isoforms to an existing annotation, thus creating the first personalized annotation to our knowledge. Third, we determined SNVs in a de novo manner and connected them to RNA haplotypes, including HLA haplotypes, thereby assigning single full-length RNA molecules to their transcribed allele, and demonstrated Mendelian inheritance of RNA molecules. Fourth, we show how RNA molecules can be linked to personal variants on a one-by-one basis, which allows us to assess differential allelic expression (DAE) and differential allelic isoforms (DAI) from the phased full-length isoform reads. The DAI method is largely independent of the distance between exon and SNV–in contrast to fragmentation-based methods. Overall, in addition to improving eukaryotic transcriptome annotation, these results describe, to our knowledge, the first large-scale and full-length personal transcriptome.


September 22, 2019

Profiling of oral microbiota in early childhood caries using Single-Molecule Real-Time Sequencing

Background: Alterations of oral microbiota are the main cause of the progression of caries. The goal of this study was to characterize the oral microbiota in childhood caries based on single-molecule real-time sequencing. Methods: A total of 21 preschoolers, aged 3-5 years old with severe early childhood caries, and 20 age-matched, caries-free children as controls were recruited. Saliva samples were collected, followed by DNA extraction, Pacbio sequencing and phylogenetic analyses of the oral microbial communities. Results: 876 species derived from 13 known bacterial phyla and 110 genera were detected from 41 children using Pacbio sequencing. At the species level, 38 species, including Veillonella spp., Streptococcus spp., Prevotella spp. and Lactobacillus spp., showed higher abundance in the caries group compared to the caries-free group (p<0.05). The core microbiota at the genus and species levels was more stable in the caries-free micro-ecological niche. At follow-up, oral examinations 6 months after sample collection, development of new dental caries was observed in 5 children (the transitional group) among the 21 caries free children. Compared with the caries-free children, in the transitional and caries groups, 6 species, which were more abundant in the caries-free group, exhibited a relatively low abundance in both the caries group and the transitional group (p<0.05). We conclude that Abiotrophia spp., Neisseria spp. and Veillonella spp., are essential for maintaining a healthy oral microbial ecosystem. Prevotella spp., Lactobacillus spp., Dialister spp. and Filifactor spp. may be related to the pathogenesis and progression of dental caries.


September 22, 2019

Molecular characterization of eukaryotic algal communities in the tropical phyllosphere based on real-time sequencing of the 18S rDNA gene.

Foliicolous algae are a common occurrence in tropical forests. They are referable to a few simple morphotypes (unicellular, sarcinoid-like or filamentous), which makes their morphology of limited usefulness for taxonomic studies and species diversity assessments. The relationship between algal community and their host phyllosphere was not clear. In order to obtain a more accurate assessment, we used single molecule real-time sequencing of the 18S rDNA gene to characterize the eukaryotic algal community in an area of South-western China.We annotated 2922 OTUs belonging to five classes, Ulvophyceae, Trebouxiophyceae, Chlorophyceae, Dinophyceae and Eustigmatophyceae. Novel clades formed by large numbers sequences of green algae were detected in the order Trentepohliales (Ulvophyceae) and the Watanabea clade (Trebouxiophyceae), suggesting that these foliicolous communities may be substantially more diverse than so far appreciated and require further research. Species in Trentepohliales, Watanabea clade and Apatococcus clade were detected as the core members in the phyllosphere community studied. Communities from different host trees and sampling sites were not significantly different in terms of OTUs composition. However, the communities of Musa and Ravenala differed from other host plants significantly at the genus level, since they were dominated by Trebouxiophycean epiphytes.The cryptic diversity of eukaryotic algae especially Chlorophytes in tropical phyllosphere is very high. The community structure at species-level has no significant relationship either with host phyllosphere or locations. The core algal community in tropical phyllopshere is consisted of members from Trentepohliales, Watanabea clade and Apatococcus clade. Our study provided a large amount of novel 18S rDNA sequences that will be useful to unravel the cryptic diversity of phyllosphere eukaryotic algae and for comparisons with similar future studies on this type of communities.


September 22, 2019

Exploring the genome and transcriptome of the cave nectar bat Eonycteris spelaea with PacBio long-read sequencing.

In the past two decades, bats have emerged as an important model system to study host-pathogen interactions. More recently, it has been shown that bats may also serve as a new and excellent model to study aging, inflammation, and cancer, among other important biological processes. The cave nectar bat or lesser dawn bat (Eonycteris spelaea) is known to be a reservoir for several viruses and intracellular bacteria. It is widely distributed throughout the tropics and subtropics from India to Southeast Asia and pollinates several plant species, including the culturally and economically important durian in the region. Here, we report the whole-genome and transcriptome sequencing, followed by subsequent de novo assembly, of the E. spelaea genome solely using the Pacific Biosciences (PacBio) long-read sequencing platform.The newly assembled E. spelaea genome is 1.97 Gb in length and consists of 4,470 sequences with a contig N50 of 8.0 Mb. Identified repeat elements covered 34.65% of the genome, and 20,640 unique protein-coding genes with 39,526 transcripts were annotated.We demonstrated that the PacBio long-read sequencing platform alone is sufficient to generate a comprehensive de novo assembled genome and transcriptome of an important bat species. These results will provide useful insights and act as a resource to expand our understanding of bat evolution, ecology, physiology, immunology, viral infection, and transmission dynamics.


Talk with an expert

If you have a question, need to check the status of an order, or are interested in purchasing an instrument, we're here to help.