Menu
September 22, 2019

Isoform sequencing and state-of-art applications for unravelling complexity of plant transcriptomes

Single-molecule real-time (SMRT) sequencing developed by PacBio, also called third-generation sequencing (TGS), offers longer reads than the second-generation sequencing (SGS). Given its ability to obtain full-length transcripts without assembly, isoform sequencing (Iso-Seq) of transcriptomes by PacBio is advantageous for genome annotation, identification of novel genes and isoforms, as well as the discovery of long non-coding RNA (lncRNA). In addition, Iso-Seq gives access to the direct detection of alternative splicing, alternative polyadenylation (APA), gene fusion, and DNA modifications. Such applications of Iso-Seq facilitate the understanding of gene structure, post-transcriptional regulatory networks, and subsequently proteomic diversity. In this review, we summarize its applications in plant transcriptome study, specifically pointing out challenges associated with each step in the experimental design and highlight the development of bioinformatic pipelines. We aim to provide the community with an integrative overview and a comprehensive guidance to Iso-Seq, and thus to promote its applications in plant research.


September 22, 2019

Soil microbial communities are shaped by plant-driven changes in resource availability during secondary succession.

Although we understand the ecological processes eliciting changes in plant community composition during secondary succession, we do not understand whether co-occurring changes in plant detritus shape saprotrophic microbial communities in soil. In this study, we investigated soil microbial composition and function across an old-field chronosequence ranging from 16 to 86 years following agricultural abandonment, as well as three forests representing potential late-successional ecosystems. Fungal and bacterial community composition was quantified from ribosomal DNA, and insight into the functional potential of the microbial community to decay plant litter was gained from shotgun metagenomics and extracellular enzyme assays. Accumulation of soil organic matter across the chronosequence exerted a positive and significant effect on fungal phylogenetic ß-diversity and the activity of extracellular enzymes with lignocellulolytic activity. In addition, the increasing abundance of lignin-rich C4 grasses was positively related to the composition of fungal genes with lignocellulolytic function, thereby linking plant community composition, litter biochemistry, and microbial community function. However, edaphic properties were the primary agent shaping bacterial communities, as bacterial ß-diversity and variation in functional gene composition displayed a significant and positive relationship to soil pH across the chronosequence. The late-successional forests were compositionally distinct from the oldest old fields, indicating that substantial changes occur in soil microbial communities as old fields give way to forests. Taken together, our observations demonstrate that plants govern the turnover of soil fungal communities and functional characteristics during secondary succession, due to the continual input of detritus and differences in litter biochemistry among plant species.


September 22, 2019

Global identification of alternative splicing via comparative analysis of SMRT- and Illumina-based RNA-seq in strawberry.

Alternative splicing (AS) is a key post-transcriptional regulatory mechanism, yet little information is known about its roles in fruit crops. Here, AS was globally analyzed in the wild strawberry Fragaria vesca genome with RNA-seq data derived from different stages of fruit development. The AS landscape was characterized and compared between the single-molecule, real-time (SMRT) and Illumina RNA-seq platform. While SMRT has a lower sequencing depth, it identifies more genes undergoing AS (57.67% of detected multiexon genes) when it is compared with Illumina (33.48%), illustrating the efficacy of SMRT in AS identification. We investigated different modes of AS in the context of fruit development; the percentage of intron retention (IR) is markedly reduced whereas that of alternative acceptor sites (AA) is significantly increased post-fertilization when compared with pre-fertilization. When all the identified transcripts were combined, a total of 66.43% detected multiexon genes in strawberry undergo AS, some of which lead to a gain or loss of conserved domains in the gene products. The work demonstrates that SMRT sequencing is highly powerful in AS discovery and provides a rich data resource for later functional studies of different isoforms. Further, shifting AS modes may contribute to rapid changes of gene expression during fruit set.© 2016 The Authors The Plant Journal © 2016 John Wiley & Sons Ltd.


September 22, 2019

Composition and pathogenic potential of a microbial bioremediation product used for crude oil degradation.

A microbial bioremediation product (MBP) used for large-scale oil degradation was investigated for microbial constituents and possible pathogenicity. Aerobic growth on various media yielded >108 colonies mL-1. Full-length 16S rDNA sequencing and fatty acid profiling from morphologically distinct colonies revealed =13 distinct genera. Full-length 16S rDNA library sequencing, by either Sanger or long-read PacBio technology, suggested that up to 21% of the MBP was composed of Arcobacter. Other high abundance microbial constituents (>6%) included the genera Proteus, Enterococcus, Dysgonomonas and several genera in the order Bacteroidales. The MBP was most susceptible to ciprofloxacin, doxycycline, gentamicin, and meropenam. MBP exposure of human HT29 and A549 cells caused significant cytotoxicity, and bacterial growth and adherence. An acellular MBP filtrate was also cytotoxic to HT29, but not A549. Both MBP and filtrate exposures elevated the neutrophil chemoattractant IL-8. In endotracheal murine exposures, bacterial pulmonary clearance was complete after one-week. Elevation of pro-inflammatory cytokines IL-1ß, IL-6, and TNF-a, and chemokines KC and MCP-1 occurred between 2h and 48h post-exposure, followed by restoration to baseline levels at 96h. Cytokine/chemokine signalling was accompanied by elevated blood neutrophils and monocytes at 4h and 48h, respectively. Peripheral acute phase response markers were maximal at 24h. All indicators examined returned to baseline values by 168h. In contrast to HT29, but similar to A549 observations, MBP filtrate did not induce significant murine effects with the indicators examined. The results demonstrated the potentially complex nature of MBPs and transient immunological effects during exposure. Products containing microbes should be scrutinized for pathogenic components and subjected to characterisation and quality validation prior to commercial release.


September 22, 2019

Introduction to isoform sequencing using Pacific Biosciences technology (Iso-Seq)

Alternative RNA splicing is a known phenomenon, but we still do not have a complete catalog of isoforms that explain variability in the human transcriptome. We have made significant progress in developing methods to study variability of the transcriptome, but we are far away of having a complete picture of the transcriptome. The initial methods to study gene expression were based on cloning of cDNAs and Sanger sequencing. The strategy was labor-intensive and expensive. With the development of microarrays, different methods based on exon arrays and tiling arrays provided valuable information about RNA expression. However, the microarray presented significant limitations. Most of the limitations became apparent by 2005, but it was not until 2008 that an alternative method to study the transcriptome was developed. RNA Sequencing using next-generation sequencing (RNA-Seq) quickly became the technology of choice for gene expression profiling. Recently, the precision and sensitivity of RNA-Seq have come into question, especially for transcriptome reconstruction. This chapter will describe a relatively new method, “Isoform Sequencing (Iso-Seq). Iso-Seq was developed by Pacific Biosciences (PacBio), and it is capable of identifying new isoforms with extraordinary precision due to its long-read technology. The technique to create libraries is straightforward, and the PacBio RS II instrument generates the information in hours. The bioinformatics analysis is performed using the freely available SMRT® Portal software. The SMRT Portal is easy to use and capable of performing all the steps necessary to analyze the raw data and to generate high-quality full-length isoforms. For the universal acceptance of the Iso-Seq method, the capacity of the SMRT Cells needs to improve at least 10- to 100-fold to make the system affordable and attractive to users.


September 22, 2019

MHC class I diversity of olive baboons (Papio anubis) unravelled by next-generation sequencing.

The olive baboon represents an important model system to study various aspects of human biology and health, including the origin and diversity of the major histocompatibility complex. After screening of a group of related animals for polymorphisms associated with a well-defined microsatellite marker, subsequent MHC class I typing of a selected population of 24 animals was performed on two distinct next-generation sequencing (NGS) platforms. A substantial number of 21 A and 80 B transcripts were discovered, about half of which had not been previously reported. Per animal, from one to four highly transcribed A alleles (majors) were observed, in addition to ones characterised by low transcripion levels (minors), such as members of the A*14 lineage. Furthermore, in one animal, up to 13 B alleles with differential transcription level profiles may be present. Based on segregation profiles, 16 Paan-AB haplotypes were defined. A haplotype encodes in general one or two major A and three to seven B transcripts, respectively. A further peculiarity is the presence of at least one copy of a B*02 lineage on nearly every haplotype, which indicates that B*02 represents a separate locus with probably a specialistic function. Haplotypes appear to be generated by recombination-like events, and the breakpoints map not only between the A and B regions but also within the B region itself. Therefore, the genetic makeup of the olive baboon MHC class I region appears to have been subject to a similar or even more complex expansion process than the one documented for macaque species.


September 22, 2019

Single molecule real-time (SMRT) sequencing comes of age: applications and utilities for medical diagnostics.

Short read massive parallel sequencing has emerged as a standard diagnostic tool in the medical setting. However, short read technologies have inherent limitations such as GC bias, difficulties mapping to repetitive elements, trouble discriminating paralogous sequences, and difficulties in phasing alleles. Long read single molecule sequencers resolve these obstacles. Moreover, they offer higher consensus accuracies and can detect epigenetic modifications from native DNA. The first commercially available long read single molecule platform was the RS system based on PacBio’s single molecule real-time (SMRT) sequencing technology, which has since evolved into their RSII and Sequel systems. Here we capsulize how SMRT sequencing is revolutionizing constitutional, reproductive, cancer, microbial and viral genetic testing.© The Author(s) 2018. Published by Oxford University Press on behalf of Nucleic Acids Research.


September 22, 2019

Next-generation approaches to advancing eco-immunogenomic research in critically endangered primates.

High-throughput sequencing platforms are generating massive amounts of genomic data from nonmodel species, and these data sets are valuable resources that can be mined to advance a number of research areas. An example is the growing amount of transcriptome data that allow for examination of gene expression in nonmodel species. Here, we show how publicly available transcriptome data from nonmodel primates can be used to design novel research focused on immunogenomics. We mined transcriptome data from the world’s most endangered group of primates, the lemurs of Madagascar, for sequences corresponding to immunoglobulins. Our results confirmed homology between strepsirrhine and haplorrhine primate immunoglobulins and allowed for high-throughput sequencing of expressed antibodies (Ig-seq) in Coquerel’s sifaka (Propithecus coquereli). Using both Pacific Biosciences RS and Ion Torrent PGM sequencing, we performed Ig-seq on two individuals of Coquerel’s sifaka. We generated over 150 000 sequences of expressed antibodies, allowing for molecular characterization of the antigen-binding region. Our analyses suggest that similar VDJ expression patterns exist across all primates, with sequences closely related to the human VH 3 immunoglobulin family being heavily represented in sifaka antibodies. Moreover, the antigen-binding region of sifaka antibodies exhibited similar amino acid variation with respect to haplorrhine primates. Our study represents the first attempt to characterize sequence diversity of the expressed antibody repertoire in a species of lemur. We anticipate that methods similar to ours will provide the framework for investigating the adaptive immune response in wild populations of other nonmodel organisms and can be used to advance the burgeoning field of eco-immunology. © 2014 John Wiley & Sons Ltd.


September 22, 2019

Effects of antibiotic on microflora in ileum and cecum for broilers by 16S rRNA sequence analysis.

An experiment was conducted to analyze and compare the microbial composition, abundance, dynamic distribution, and functions without and with antibiotic fed to broilers. A 16S rRNA-sequencing approach was used to evaluate the bacterial composition of the gut of male broilers under different groups. A total of 240 1-day old AA male broilers were randomly assigned to two groups, with 120 broilers per group. The treatment group was administered an antibiotic with their feed, while the control group was not administered antibiotic (control group). A total of 10 replicates were assessed per treatment. The control group was fed a basal diet containing corn, soybean meal, and cottonseed meal and met the nutritional requirement. The antibiotic group was fed 100 mg/kg aureomycin (based on the basal diet). The trial lasted 42 days. Operational taxonomic unit partition and classification, alpha diversity, taxonomic composition, beta diversity, and microflora comparative analyses along with key species screening were performed for all of the treatment groups. Our data indicate that aureomycin treatment in broilers is directly correlated with variations of the gut content of specific bacterial taxa, and herein provide insights into the impact of antibiotic on microbial communities in cecum and ileum of broiler chickens.© 2018 Japanese Society of Animal Science.


September 22, 2019

Sequencing 16S rRNA gene fragments using the PacBio SMRT DNA sequencing system.

Over the past 10 years, microbial ecologists have largely abandoned sequencing 16S rRNA genes by the Sanger sequencing method and have instead adopted highly parallelized sequencing platforms. These new platforms, such as 454 and Illumina’s MiSeq, have allowed researchers to obtain millions of high quality but short sequences. The result of the added sequencing depth has been significant improvements in experimental design. The tradeoff has been the decline in the number of full-length reference sequences that are deposited into databases. To overcome this problem, we tested the ability of the PacBio Single Molecule, Real-Time (SMRT) DNA sequencing platform to generate sequence reads from the 16S rRNA gene. We generated sequencing data from the V4, V3-V5, V1-V3, V1-V5, V1-V6, and V1-V9 variable regions from within the 16S rRNA gene using DNA from a synthetic mock community and natural samples collected from human feces, mouse feces, and soil. The mock community allowed us to assess the actual sequencing error rate and how that error rate changed when different curation methods were applied. We developed a simple method based on sequence characteristics and quality scores to reduce the observed error rate for the V1-V9 region from 0.69 to 0.027%. This error rate is comparable to what has been observed for the shorter reads generated by 454 and Illumina’s MiSeq sequencing platforms. Although the per base sequencing cost is still significantly more than that of MiSeq, the prospect of supplementing reference databases with full-length sequences from organisms below the limit of detection from the Sanger approach is exciting.


September 22, 2019

Initial colonization, community assembly and ecosystem function: fungal colonist traits and litter biochemistry mediate decay rate.

Priority effects are an important ecological force shaping biotic communities and ecosystem processes, in which the establishment of early colonists alters the colonization success of later-arriving organisms via competitive exclusion and habitat modification. However, we do not understand which biotic and abiotic conditions lead to strong priority effects and lasting historical contingencies. Using saprotrophic fungi in a model leaf decomposition system, we investigated whether compositional and functional consequences of initial colonization were dependent on initial colonizer traits, resource availability or a combination thereof. To test these ideas, we factorially manipulated leaf litter biochemistry and initial fungal colonist identity, quantifying subsequent community composition, using neutral genetic markers, and community functional characteristics, including enzyme potential and leaf decay rates. During the first 3 months, initial colonist respiration rate and physiological capacity to degrade plant detritus were significant determinants of fungal community composition and leaf decay, indicating that rapid growth and lignolytic potential of early colonists contributed to altered trajectories of community assembly. Further, initial colonization on oak leaves generated increasingly divergent trajectories of fungal community composition and enzyme potential, indicating stronger initial colonizer effects on energy-poor substrates. Together, these observations provide evidence that initial colonization effects, and subsequent consequences on litter decay, are dependent upon substrate biochemistry and physiological traits within a regional species pool. Because microbial decay of plant detritus is important to global C storage, our results demonstrate that understanding the mechanisms by which initial conditions alter priority effects during community assembly may be key to understanding the drivers of ecosystem-level processes. © 2015 John Wiley & Sons Ltd.


September 22, 2019

Next generation multilocus sequence typing (NGMLST) and the analytical software program MLSTEZ enable efficient, cost-effective, high-throughput, multilocus sequencing typing.

Multilocus sequence typing (MLST) has become the preferred method for genotyping many biological species, and it is especially useful for analyzing haploid eukaryotes. MLST is rigorous, reproducible, and informative, and MLST genotyping has been shown to identify major phylogenetic clades, molecular groups, or subpopulations of a species, as well as individual strains or clones. MLST molecular types often correlate with important phenotypes. Conventional MLST involves the extraction of genomic DNA and the amplification by PCR of several conserved, unlinked gene sequences from a sample of isolates of the taxon under investigation. In some cases, as few as three loci are sufficient to yield definitive results. The amplicons are sequenced, aligned, and compared by phylogenetic methods to distinguish statistically significant differences among individuals and clades. Although MLST is simpler, faster, and less expensive than whole genome sequencing, it is more costly and time-consuming than less reliable genotyping methods (e.g. amplified fragment length polymorphisms). Here, we describe a new MLST method that uses next-generation sequencing, a multiplexing protocol, and appropriate analytical software to provide accurate, rapid, and economical MLST genotyping of 96 or more isolates in single assay. We demonstrate this methodology by genotyping isolates of the well-characterized, human pathogenic yeast Cryptococcus neoformans. Copyright © 2015 The Authors. Published by Elsevier Inc. All rights reserved.


September 22, 2019

The features of mucosa-associated microbiota in primary sclerosing cholangitis.

Little is known about the role of the microbiome in primary sclerosing cholangitis.To explore the mucosa-associated microbiota in primary sclerosing cholangitis (PSC) patients across different locations in the gut, and to compare it with inflammatory bowel disease (IBD)-only patients and healthy controls.Biopsies from the terminal ileum, right colon, and left colon were collected from patients and healthy controls undergoing colonoscopy. Microbiota profiling using bacterial 16S rRNA sequencing was performed on all biopsies.Forty-four patients were recruited: 20 with PSC (19 with PSC-IBD and one with PSC-only), 15 with IBD-only and nine healthy controls. The overall microbiome profile was similar throughout different locations in the gut. No differences in the global microbiome profile were found. However, we observed significant PSC-associated enrichment in Barnesiellaceae at the family level, and in Blautia and an unidentified Barnesiellaceae at the genus level. At the operational taxa unit level, most shifts in PSC were observed in Clostridiales and Bacteroidales orders, with approximately 86% of shifts occurring within the former order.The overall microbiota profile was similar across multiple locations in the gut from the same individual regardless of disease status. In this study, the mucosa associated-microbiota of patients with primary sclerosing cholangitis was characterised by enrichment of Blautia and Barnesiellaceae and by major shifts in operational taxa units within Clostridiales order.© 2016 John Wiley & Sons Ltd.


September 22, 2019

Complex effects of mammalian grazing on extramatrical mycelial biomass in the Scandes forest-tundra ecotone.

Mycorrhizal associations are widespread in high-latitude ecosystems and are potentially of great importance for global carbon dynamics. Although large herbivores play a key part in shaping subarctic plant communities, their impact on mycorrhizal dynamics is largely unknown. We measured extramatrical mycelial (EMM) biomass during one growing season in 16-year-old herbivore exclosures and unenclosed control plots (ambient), at three mountain birch forests and two shrub heath sites, in the Scandes forest-tundra ecotone. We also used high-throughput amplicon sequencing for taxonomic identification to investigate differences in fungal species composition. At the birch forest sites, EMM biomass was significantly higher in exclosures (1.36 ± 0.43 g C/m2) than in ambient conditions (0.66 ± 0.17 g C/m2) and was positively influenced by soil thawing degree-days. At the shrub heath sites, there was no significant effect on EMM biomass (exclosures: 0.72 ± 0.09 g C/m2; ambient plots: 1.43 ± 0.94). However, EMM biomass was negatively related toBetula nanaabundance, which was greater in exclosures, suggesting that grazing affected EMM biomass positively. We found no significant treatment effects on fungal diversity but the most abundant ectomycorrhizal lineage/cortinarius, showed a near-significant positive effect of herbivore exclusion (p = .08), indicating that herbivory also affects fungal community composition. These results suggest that herbivory can influence fungal biomass in highly context-dependent ways in subarctic ecosystems. Considering the importance of root-associated fungi for ecosystem carbon balance, these findings could have far-reaching implications.


September 22, 2019

Transcriptome-referenced association study of clove shape traits in garlic.

Genome-wide association studies are a powerful approach for identifying genes related to complex traits in organisms, but are limited by the requirement for a reference genome sequence of the species under study. To circumvent this problem, we propose a transcriptome-referenced association study (TRAS) that utilizes a transcriptome generated by single-molecule long-read sequencing as a reference sequence to score population variation at both transcript sequence and expression levels. Candidate transcripts are identified when both scores are associated with a trait and their potential interactions are ascertained by expression quantitative trait loci analysis. Applying this method to characterize garlic clove shape traits in 102 landraces, we identified 22 candidate transcripts, most of which showed extensive interactions. Eight transcripts were long non-coding RNAs (lncRNAs), and the others were proteins involved mainly in carbohydrate metabolism, protein degradation, etc. TRAS, as an efficient tool for association study independent of a reference genome, extends the applicability of association studies to a broad range of species.


Talk with an expert

If you have a question, need to check the status of an order, or are interested in purchasing an instrument, we're here to help.