Menu
September 22, 2019

Transcriptome profiling using Illumina- and SMRT-based RNA-seq of hot pepper for in-depth understanding of genes involved in CMV infection.

Hot pepper (Capsicum annuum L.) is becoming an increasingly important vegetable crop in the world. Cucumber mosaic virus (CMV) is a destructive virus that can cause leaf distortion and fruit lesions, affecting pepper production. However, studies on the response to CMV infection in pepper at the transcriptional level are limited. In this study, the transcript profiles of pepper leaves after CMV infection were investigated using Illumina and single-molecule real-time (SMRT) RNA-sequencing (RNA-seq). A total of 2143 differentially expressed genes (DEGs) were identified at five different stages. Gene ontology (GO) and KEGG analysis revealed that these DEGs were involved in the response to stress, defense response and plant-pathogen interaction pathways. Among these DEGs, several key genes that consistently appeared in studies of plant-pathogen interactions had increased transcript abundance after inoculation, including chitinase, pathogenesis-related (PR) protein, TMV resistance protein, WRKY transcription factor and jasmonate ZIM-domain protein. Four of these DEGs were further validated by quantitative real-time RT-PCR (qRT-PCR). Furthermore, a total of 73, 597 alternative splicing (AS) events were identified in the pepper leaves after CMV infection, distributed in 12, 615 genes. The intron retention of WRKY33 (Capana09g001251) might be involved in the regulation of CMV infection. Taken together, our study provides a transcriptome-wide insight into the molecular basis of resistance to CMV infection in pepper leaves and potential candidate genes for improving resistance cultivars. Copyright © 2018 Elsevier B.V. All rights reserved.


September 22, 2019

De novo transcriptome assembly of the Chinese pearl barley, adlay, by full-length isoform and short-read RNA sequencing.

Adlay (Coix lacryma-jobi) is a tropical grass that has long been used in traditional Chinese medicine and is known for its nutritional benefits. Recent studies have shown that vitamin E compounds in adlay protect against chronic diseases such as cancer and heart disease. However, the molecular basis of adlay’s health benefits remains unknown. Here, we generated adlay gene sets by de novo transcriptome assembly using long-read isoform sequencing (Iso-Seq) and short-read RNA-Sequencing (RNA-Seq). The gene sets obtained from Iso-seq and RNA-seq contained 31,177 genes and 57,901 genes, respectively. We confirmed the validity of the assembled gene sets by experimentally analyzing the levels of prolamin and vitamin E biosynthesis-associated proteins in adlay plant tissues and seeds. We compared the screened adlay genes with known gene families from closely related plant species, such as rice, sorghum and maize. We also identified tissue-specific genes from the adlay leaf, root, and young and mature seed, and experimentally validated the differential expression of 12 randomly-selected genes. Our study of the adlay transcriptome will provide a valuable resource for genetic studies that can enhance adlay breeding programs in the future.


September 22, 2019

Targeted sequencing of ampliconic gene transcripts from total human male testis RNA

The protocol summarizes all the steps towards the construction of PacBio and Illumina sequencing libraries of transcripts from nine Y chromosome ampliconic gene families starting from total human male testis RNA from two human males. This method is based on PacBio’s Isoform Sequencing method (Iso-Seq) using the Clontech SMARTer PCR cDNA Synthesis Kit and No Size Selection with some modifications.


September 22, 2019

Transcriptome sequencing and comparative analysis of differentially-expressed isoforms in the roots of Halogeton glomeratus under salt stress.

Although Halogeton glomeratus (H. glomeratus) has been confirmed to have a unique mechanism to regulate Na+efflux from the cytoplasm and compartmentalize Na+into leaf vacuoles, little is known about the salt tolerance mechanisms of roots under salinity stress. In the present study, transcripts were sequenced using the BGISEQ-500 sequencing platform (BGI, Wuhan, China). After quality control, approximately 24.08 million clean reads were obtained and the average mapping ratio to the reference gene was 70.00%. When comparing salt-treated samples with the control, a total of 550, 590, 1411 and 2063 DEIs were identified at 2, 6, 24 and 72h, respectively. Numerous differentially-expressed isoforms that play important roles in response and adaptation to salt condition are related to metabolic processes, cellular processes, single-organism processes, localization, biological regulation, responses to stimulus, binding, catalytic activity and transporter activity. Fifty-eight salt-induced isoforms were common to different stages of salt stress; most of these DEIs were related to signal transduction and transporters, which maybe the core isoforms regulating Na+uptake and transport in the roots of H. glomeratus. The expression patterns of 18 DEIs that were detected by quantitative real-time polymerase chain reaction were consistent with their respective changes in transcript abundance as identified by RNA-Seq technology. The present study thoroughly explored potential isoforms involved in salt tolerance on H. glomeratus roots at five time points. Our results may serve as an important resource for the H. glomeratus research community, improving our understanding of salt tolerance in halophyte survival under high salinity stress. Copyright © 2018 Elsevier B.V. All rights reserved.


September 22, 2019

High-resolution phylogenetic microbial community profiling.

Over the past decade, high-throughput short-read 16S rRNA gene amplicon sequencing has eclipsed clone-dependent long-read Sanger sequencing for microbial community profiling. The transition to new technologies has provided more quantitative information at the expense of taxonomic resolution with implications for inferring metabolic traits in various ecosystems. We applied single-molecule real-time sequencing for microbial community profiling, generating full-length 16S rRNA gene sequences at high throughput, which we propose to name PhyloTags. We benchmarked and validated this approach using a defined microbial community. When further applied to samples from the water column of meromictic Sakinaw Lake, we show that while community structures at the phylum level are comparable between PhyloTags and Illumina V4 16S rRNA gene sequences (iTags), variance increases with community complexity at greater water depths. PhyloTags moreover allowed less ambiguous classification. Last, a platform-independent comparison of PhyloTags and in silico generated partial 16S rRNA gene sequences demonstrated significant differences in community structure and phylogenetic resolution across multiple taxonomic levels, including a severe underestimation in the abundance of specific microbial genera involved in nitrogen and methane cycling across the Lake’s water column. Thus, PhyloTags provide a reliable adjunct or alternative to cost-effective iTags, enabling more accurate phylogenetic resolution of microbial communities and predictions on their metabolic potential.


September 22, 2019

Single-cell multiomics: multiple measurements from single cells.

Single-cell sequencing provides information that is not confounded by genotypic or phenotypic heterogeneity of bulk samples. Sequencing of one molecular type (RNA, methylated DNA or open chromatin) in a single cell, furthermore, provides insights into the cell’s phenotype and links to its genotype. Nevertheless, only by taking measurements of these phenotypes and genotypes from the same single cells can such inferences be made unambiguously. In this review, we survey the first experimental approaches that assay, in parallel, multiple molecular types from the same single cell, before considering the challenges and opportunities afforded by these and future technologies. Copyright © 2016. Published by Elsevier Ltd.


September 22, 2019

Genome-wide analysis of complex wheat gliadins, the dominant carriers of celiac disease epitopes.

Gliadins, specified by six compound chromosomal loci (Gli-A1/B1/D1 and Gli-A2/B2/D2) in hexaploid bread wheat, are the dominant carriers of celiac disease (CD) epitopes. Because of their complexity, genome-wide characterization of gliadins is a strong challenge. Here, we approached this challenge by combining transcriptomic, proteomic and bioinformatic investigations. Through third-generation RNA sequencing, full-length transcripts were identified for 52 gliadin genes in the bread wheat cultivar Xiaoyan 81. Of them, 42 were active and predicted to encode 25 a-, 11 ?-, one d- and five ?-gliadins. Comparative proteomic analysis between Xiaoyan 81 and six newly-developed mutants each lacking one Gli locus indicated the accumulation of 38 gliadins in the mature grains. A novel group of a-gliadins (the CSTT group) was recognized to contain very few or no CD epitopes. The d-gliadins identified here or previously did not carry CD epitopes. Finally, the mutant lacking Gli-D2 showed significant reductions in the most celiac-toxic a-gliadins and derivative CD epitopes. The insights and resources generated here should aid further studies on gliadin functions in CD and the breeding of healthier wheat.


September 22, 2019

Characteristics of ARG-carrying plasmidome in the cultivable microbial community from wastewater treatment system under high oxytetracycline concentration.

Studies on antibiotic production wastewater have shown that even a single antibiotic can select for multidrug resistant bacteria in aquatic environments. It is speculated that plasmids are an important mechanism of multidrug resistance (MDR) under high concentrations of antibiotics. Herein, two metagenomic libraries were constructed with plasmid DNA extracted from cultivable microbial communities in a biological wastewater treatment reactor supplemented with 0 (CONTROL) or 25 mg/L of oxytetracycline (OTC-25). The OTC-25 plasmidome reads were assigned to 72 antibiotic resistance genes (ARGs) conferring resistance to 13 types of antibiotics. Dominant ARGs, encoding resistance to tetracycline, aminoglycoside, sulfonamide, and multidrug resistance genes, were enriched in the plasmidome under 25 mg/L of oxytetracycline. Furthermore, 17 contiguous multiple-ARG carrying contigs (carrying =?2 ARGs) were discovered in the OTC-25 plasmidome, whereas only nine were found in the CONTROL. Mapping of the OTC-25 plasmidome reads to completely sequenced plasmids revealed that the conjugative IncU resistance plasmid pFBAOT6 of Aeromonas caviae, carrying multidrug resistance transporter (pecM), tetracycline resistance genes (tetA, tetR), and transposase genes, might be a potential prevalent resistant plasmid in the OTC-25 plasmidome. Additionally, two novel resistant plasmids (containing contig C301682 carrying multidrug resistant operon mexCD-oprJ and contig C301632 carrying the tet36 and transposases genes) might also be potential prevalent resistant plasmids in the OTC-25 plasmidome. This study will be helpful to better understand the role of plasmids in the development of MDR in water environments under high antibiotic concentrations.


September 22, 2019

Different next generation sequencing platforms produce different microbial profiles and diversity in cystic fibrosis sputum.

Cystic fibrosis (CF) is an autosomal recessive disease characterized by recurrent lung infections. Studies of the lung microbiome have shown an association between decreasing diversity and progressive disease. 454 pyrosequencing has frequently been used to study the lung microbiome in CF, but will no longer be supported. We sought to identify the benefits and drawbacks of using two state-of-the-art next generation sequencing (NGS) platforms, MiSeq and PacBio RSII, to characterize the CF lung microbiome. Each has its advantages and limitations.Twelve samples of extracted bacterial DNA were sequenced on both MiSeq and PacBio NGS platforms. DNA was amplified for the V4 region of the 16S rRNA gene and libraries were sequenced on the MiSeq sequencing platform, while the full 16S rRNA gene was sequenced on the PacBio RSII sequencing platform. Raw FASTQ files generated by the MiSeq and PacBio platforms were processed in mothur v1.35.1.There was extreme discordance in alpha-diversity of the CF lung microbiome when using the two platforms. Because of its depth of coverage, sequencing of the 16S rRNA V4 gene region using MiSeq allowed for the observation of many more operational taxonomic units (OTUs) and higher Chao1 and Shannon indices than the PacBio RSII. Interestingly, several patients in our cohort had Escherichia, an unusual pathogen in CF. Also, likely because of its coverage of the complete 16S rRNA gene, only PacBio RSII was able to identify Burkholderia, an important CF pathogen.When comparing microbiome diversity in clinical samples from CF patients using 16S sequences, MiSeq and PacBio NGS platforms may generate different results in microbial community composition and structure. It may be necessary to use different platforms when trying to correctly identify dominant pathogens versus measuring alpha-diversity estimates, and it would be important to use the same platform for comparisons to minimize errors in interpretation. Copyright © 2016 Elsevier B.V. All rights reserved.


September 22, 2019

Global identification of the full-length transcripts and alternative splicing related to phenolic acid biosynthetic genes in Salvia miltiorrhiza.

Salvianolic acids are among the main bioactive components in Salvia miltiorrhiza, and their biosynthesis has attracted widespread interest. However, previous studies on the biosynthesis of phenolic acids using next-generation sequencing platforms are limited with regard to the assembly of full-length transcripts. Based on hybrid-seq (next-generation and single molecular real-time sequencing) of the S. miltiorrhiza root transcriptome, we experimentally identified 15 full-length transcripts and four alternative splicing events of enzyme-coding genes involved in the biosynthesis of rosmarinic acid. Moreover, we herein demonstrate that lithospermic acid B accumulates in the phloem and xylem of roots, in agreement with the expression patterns of the identified key genes related to rosmarinic acid biosynthesis. According to co-expression patterns, we predicted that six candidate cytochrome P450s and five candidate laccases participate in the salvianolic acid pathway. Our results provide a valuable resource for further investigation into the synthetic biology of phenolic acids in S. miltiorrhiza.


September 22, 2019

Minimap2: pairwise alignment for nucleotide sequences.

Recent advances in sequencing technologies promise ultra-long reads of ~100 kb in average, full-length mRNA or cDNA reads in high throughput and genomic contigs over 100 Mb in length. Existing alignment programs are unable or inefficient to process such data at scale, which presses for the development of new alignment algorithms.Minimap2 is a general-purpose alignment program to map DNA or long mRNA sequences against a large reference database. It works with accurate short reads of =100?bp in length, =1?kb genomic reads at error rate ~15%, full-length noisy Direct RNA or cDNA reads and assembly contigs or closely related full chromosomes of hundreds of megabases in length. Minimap2 does split-read alignment, employs concave gap cost for long insertions and deletions and introduces new heuristics to reduce spurious alignments. It is 3-4 times as fast as mainstream short-read mappers at comparable accuracy, and is =30 times faster than long-read genomic or cDNA mappers at higher accuracy, surpassing most aligners specialized in one type of alignment.https://github.com/lh3/minimap2.Supplementary data are available at Bioinformatics online.


September 22, 2019

Characterization of fusion genes and the significantly expressed fusion isoforms in breast cancer by hybrid sequencing.

We developed an innovative hybrid sequencing approach, IDP-fusion, to detect fusion genes, determine fusion sites and identify and quantify fusion isoforms. IDP-fusion is the first method to study gene fusion events by integrating Third Generation Sequencing long reads and Second Generation Sequencing short reads. We applied IDP-fusion to PacBio data and Illumina data from the MCF-7 breast cancer cells. Compared with the existing tools, IDP-fusion detects fusion genes at higher precision and a very low false positive rate. The results show that IDP-fusion will be useful for unraveling the complexity of multiple fusion splices and fusion isoforms within tumorigenesis-relevant fusion genes. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.


September 22, 2019

Single-cell RNAseq for the study of isoforms-how is that possible?

Single-cell RNAseq and alternative splicing studies have recently become two of the most prominent applications of RNAseq. However, the combination of both is still challenging, and few research efforts have been dedicated to the intersection between them. Cell-level insight on isoform expression is required to fully understand the biology of alternative splicing, but it is still an open question to what extent isoform expression analysis at the single-cell level is actually feasible. Here, we establish a set of four conditions that are required for a successful single-cell-level isoform study and evaluate how these conditions are met by these technologies in published research.


September 22, 2019

Long-read sequencing of human cytomegalovirus transcriptome reveals RNA isoforms carrying distinct coding potentials.

The human cytomegalovirus (HCMV) is a ubiquitous, human pathogenic herpesvirus. The complete viral genome is transcriptionally active during infection; however, a large part of its transcriptome has yet to be annotated. In this work, we applied the amplified isoform sequencing technique from Pacific Biosciences to characterize the lytic transcriptome of HCMV strain Towne varS. We developed a pipeline for transcript annotation using long-read sequencing data. We identified 248 transcriptional start sites, 116 transcriptional termination sites and 80 splicing events. Using this information, we have annotated 291 previously undescribed or only partially annotated transcript isoforms, including eight novel antisense transcripts and their isoforms, as well as a novel transcript (RS2) in the short repeat region, partially antisense to RS1. Similarly to other organisms, we discovered a high transcriptional diversity in HCMV, with many transcripts only slightly differing from one another. Comparing our transcriptome profiling results to an earlier ribosome footprint analysis, we have concluded that the majority of the transcripts contain multiple translationally active ORFs, and also that most isoforms contain unique combinations of ORFs. Based on these results, we propose that one important function of this transcriptional diversity may be to provide a regulatory mechanism at the level of translation.


September 22, 2019

Soil bacterial communities are shaped by temporal and environmental filtering: evidence from a long-term chronosequence.

Soil microbial communities are abundant, hyper-diverse and mediate global biogeochemical cycles, but we do not yet understand the processes mediating their assembly. Current hypothetical frameworks suggest temporal (e.g. dispersal limitation) and environmental (e.g. soil pH) filters shape microbial community composition; however, there is limited empirical evidence supporting this framework in the hyper-diverse soil environment, particularly at large spatial (i.e. regional to continental) and temporal (i.e. 100 to 1000 years) scales. Here, we present evidence from a long-term chronosequence (4000 years) that temporal and environmental filters do indeed shape soil bacterial community composition. Furthermore, nearly 20 years of environmental monitoring allowed us to control for potentially confounding environmental variation. Soil bacterial communities were phylogenetically distinct across the chronosequence. We determined that temporal and environmental factors accounted for significant portions of bacterial phylogenetic structure using distance-based linear models. Environmental factors together accounted for the majority of phylogenetic structure, namely, soil temperature (19%), pH (17%) and litter carbon:nitrogen (C:N; 17%). However, of all individual factors, time since deglaciation accounted for the greatest proportion of bacterial phylogenetic structure (20%). Taken together, our results provide empirical evidence that temporal and environmental filters act together to structure soil bacterial communities across large spatial and long-term temporal scales. © 2015 Society for Applied Microbiology and John Wiley & Sons Ltd.


Talk with an expert

If you have a question, need to check the status of an order, or are interested in purchasing an instrument, we're here to help.