September 22, 2019

Association of gene expression with biomass content and composition in sugarcane.

About 64% of the total aboveground biomass in sugarcane production is from the culm, of which ~90% is present in fiber and sugars. Understanding the transcriptome in the sugarcane culm, and the transcripts that are associated with the accumulation of the sugar and fiber components, would facilitate the modification of biomass composition for enhanced biofuel and biomaterial production. The Sugarcane Iso-Seq Transcriptome (SUGIT) database was used as a reference for RNA-Seq analysis of variation in gene expression between young and mature tissues, and between 10 genotypes with varying fiber content. Global expression analysis suggests that each genotype displayed a unique expression pattern, possibly due to different chromosome combinations and maturation amongst these genotypes. Apart from direct sugar- and fiber-related transcripts, the differentially expressed (DE) transcripts in this study belonged to various supporting pathways that are not obviously involved in the accumulation of these major biomass components. The analysis revealed 1,649 DE transcripts between the young and mature tissues, while 555 DE transcripts were found between the low and high fiber genotypes. Of these, 151 and 23 transcripts, respectively, were directly involved in sugar and fiber accumulation. Most of the transcripts identified were up-regulated in the young tissues (2- to 22-fold, FDR-adjusted p-value <0.05), which could be explained by the more active metabolism in the young tissues compared to the mature tissues in the sugarcane culm. The results of the analysis of the contrasting genotypes suggest that, due to the large number of genes contributing to these traits, some of the critical DE transcripts could display less than 2-fold differences in expression and might not be easily identified. However, this transcript profiling analysis identified full-length candidate transcripts and pathways that were likely to determine the differences in sugar and fiber accumulation between tissue types and contrasting genotypes.


September 22, 2019

Assessing the gene content of the megagenome: sugar pine (Pinus lambertiana).

Sugar pine (Pinus lambertiana Douglas) is within the subgenus Strobus with an estimated genome size of 31 Gbp. Transcriptomic resources are of particular interest in conifers due to the challenges presented in their megagenomes for gene identification. In this study, we present the first comprehensive survey of the P. lambertiana transcriptome through deep sequencing of a variety of tissue types to generate more than 2.5 billion short reads. Third-generation long reads generated through PacBio Iso-Seq have been included for the first time in conifers to combat the challenges associated with de novo transcriptome assembly. A technology comparison is provided here to contribute to the otherwise scarce comparisons of 2nd and 3rd generation transcriptome sequencing approaches in plant species. In addition, the transcriptome reference was essential for gene model identification and quality assessment in the parallel project responsible for sequencing and assembly of the entire genome. In this study, the transcriptomic data were also used to address some of the questions surrounding lineage-specific Dicer-like proteins in conifers. These proteins play a role in the control of transposable element proliferation and the related genome expansion in conifers. Copyright © 2016 Author et al.


September 22, 2019

Species groups distributed across elevational gradients reveal convergent and continuous genetic adaptation to high elevations.

Although many cases of genetic adaptations to high elevations have been reported, the processes driving these modifications and the pace of their evolution remain unclear. Many high-elevation adaptations (HEAs) are thought to have arisen in situ as populations rose with growing mountains. In contrast, most high-elevation lineages of the Qinghai-Tibetan Plateau appear to have colonized from low-elevation areas. These lineages provide an opportunity for studying recent HEAs and comparing them with ancestral low-elevation alternatives. Herein, we compare four frogs (three species of Nanorana and a close lowland relative) and four lizards (Phrynocephalus) that inhabit a range of elevations on or along the slopes of the Qinghai-Tibetan Plateau. The sequential cladogenesis of these species across an elevational gradient allows us to examine the gradual accumulation of HEA at increasing elevations. Many adaptations to high elevations appear to arise gradually and evolve continuously with increasing elevational distributions. Numerous related functions, especially DNA repair and energy metabolism pathways, exhibit rapid change and continuous positive selection with increasing elevations. Although the two studied genera are distantly related, they exhibit numerous convergent evolutionary changes, especially at the functional level. This functional convergence appears to be more extensive than convergence at the individual gene level, although we found 32 homologous genes undergoing positive selection for change in both high-elevation groups. We argue that species groups distributed along a broad elevational gradient provide a more powerful system for testing adaptations to high-elevation environments compared with studies that compare only pairs of high-elevation versus low-elevation species.


September 22, 2019

MCF-7 breast cancer cell line PacBio generated transcriptome has ~300 novel transcribed regions, un-annotated in both RefSeq and GENCODE, and absent in the liver, heart and brain transcriptomes

Illuminating the “dark” regions of the human genome remains an ongoing effort, a decade and a half after the human genome was sequenced, with RefSeq and GENCODE being two of the major annotation databases. Pacific Biosciences (PacBio) has provided open access to the transcriptome of MCF-7, a breast cancer cell line that has provided significant therapeutic advancement in breast cancer research since the 1970s. PacBio sequencing generates much longer reads compared to second-generation sequencing technologies, with a trade-off of lower throughput, higher error rate and more cost per base. Here, this transcriptome was analyzed using the YeATS pipeline, with additionally introduced k-mer-based algorithms, reducing computational times to a few hours on a simple workstation. Out of ~300 transcripts that have no match in either RefSeq or GENCODE, ~250 are absent in the transcriptomes of the heart, liver and brain, also provided by PacBio. Also, ~200 transcripts are absent in a recent catalogue of un-annotated long non-coding RNAs from 6,503 samples (~43 Terabases of sequence data) [1], and only two are present in common with an experimental workflow, RACE-Seq, that reported 2,556 novel transcripts [2]. ~100 transcripts have >100 amino acid open reading frames, and have the potential of being protein coding genes. ORF-based annotation also identified a few bacterial transcripts in the PacBio database mapped to the human genome, and one human transcript that has been annotated as bacterial in the NCBI database. The current work reiterates the under-utilization of transcriptomes for annotating genomes. It also provides new leads for investigating breast cancer by virtue of transcripts that are not expressed in other tissues, which are prospective breast cancer biomarkers pending further investigation.
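The ">100 amino acid open reading frame" criterion mentioned in this abstract can be illustrated with a minimal sketch. This is not the YeATS implementation: the function names are hypothetical, only the three forward reading frames are scanned, and an ORF is assumed to run from the first ATG to the next in-frame stop codon.

```python
# Illustrative sketch only; names and simplifications are assumptions,
# not taken from the YeATS pipeline.
STOP_CODONS = {"TAA", "TAG", "TGA"}


def longest_orf_codons(seq: str) -> int:
    """Length in codons (stop excluded) of the longest ATG-to-stop ORF
    found in the three forward reading frames of seq."""
    seq = seq.upper()
    best = 0
    for frame in range(3):
        start = None  # position of the ATG that opened the current ORF
        for i in range(frame, len(seq) - 2, 3):
            codon = seq[i:i + 3]
            if start is None and codon == "ATG":
                start = i
            elif start is not None and codon in STOP_CODONS:
                best = max(best, (i - start) // 3)
                start = None  # look for the next ORF in this frame
    return best


def may_be_protein_coding(seq: str, min_aa: int = 100) -> bool:
    """True if the longest forward-frame ORF exceeds min_aa amino acids."""
    return longest_orf_codons(seq) > min_aa
```

A fuller treatment would also scan the reverse complement and handle ambiguous bases; those steps are omitted here to keep the criterion itself visible.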


September 22, 2019

Effects of low crude oil chronic exposure on the northern krill (Meganyctiphanes norvegica)

Chronic oil pollution related to gas and oil drilling activities is increasing in the sea due to the rising offshore petroleum industry activity. Among marine organisms, zooplankton play a crucial role in the marine ecosystem and therefore understanding the effects of crude oil chronic exposure on zooplankton is needed to determine the impact of oil in marine environments. The present study reports on the effect of crude oil on adult northern krill, Meganyctiphanes norvegica, collected during three seasons. Their sensitivity to oil was examined with oil concentrations of 0.01 versus 0.1 mg oil L⁻¹ and photo-modified oil in flowing seawater maintained in the dark for 2 weeks at in situ temperature. Oil (polycyclic aromatic hydrocarbons, PAHs) entered the krill (on average, 350 and 4400 µg·kg⁻¹ wet weight in low and medium oil treatments, respectively) and a larger fraction of the krill exhibited digestive gland pathologies (enhanced apoptosis and pathology of digestive tubules) in oil treatments (27–80%) compared to a significantly lower fraction (7–13%) in treatments that received no oil. However, 2-week oil exposure at these concentrations did not significantly decrease survivorship or impair basic functioning such as feeding and respiration rates. Similarly, there were only limited changes in the transcription of 7 selected genes from head tissue. Additionally, although there was significant seasonal variation in krill total lipid content and fatty acid composition, there was no treatment effect on either of these parameters, which suggests limited oxidative stress under experimental conditions. Furthermore, there was no significant treatment effect on two direct measures of oxidative stress (MDA: malondialdehyde and AOPP: advanced oxidation protein products) in any of the seasons. Nevertheless, histology clearly revealed enhanced digestive gland pathologies in krill even at low concentrations. Although krill with such pathologies continue to survive, their accumulation of PAHs may be transferred up the food chain, impacting their predators and the wider ecosystem.


September 22, 2019

GMAP and GSNAP for genomic sequence alignment: enhancements to speed, accuracy, and functionality.

The programs GMAP and GSNAP, for aligning RNA-Seq and DNA-Seq datasets to genomes, have evolved along with advances in biological methodology to handle longer reads, larger volumes of data, and new types of biological assays. The genomic representation has been improved to include linear genomes that can compare sequences using single-instruction multiple-data (SIMD) instructions, compressed genomic hash tables with fast access using SIMD instructions, handling of large genomes with more than four billion bp, and enhanced suffix arrays (ESAs) with novel data structures for fast access. Improvements to the algorithms have included a greedy match-and-extend algorithm using suffix arrays, segment chaining using genomic hash tables, diagonalization using segmental hash tables, and nucleotide-level dynamic programming procedures that use SIMD instructions and eliminate the need for F-loop calculations. Enhancements to the functionality of the programs include standardization of indel positions, handling of ambiguous splicing, clipping and merging of overlapping paired-end reads, and alignments to circular chromosomes and alternate scaffolds. The programs have been adapted for use in pipelines by integrating their usage into R/Bioconductor packages such as gmapR and HTSeqGenie, and these pipelines have facilitated the discovery of numerous biological phenomena.


September 22, 2019

Full-length transcriptome of Misgurnus anguillicaudatus provides insights into evolution of genus Misgurnus.

Reconstruction and annotation of transcripts, particularly for a species without a reference genome, plays a critical role in gene discovery, investigation of genomic signatures, and genome annotation in the pre-genomic era. This study generated 33,330 full-length transcripts of diploid M. anguillicaudatus using PacBio SMRT Sequencing. A total of 6,918 gene families were identified with two or more isoforms, and 26,683 complete ORFs with an average length of 1,497 bp were detected. In total, 1,208 high-confidence lncRNAs were identified, and most of these appeared to be precursor transcripts of miRNAs or snoRNAs. A phylogenetic tree of the Misgurnus species was inferred based on the 1,905 single-copy orthologous genes. The tetraploid and diploid M. anguillicaudatus grouped into a clade, and M. bipartitus showed a closer relationship with M. anguillicaudatus. The overall evolutionary rates of tetraploid M. anguillicaudatus were significantly higher than those of other Misgurnus species. Meanwhile, 28 positively selected genes were identified in the M. anguillicaudatus clade. These positively selected genes may play critical roles in the adaptation of M. anguillicaudatus to various habitat environments. This study could facilitate further exploration of the genomic signatures of M. anguillicaudatus and provide potential insights into the evolutionary history of tetraploid loach.


September 22, 2019

Molecular mechanisms of acclimatization to phosphorus starvation and recovery underlying full-length transcriptome profiling in barley (Hordeum vulgare L.).

A lack of phosphorus (P) in plants can severely constrain growth and development. Barley, one of the earliest domesticated crops, is extensively planted in poor soil around the world. To date, the molecular mechanisms of enduring low phosphorus, at the transcriptional level, in barley are still unclear. In the present study, two different barley genotypes (GN121 and GN42) with contrasting phosphorus efficiency were used to reveal adaptations to low phosphorus stress, at three time points, at the morphological, physiological, biochemical, and transcriptome levels. GN121 growth was less affected by phosphorus starvation and recovery than that of GN42. The biomass and inorganic phosphorus concentration of GN121 and GN42 declined under the low phosphorus-induced stress and increased after recovery with normal phosphorus. However, the range of these parameters was higher in GN42 than in GN121. Subsequently, a more complete genome annotation was obtained by correcting with the data sequenced on the Illumina HiSeq X 10 and PacBio RSII SMRT platforms. A total of 6,182 and 5,270 differentially expressed genes (DEGs) were identified in GN121 and GN42, respectively. The majority of these DEGs were involved in phosphorus metabolism such as phospholipid degradation, hydrolysis of phosphoric enzymes, sucrose synthesis, phosphorylation/dephosphorylation and post-transcriptional regulation; expression of these genes was significantly different between GN121 and GN42. Specifically, six and seven DEGs were annotated as phosphorus transporters in roots and leaves, respectively. Furthermore, a putative model was constructed relying on key metabolic pathways related to phosphorus to illustrate the higher phosphorus efficiency of GN121 compared to GN42 under low phosphorus conditions. Results from this study provide a multi-transcriptome database and candidate genes for further study on phosphorus use efficiency (PUE).


September 22, 2019

How far can mitochondrial DNA drive the disease?

Mitochondria are among the dominant drivers of cellular energy production to meet a large number of biological functions, and the mitochondrial DNA (mtDNA) is the control center of the energetic driving force and the dominant driver of mitochondrial molecular diversification. mtDNA transcription generates the necessary RNAs to regulate the extent and nature of mtRNA post-transcriptional modifications and the activity of nucleus-encoded enzymes. With a special focus on mtDNA, the current volume aims to overview the biology and structures of mtDNA, the regulatory roles of mtDNA in lung diseases, and the involvement of mtDNA in metabolism. We explore the significance of mtDNA sequencing, methylation, stability, and mutation in the pathogenesis of the diseases. Molecular mechanisms by which mtDNA contributes to the regulation of mitochondrial homeostasis and drug resistance are also discussed. We also point out the importance of the mitochondrial ribosome, single-cell biology, and gene editing in the understanding of the development of mitochondrial dysfunction in lung disease.


September 22, 2019

Transcriptome analysis of distinct cold tolerance strategies in the rubber tree (Hevea brasiliensis)

Natural rubber is an indispensable commodity used in approximately 40,000 products and is fundamental to the tire industry. Among the species that produce latex, the rubber tree [Hevea brasiliensis (Willd. ex Adr. de Juss.) Muell-Arg.], a species native to the Amazon rainforest, is the major producer of latex used worldwide. The Amazon Basin presents optimal conditions for rubber tree growth, but the occurrence of South American leaf blight, which is caused by the fungus Microcyclus ulei (P. Henn) v. Arx, limits rubber tree production. Currently, rubber tree plantations are located in escape regions that exhibit suboptimal conditions such as high winds and cold temperatures. Rubber tree breeding programs aim to identify clones that are adapted to these stress conditions. However, rubber tree breeding is time-consuming, taking more than 20 years to develop a new variety. It is also expensive and requires large field areas. Thus, genetic studies could optimize field evaluations, thereby reducing the time and area required for these experiments. Transcriptome sequencing using next-generation sequencing (RNA-seq) is a powerful tool to identify a full set of transcripts and to evaluate gene expression in model and non-model species. In this study, we constructed a comprehensive transcriptome to evaluate the cold response strategies of the RRIM600 (cold-resistant) and GT1 (cold-tolerant) genotypes. Furthermore, we identified putative microsatellite (SSR) and single-nucleotide polymorphism (SNP) markers. Alternative splicing, which is an important mechanism for plant adaptation under abiotic stress, was further identified, providing an important database for further studies of cold tolerance.


September 22, 2019

Transcriptome comparative analysis of salt stress responsiveness in chrysanthemum (Dendranthema grandiflorum) roots by Illumina- and Single-Molecule Real-Time-based RNA sequencing.

Salt response has long been considered a polygenic-controlled character in plants. Under salt stress conditions, plants respond by activating a large number of proteins and enzymes. To develop a better understanding of the molecular mechanism and screen salt-responsive genes in chrysanthemum under salt stress, we performed RNA sequencing (RNA-seq) on both salt-processed chrysanthemum seedling roots and the control group, and ultimately gathered six cDNA databases. Moreover, to overcome the read-length limitation of Illumina HiSeq technology and improve the quality and accuracy of the result, we combined Illumina HiSeq with single-molecule real-time sequencing (SMRT-seq) to decode the full-length transcripts. As a result, we successfully collected 550,823 unigenes, from which we selected 48,396 differentially expressed genes (DEGs). Many of these DEGs were associated with the signal transduction, biofilm system, antioxidant system, and osmotic regulation system, such as mitogen-activated protein kinase (MAPK), Acyl-CoA thioesterase (ACOT), superoxide dismutase (SOD), catalase (CAT), peroxisomal membrane protein (PMP), and pyrroline-5-carboxylate reductase (P5CR). Quantitative real-time polymerase chain reaction (qRT-PCR) analysis of 15 unigenes was performed to test the data validity. The results were highly consistent with the RNA-seq results. In all, these findings could facilitate further detection of the responsive molecular mechanism under salt stress. They also provide more accurate candidate genes for genetic engineering of salt-tolerant chrysanthemums.


September 22, 2019

Melanization of mycorrhizal fungal necromass structures microbial decomposer communities

Mycorrhizal fungal necromass is increasingly recognized as an important contributor to soil organic carbon pools, particularly in forest ecosystems. While its decomposition rate is primarily determined by biochemical composition, how traits such as melanin content affect the structure of necromass decomposer communities remains poorly understood. To assess the role of biochemical traits on microbial decomposer community composition and functioning, we incubated melanized and non-melanized necromass of the mycorrhizal fungus Meliniomyces bicolor in Pinus- and Quercus-dominated forests in Minnesota, USA and then assessed the associated fungal and bacterial decomposer communities after 1, 2 and 3 months using high-throughput sequencing. Melanized necromass decomposed significantly slower than non-melanized necromass in both forests. The structure of the microbial decomposer communities depended significantly on necromass melanin content, although the effect was stronger for fungi than bacteria. On non-melanized necromass, fungal communities were dominated by r-selected ascomycete and mucoromycete microfungi early and then replaced by basidiomycete ectomycorrhizal fungi, while on melanized necromass these groups were co-dominant throughout the incubation. Bacterial communities were dominated by both specialist mycophagous and generalist taxa. Synthesis. Our results indicate that necromass biochemistry not only strongly affects rates of decomposition but also the structure of the associated decomposer communities. Furthermore, the observed colonization patterns suggest that fungi, and particularly ectomycorrhizal fungi, may play a more important role in necromass decomposition than previously recognized.


September 22, 2019

Single-cell isoform RNA sequencing characterizes isoforms in thousands of cerebellar cells.

Full-length RNA sequencing (RNA-Seq) has been applied to bulk tissue, cell lines and sorted cells to characterize transcriptomes, but applying this technology to single cells has proven to be difficult, with less than ten single-cell transcriptomes having been analyzed thus far. Although single splicing events have been described for ≤200 single cells with statistical confidence, full-length mRNA analyses for hundreds of cells have not been reported. Single-cell short-read 3′ sequencing enables the identification of cellular subtypes, but full-length mRNA isoforms for these cell types cannot be profiled. We developed a method that starts with bulk tissue and identifies single-cell types and their full-length RNA isoforms without fluorescence-activated cell sorting. Using single-cell isoform RNA-Seq (ScISOr-Seq), we identified RNA isoforms in neurons, astrocytes, microglia, and cell subtypes such as Purkinje and Granule cells, and cell-type-specific combination patterns of distant splice sites. We used ScISOr-Seq to improve genome annotation in mouse Gencode version 10 by determining the cell-type-specific expression of 18,173 known and 16,872 novel isoforms.


September 22, 2019

Enigmatic Diphyllatea eukaryotes: culturing and targeted PacBio RS amplicon sequencing reveals a higher order taxonomic diversity and global distribution.

The class Diphyllatea belongs to a group of enigmatic unicellular eukaryotes that play a key role in reconstructing the morphological innovation and diversification of early eukaryotic evolution. Despite its evolutionary significance, very little is known about the phylogeny and species diversity of Diphyllatea. Only three species have described morphology, being taxonomically divided by flagella number, two or four, and cell size. Currently, one 18S rRNA Diphyllatea sequence is available, with environmental sequencing surveys reporting only a single partial sequence from a Diphyllatea-like organism. Accordingly, geographical distribution of Diphyllatea based on molecular data is limited, despite morphological data suggesting the class has a global distribution. We here present a first attempt to understand species distribution, diversity and higher order structure of Diphyllatea. We cultured 11 new strains, characterised these morphologically and amplified their rRNA for a combined 18S-28S rRNA phylogeny. We sampled environmental DNA from multiple sites and designed new Diphyllatea-specific PCR primers for long-read PacBio RSII technology. Near full-length 18S rRNA sequences from environmental DNA, in addition to supplementary Diphyllatea sequence data mined from public databases, resolved the phylogeny into three deeply branching and distinct clades (Diphy I – III). Of these, the Diphy III clade is entirely novel, and in congruence with Diphy II, composed of species morphologically consistent with the earlier described Collodictyon triciliatum. The phylogenetic split between the Diphy I and Diphy II + III clades corresponds with a morphological division of Diphyllatea into bi- and quadriflagellate cell forms. This altered flagella composition must have occurred early in the diversification of Diphyllatea and may represent one of the earliest known morphological transitions among eukaryotes. Further, the substantial increase in molecular data presented here confirms Diphyllatea has a global distribution, seemingly restricted to freshwater habitats. Altogether, the results reveal the advantage of combining a group-specific PCR approach and long-read high-throughput amplicon sequencing in surveying enigmatic eukaryote lineages. Lastly, our study shows the capacity of PacBio RS when targeting a protist class for increasing phylogenetic resolution.


September 22, 2019

Multi-platform assessment of transcriptome profiling using RNA-seq in the ABRF next-generation sequencing study.

High-throughput RNA sequencing (RNA-seq) greatly expands the potential for genomics discoveries, but the wide variety of platforms, protocols and performance capabilities has created the need for comprehensive reference data. Here we describe the Association of Biomolecular Resource Facilities next-generation sequencing (ABRF-NGS) study on RNA-seq. We carried out replicate experiments across 15 laboratory sites using reference RNA standards to test four protocols (poly-A-selected, ribo-depleted, size-selected and degraded) on five sequencing platforms (Illumina HiSeq, Life Technologies PGM and Proton, Pacific Biosciences RS and Roche 454). The results show high intraplatform (Spearman rank R > 0.86) and inter-platform (R > 0.83) concordance for expression measures across the deep-count platforms, but highly variable efficiency and cost for splice junction and variant detection between all platforms. For intact RNA, gene expression profiles from rRNA-depletion and poly-A enrichment are similar. In addition, rRNA depletion enables effective analysis of degraded RNA samples. This study provides a broad foundation for cross-platform standardization, evaluation and improvement of RNA-seq.

