Menu
September 22, 2019

Single-cell mRNA isoform diversity in the mouse brain.

Alternative mRNA isoform usage is an important source of protein diversity in mammalian cells. This phenomenon has been extensively studied in bulk tissues, however, it remains unclear how this diversity is reflected in single cells.Here we use long-read sequencing technology combined with unique molecular identifiers (UMIs) to reveal patterns of alternative full-length isoform expression in single cells from the mouse brain. We found a surprising amount of isoform diversity, even after applying a conservative definition of what constitutes an isoform. Genes tend to have one or a few isoforms highly expressed and a larger number of isoforms expressed at a low level. However, for many genes, nearly every sequenced mRNA molecule was unique, and many events affected coding regions suggesting previously unknown protein diversity in single cells. Exon junctions in coding regions were less prone to splicing errors than those in non-coding regions, indicating purifying selection on splice donor and acceptor efficiency.Our findings indicate that mRNA isoform diversity is an important source of biological variability also in single cells.


September 22, 2019

Genome re-annotation of the wild strawberry Fragaria vesca using extensive Illumina-and SMRT-based RNA-seq datasets

The genome of the wild diploid strawberry species Fragaria vesca, an ideal model system of cultivated strawberry (Fragaria × ananassa, octoploid) and other Rosaceae family crops, was first published in 2011 and followed by a new assembly (Fvb). However, the annotation for Fvb mainly relied on ab initio predictions and included only predicted coding sequences, therefore an improved annotation is highly desirable. Here, a new annotation version named v2.0.a2 was created for the Fvb genome by a pipeline utilizing one PacBio library, 90 Illumina RNA-seq libraries, and 9 small RNA-seq libraries. Altogether, 18,641 genes (55.6% out of 33,538 genes) were augmented with information on the 5′ and/or 3′ UTRs, 13,168 (39.3%) protein-coding genes were modified or newly identified, and 7,370 genes were found to possess alternative isoforms. In addition, 1,938 long non-coding RNAs, 171 miRNAs, and 51,714 small RNA clusters were integrated into the annotation. This new annotation of F. vesca is substantially improved in both accuracy and integrity of gene predictions, beneficial to the gene functional studies in strawberry and to the comparative genomic analysis of other horticultural crops in Rosaceae family.


September 22, 2019

Transcriptome analysis of distinct cold tolerance strategies in the rubber tree (Hevea brasiliensis)

Natural rubber is an indispensable commodity used in approximately 40,000 products and is fundamental to the tire industry. Among the species that produce latex, the rubber tree [Hevea brasiliensis (Willd. ex Adr. de Juss.) Muell-Arg.], a species native to the Amazon rainforest, is the major producer of latex used worldwide. The Amazon Basin presents optimal conditions for rubber tree growth, but the occurrence of South American leaf blight, which is caused by the fungus Microcyclus ulei (P. Henn) v. Arx, limits rubber tree production. Currently, rubber tree plantations are located in scape regions that exhibit suboptimal conditions such as high winds and cold temperatures. Rubber tree breeding programs aim to identify clones that are adapted to these stress conditions. However, rubber tree breeding is time-consuming, taking more than 20 years to develop a new variety. It is also expensive and requires large field areas. Thus, genetic studies could optimize field evaluations, thereby reducing the time and area required for these experiments. Transcriptome sequencing using next-generation sequencing (RNA-seq) is a powerful tool to identify a full set of transcripts and for evaluating gene expression in model and non-model species. In this study, we constructed a comprehensive transcriptome to evaluate the cold response strategies of the RRIM600 (cold-resistant) and GT1 (cold-tolerant) genotypes. Furthermore, we identified putative microsatellite (SSR) and single-nucleotide polymorphism (SNP) markers. Alternative splicing, which is an important mechanism for plant adaptation under abiotic stress, was further identified, providing an important database for further studies of cold tolerance.


September 22, 2019

Major histocompatibility complex haplotyping and long-amplicon allele discovery in cynomolgus macaques from Chinese breeding facilities.

Very little is currently known about the major histocompatibility complex (MHC) region of cynomolgus macaques (Macaca fascicularis; Mafa) from Chinese breeding centers. We performed comprehensive MHC class I haplotype analysis of 100 cynomolgus macaques from two different centers, with animals from different reported original geographic origins (Vietnamese, Cambodian, and Cambodian/Indonesian mixed-origin). Many of the samples were of known relation to each other (sire, dam, and progeny sets), making it possible to characterize lineage-level haplotypes in these animals. We identified 52 Mafa-A and 74 Mafa-B haplotypes in this cohort, many of which were restricted to specific sample origins. We also characterized full-length MHC class I transcripts using Pacific Biosciences (PacBio) RS II single-molecule real-time (SMRT) sequencing. This technology allows for complete read-through of unfragmented MHC class I transcripts (~1100 bp in length), so no assembly is required to unambiguously resolve novel full-length sequences. Overall, we identified 311 total full-length transcripts in a subset of 72 cynomolgus macaques from these Chinese breeding facilities; 130 of these sequences were novel and an additional 115 extended existing short database sequences to span the complete open reading frame. This significantly expands the number of Mafa-A, Mafa-B, and Mafa-I full-length alleles in the official cynomolgus macaque MHC class I database. The PacBio technique described here represents a general method for full-length allele discovery and genotyping that can be extended to other complex immune loci such as MHC class II, killer immunoglobulin-like receptors, and Fc gamma receptors.


September 22, 2019

The first whole transcriptomic exploration of pre-oviposited early chicken embryos using single and bulked embryonic RNA-sequencing.

The chicken is a valuable model organism, especially in evolutionary and embryology research because its embryonic development occurs in the egg. However, despite its scientific importance, no transcriptome data have been generated for deciphering the early developmental stages of the chicken because of practical and technical constraints in accessing pre-oviposited embryos.Here, we determine the entire transcriptome of pre-oviposited avian embryos, including oocyte, zygote, and intrauterine embryos from Eyal-giladi and Kochav stage I (EGK.I) to EGK.X collected using a noninvasive approach for the first time. We also compare RNA-sequencing data obtained using a bulked embryo sequencing and single embryo/cell sequencing technique. The raw sequencing data were preprocessed with two genome builds, Galgal4 and Galgal5, and the expression of 17,108 and 26,102 genes was quantified in the respective builds. There were some differences between the two techniques, as well as between the two genome builds, and these were affected by the emergence of long intergenic noncoding RNA annotations.The first transcriptome datasets of pre-oviposited early chicken embryos based on bulked and single embryo sequencing techniques will serve as a valuable resource for investigating early avian embryogenesis, for comparative studies among vertebrates, and for novel gene annotation in the chicken genome.


September 22, 2019

Circular RNA architecture and differentiation during leaf bud to young leaf development in tea (Camellia sinensis).

Circular RNA (circRNA) discovery, expression patterns and experimental validation in developing tea leaves indicates its correlation with circRNA-parental genes and potential roles in ceRNA interaction network. Circular RNAs (circRNAs) have recently emerged as a novel class of abundant endogenous stable RNAs produced by circularization with regulatory potential. However, identification of circRNAs in plants, especially in non-model plants with large genomes, is challenging. In this study, we undertook a systematic identification of circRNAs from different stage tissues of tea plant (Camellia sinensis) leaf development using rRNA-depleted circular RNA-seq. By combining two state-of-the-art detecting tools, we characterized 3174 circRNAs, of which 342 were shared by each approach, and thus considered high-confidence circRNAs. A few predicted circRNAs were randomly chosen, and 20 out of 24 were experimental confirmed by PCR and Sanger sequencing. Similar in other plants, tissue-specific expression was also observed for many C. sinensis circRNAs. In addition, we found that circRNA abundances were positively correlated with the mRNA transcript abundances of their parental genes. qRT-PCR validated the differential expression patterns of circRNAs between leaf bud and young leaf, which also indicated the low expression abundance of circRNAs compared to the standard mRNAs from the parental genes. We predicted the circRNA-microRNA interaction networks, and 54 of the differentially expressed circRNAs were found to have potential tea plant miRNA binding sites. The gene sets encoding circRNAs were significantly enriched in chloroplasts related GO terms and photosynthesis/metabolites biosynthesis related KEGG pathways, suggesting the candidate roles of circRNAs in photosynthetic machinery and metabolites biosynthesis during leaf development.


September 22, 2019

Interpreting microbial biosynthesis in the genomic age: Biological and practical considerations.

Genome mining has become an increasingly powerful, scalable, and economically accessible tool for the study of natural product biosynthesis and drug discovery. However, there remain important biological and practical problems that can complicate or obscure biosynthetic analysis in genomic and metagenomic sequencing projects. Here, we focus on limitations of available technology as well as computational and experimental strategies to overcome them. We review the unique challenges and approaches in the study of symbiotic and uncultured systems, as well as those associated with biosynthetic gene cluster (BGC) assembly and product prediction. Finally, to explore sequencing parameters that affect the recovery and contiguity of large and repetitive BGCs assembled de novo, we simulate Illumina and PacBio sequencing of the Salinispora tropica genome focusing on assembly of the salinilactam (slm) BGC.


September 22, 2019

First insights into the nature and evolution of antisense transcription in nematodes.

The development of multicellular organisms is coordinated by various gene regulatory mechanisms that ensure correct spatio-temporal patterns of gene expression. Recently, the role of antisense transcription in gene regulation has moved into focus of research. To characterize genome-wide patterns of antisense transcription and to study their evolutionary conservation, we sequenced a strand-specific RNA-seq library of the nematode Pristionchus pacificus.We identified 1112 antisense configurations of which the largest group represents 465 antisense transcripts (ASTs) that are fully embedded in introns of their host genes. We find that most ASTs show homology to protein-coding genes and are overrepresented in proteomic data. Together with the finding, that expression levels of ASTs and host genes are uncorrelated, this indicates that most ASTs in P. pacificus do not represent non-coding RNAs and do not exhibit regulatory functions on their host genes. We studied the evolution of antisense gene pairs across 20 nematode genomes, showing that the majority of pairs is lineage-specific and even the highly conserved vps-4, ddx-27, and sel-2 loci show abundant structural changes including duplications, deletions, intron gains and loss of antisense transcription. In contrast, host genes in general, are remarkably conserved and encode exceptionally long introns leading to unusually large blocks of conserved synteny.Our study has shown that in P. pacificus antisense transcription as such does not define non-coding RNAs but is rather a feature of highly conserved genes with long introns. We hypothesize that the presence of regulatory elements imposes evolutionary constraint on the intron length, but simultaneously, their large size makes them a likely target for translocation of genomic elements including protein-coding genes that eventually end up as ASTs.


September 22, 2019

Transcriptome comparative analysis of salt stress responsiveness in chrysanthemum (Dendranthema grandiflorum) roots by Illumina- and Single-Molecule Real-Time-based RNA sequencing.

Salt response has long been considered a polygenic-controlled character in plants. Under salt stress conditions, plants respond by activating a great amount of proteins and enzymes. To develop a better understanding of the molecular mechanism and screen salt responsive genes in chrysanthemum under salt stress, we performed the RNA sequencing (RNA-seq) on both salt-processed chrysanthemum seedling roots and the control group, and gathered six cDNA databases eventually. Moreover, to overcome the Illumina HiSeq technology’s limitation on sufficient length of reads and improve the quality and accuracy of the result, we combined Illumina HiSeq with single-molecule real-time sequencing (SMRT-seq) to decode the full-length transcripts. As a result, we successfully collected 550,823 unigenes, and from which we selected 48,396 differentially expressed genes (DEGs). Many of these DEGs were associated with the signal transduction, biofilm system, antioxidant system, and osmotic regulation system, such as mitogen-activated protein kinase (MAPK), Acyl-CoA thioesterase (ACOT), superoxide (SOD), catalase (CAT), peroxisomal membrane protein (PMP), and pyrroline-5-carboxylate reductase (P5CR). The quantitative real-time polymerase chain reaction (qRT-PCR) analysis of 15 unigenes was performed to test the data validity. The results were highly consistent with the RNA-seq results. In all, these findings could facilitate further detection of the responsive molecular mechanism under salt stress. They also provided more accurate candidate genes for genetic engineering on salt-tolerant chrysanthemums.


September 22, 2019

Microsatellites from Fosterella christophii (Bromeliaceae) by de novo transcriptome sequencing on the Pacific Biosciences RS platform.

Microsatellite markers were developed in Fosterella christophii (Bromeliaceae) to investigate the genetic diversity and population structure within the F. micrantha group, comprising F. christophii, F. micrantha, and F. villosula.Full-length cDNAs were isolated from F. christophii and sequenced on a Pacific Biosciences RS platform. A total of 1590 high-quality consensus isoforms were assembled into 971 unigenes containing 421 perfect microsatellites. Thirty primer sets were designed, of which 13 revealed a high level of polymorphism in three populations of F. christophii, with four to nine alleles per locus. Each of these 13 loci cross-amplified in the closely related species F. micrantha and F. villosula, with one to six and one to 11 alleles per locus, respectively.The new markers are promising tools to study the population genetics of F. christophii and to discover species boundaries within the F. micrantha group.


September 22, 2019

Plant 24-nt reproductive phasiRNAs from intramolecular duplex mRNAs in diverse monocots.

In grasses, two pathways that generate diverse and numerous 21-nt (premeiotic) and 24-nt (meiotic) phased siRNAs are highly enriched in anthers, the male reproductive organs. These “phasiRNAs” are analogous to mammalian piRNAs, yet their functions and evolutionary origins remain largely unknown. The 24-nt meiotic phasiRNAs have only been described in grasses, wherein their biogenesis is dependent on a specialized Dicer (DCL5). To assess how evolution gave rise to this pathway, we examined reproductive phasiRNA pathways in nongrass monocots: garden asparagus, daylily, and lily. The common ancestors of these species diverged approximately 115-117 million years ago (MYA). We found that premeiotic 21-nt and meiotic 24-nt phasiRNAs were abundant in all three species and displayed spatial localization and temporal dynamics similar to grasses. The miR2275-triggered pathway was also present, yielding 24-nt reproductive phasiRNAs, and thus originated more than 117 MYA. In asparagus, unlike in grasses, these siRNAs are largely derived from inverted repeats (IRs); analyses in lily identified thousands of precursor loci, and many were also predicted to form foldback substrates for Dicer processing. Additionally, reproductive phasiRNAs were present in female reproductive organs and thus may function in both male and female germinal development. These data describe several distinct mechanisms of production for 24-nt meiotic phasiRNAs and provide new insights into the evolution of reproductive phasiRNA pathways in monocots.© 2018 Kakrana et al.; Published by Cold Spring Harbor Laboratory Press.


September 22, 2019

Melanization of mycorrhizal fungal necromass structures microbial decomposer communities

Mycorrhizal fungal necromass is increasingly recognized as an important contributor to soil organic carbon pools, particularly in forest ecosystems. While its decomposition rate is primarily determined by biochemical composition, how traits such as melanin content affect the structure of necromass decomposer communities remains poorly understood. To assess the role of biochemical traits on microbial decomposer community composition and functioning, we incubated melanized and non-melanized necromass of the mycorrhizal fungus Meliniomyces bicolor in Pinus- and Quercus-dominated forests in Minnesota, USA and then assessed the associated fungal and bacterial decomposer communities after 1, 2 and 3 months using high-throughput sequencing. Melanized necromass decomposed significantly slower than non-melanized necromass in both forests. The structure of the microbial decomposer communities depended significantly on necromass melanin content, although the effect was stronger for fungi than bacteria. On non-melanized necromass, fungal communities were dominated by r-selected ascomycete and mucoromycete microfungi early and then replaced by basidiomycete ectomycorrhizal fungi, while on melanized necromass these groups were co-dominant throughout the incubation. Bacterial communities were dominated by both specialist mycophageous and generalist taxa. Synthesis. Our results indicate that necromass biochemistry not only strongly affects rates of decomposition but also the structure of the associated decomposer communities. Furthermore, the observed colonization patterns suggest that fungi, and particularly ectomycorrhizal fungi, may play a more important role in necromass decomposition than previously recognized.


September 22, 2019

Cow-to-mouse fecal transplantations suggest intestinal microbiome as one cause of mastitis.

Mastitis, which affects nearly all lactating mammals including human, is generally thought to be caused by local infection of the mammary glands. For treatment, antibiotics are commonly prescribed, which however are of concern in both treatment efficacy and neonate safety. Here, using bovine mastitis which is the most costly disease in the dairy industry as a model, we showed that intestinal microbiota alone can lead to mastitis.Fecal microbiota transplantation (FMT) from mastitis, but not healthy cows, to germ-free (GF) mice resulted in mastitis symptoms in mammary gland and inflammations in serum, spleen, and colon. Probiotic intake in parallel with FMT from diseased cows led to relieved mastitis symptoms in mice, by shifting the murine intestinal microbiota to a state that is functionally distinct from either healthy or diseased microbiota yet structurally similar to the latter. Despite conservation in mastitis symptoms, diseased cows and mice shared few mastitis-associated bacterial organismal or functional markers, suggesting striking divergence in mastitis-associated intestinal microbiota among lactating mammals. Moreover, an “amplification effect” of disease-health distinction in both microbiota structure and function was apparent during the cow-to-mouse FMT.Hence, dysbiosis of intestinal microbiota may be one cause of mastitis, and probiotics that restore intestinal microbiota function are an effective and safe strategy to treat mastitis.


September 22, 2019

Long-read sequencing of chicken transcripts and identification of new transcript isoforms.

The chicken has long served as an important model organism in many fields, and continues to aid our understanding of animal development. Functional genomics studies aimed at probing the mechanisms that regulate development require high-quality genomes and transcript annotations. The quality of these resources has improved dramatically over the last several years, but many isoforms and genes have yet to be identified. We hope to contribute to the process of improving these resources with the data presented here: a set of long cDNA sequencing reads, and a curated set of new genes and transcript isoforms not currently represented in the most up-to-date genome annotation currently available to the community of researchers who rely on the chicken genome.


Talk with an expert

If you have a question, need to check the status of an order, or are interested in purchasing an instrument, we're here to help.