Menu
September 22, 2019

Comprehensive genomic analysis of malignant pleural mesothelioma identifies recurrent mutations, gene fusions and splicing alterations.

We analyzed transcriptomes (n = 211), whole exomes (n = 99) and targeted exomes (n = 103) from 216 malignant pleural mesothelioma (MPM) tumors. Using RNA-seq data, we identified four distinct molecular subtypes: sarcomatoid, epithelioid, biphasic-epithelioid (biphasic-E) and biphasic-sarcomatoid (biphasic-S). Through exome analysis, we found BAP1, NF2, TP53, SETD2, DDX3X, ULK2, RYR2, CFAP45, SETDB1 and DDX51 to be significantly mutated (q-score = 0.8) in MPMs. We identified recurrent mutations in several genes, including SF3B1 (~2%; 4/216) and TRAF7 (~2%; 5/216). SF3B1-mutant samples showed a splicing profile distinct from that of wild-type tumors. TRAF7 alterations occurred primarily in the WD40 domain and were, except in one case, mutually exclusive with NF2 alterations. We found recurrent gene fusions and splice alterations to be frequent mechanisms for inactivation of NF2, BAP1 and SETD2. Through integrated analyses, we identified alterations in Hippo, mTOR, histone methylation, RNA helicase and p53 signaling pathways in MPMs.


September 22, 2019

Neural circular RNAs are derived from synaptic genes and regulated by development and plasticity.

Circular RNAs (circRNAs) have re-emerged as an interesting RNA species. Using deep RNA profiling in different mouse tissues, we observed that circRNAs were substantially enriched in brain and a disproportionate fraction of them were derived from host genes that encode synaptic proteins. Moreover, on the basis of separate profiling of the RNAs localized in neuronal cell bodies and neuropil, circRNAs were, on average, more enriched in the neuropil than their host gene mRNA isoforms. Using high-resolution in situ hybridization, we visualized circRNA punctae in the dendrites of neurons. Consistent with the idea that circRNAs might regulate synaptic function during development, many circRNAs changed their abundance abruptly at a time corresponding to synaptogenesis. In addition, following a homeostatic downscaling of neuronal activity many circRNAs exhibited substantial up- or downregulation. Together, our data indicate that brain circRNAs are positioned to respond to and regulate synaptic function.


September 22, 2019

Sixteen diverse laboratory mouse reference genomes define strain-specific haplotypes and novel functional loci.

We report full-length draft de novo genome assemblies for 16 widely used inbred mouse strains and find extensive strain-specific haplotype variation. We identify and characterize 2,567 regions on the current mouse reference genome exhibiting the greatest sequence diversity. These regions are enriched for genes involved in pathogen defence and immunity and exhibit enrichment of transposable elements and signatures of recent retrotransposition events. Combinations of alleles and genes unique to an individual strain are commonly observed at these loci, reflecting distinct strain phenotypes. We used these genomes to improve the mouse reference genome, resulting in the completion of 10 new gene structures. Also, 62 new coding loci were added to the reference genome annotation. These genomes identified a large, previously unannotated, gene (Efcab3-like) encoding 5,874 amino acids. Mutant Efcab3-like mice display anomalies in multiple brain regions, suggesting a possible role for this gene in the regulation of brain development.


September 22, 2019

Shorter unreported sequences in a RACE-Seq study involving seven tissues confirms ~150 novel transcripts identified in MCF-7 cell line PacBio transcriptome, leaving ~100 non-redundant transcripts exclusive to the cancer cell line.

PacBio sequencing generates much longer reads compared to second-generation sequencing technologies, with a trade-off of lower throughput, higher error rate and more cost per base. The PacBio transcriptome of the breast cancer cell line MCF-7 was found to have ~300 transcripts un-annotated in the current GENCODE (v25) or RefSeq, and missing in the liver, heart and brain PacBio transcriptomes [1]. RACE-sequencing (RACE-seq [2]) extends a well-established method of characterizing cDNA molecules generated by rapid amplification of cDNA ends (RACE [3]) using high-throughput sequencing technologies, reducing costs compared to PacBio. Here, shorter fragments of ~150 transcripts were found to be present in seven tissues analyzed in a recent RACE-seq study (Accid:ERP012249) [4]. These transcripts were not among the ~2500 novel transcripts reported in that study, tested separately here using the genomic coordinates provided, although “all curated novel isoforms were incorporated into the human GENCODE set (v22)” in that study. Non-redundancy analysis of the exclusive transcripts identified one transcript mapping to Chr1 with seven different splice variants, and erroneously mapped to Chr15 (PAC clone 15q11-q13) from the Prader-Willi/Angelman Syndrome region (Accid:AC004137.1). Finally, there are ~100 non-redundant transcripts missing in the seven tissues, in addition to other three tissues analyzed previously. Their absence in GENCODE and RefSeq databases rule them out as commonly transcribed regions, further increasing their likelihood as biomarkers.


September 22, 2019

Genome and evolution of the shade-requiring medicinal herb Panax ginseng.

Panax ginseng C. A. Meyer, reputed as the king of medicinal herbs, has slow growth, long generation time, low seed production and complicated genome structure that hamper its study. Here, we unveil the genomic architecture of tetraploid P. ginseng by de novo genome assembly, representing 2.98 Gbp with 59 352 annotated genes. Resequencing data indicated that diploid Panax species diverged in association with global warming in Southern Asia, and two North American species evolved via two intercontinental migrations. Two whole genome duplications (WGD) occurred in the family Araliaceae (including Panax) after divergence with the Apiaceae, the more recent one contributing to the ability of P. ginseng to overwinter, enabling it to spread broadly through the Northern Hemisphere. Functional and evolutionary analyses suggest that production of pharmacologically important dammarane-type ginsenosides originated in Panax and are produced largely in shoot tissues and transported to roots; that newly evolved P. ginseng fatty acid desaturases increase freezing tolerance; and that unprecedented retention of chlorophyll a/b binding protein genes enables efficient photosynthesis under low light. A genome-scale metabolic network provides a holistic view of Panax ginsenoside biosynthesis. This study provides valuable resources for improving medicinal values of ginseng either through genomics-assisted breeding or metabolic engineering.© 2018 The Authors. Plant Biotechnology Journal published by Society for Experimental Biology and The Association of Applied Biologists and John Wiley & Sons Ltd.


September 22, 2019

Full-length mRNA sequencing uncovers a widespread coupling between transcription initiation and mRNA processing.

The multifaceted control of gene expression requires tight coordination of regulatory mechanisms at transcriptional and post-transcriptional level. Here, we studied the interdependence of transcription initiation, splicing and polyadenylation events on single mRNA molecules by full-length mRNA sequencing.In MCF-7 breast cancer cells, we find 2700 genes with interdependent alternative transcription initiation, splicing and polyadenylation events, both in proximal and distant parts of mRNA molecules, including examples of coupling between transcription start sites and polyadenylation sites. The analysis of three human primary tissues (brain, heart and liver) reveals similar patterns of interdependency between transcription initiation and mRNA processing events. We predict thousands of novel open reading frames from full-length mRNA sequences and obtained evidence for their translation by shotgun proteomics. The mapping database rescues 358 previously unassigned peptides and improves the assignment of others. By recognizing sample-specific amino-acid changes and novel splicing patterns, full-length mRNA sequencing improves proteogenomics analysis of MCF-7 cells.Our findings demonstrate that our understanding of transcriptome complexity is far from complete and provides a basis to reveal largely unresolved mechanisms that coordinate transcription initiation and mRNA processing.


September 22, 2019

High-resolution comparative analysis of great ape genomes.

Genetic studies of human evolution require high-quality contiguous ape genome assemblies that are not guided by the human reference. We coupled long-read sequence assembly and full-length complementary DNA sequencing with a multiplatform scaffolding approach to produce ab initio chimpanzee and orangutan genome assemblies. By comparing these with two long-read de novo human genome assemblies and a gorilla genome assembly, we characterized lineage-specific and shared great ape genetic variation ranging from single- to mega-base pair-sized variants. We identified ~17,000 fixed human-specific structural variants identifying genic and putative regulatory changes that have emerged in humans since divergence from nonhuman apes. Interestingly, these variants are enriched near genes that are down-regulated in human compared to chimpanzee cerebral organoids, particularly in cells analogous to radial glial neural progenitors. Copyright © 2018 The Authors, some rights reserved; exclusive licensee American Association for the Advancement of Science. No claim to original U.S. Government Works.


September 22, 2019

A single-molecule long-read survey of the human transcriptome.

Global RNA studies have become central to understanding biological processes, but methods such as microarrays and short-read sequencing are unable to describe an entire RNA molecule from 5′ to 3′ end. Here we use single-molecule long-read sequencing technology from Pacific Biosciences to sequence the polyadenylated RNA complement of a pooled set of 20 human organs and tissues without the need for fragmentation or amplification. We show that full-length RNA molecules of up to 1.5 kb can readily be monitored with little sequence loss at the 5′ ends. For longer RNA molecules more 5′ nucleotides are missing, but complete intron structures are often preserved. In total, we identify ~14,000 spliced GENCODE genes. High-confidence mappings are consistent with GENCODE annotations, but >10% of the alignments represent intron structures that were not previously annotated. As a group, transcripts mapping to unannotated regions have features of long, noncoding RNAs. Our results show the feasibility of deep sequencing full-length RNA from complex eukaryotic transcriptomes on a single-molecule level.


September 22, 2019

A survey of transcriptome complexity in Sus scrofa using single-molecule long-read sequencing.

Alternative splicing (AS) and fusion transcripts produce a vast expansion of transcriptomes and proteomes diversity. However, the reliability of these events and the extend of epigenetic mechanisms have not been adequately addressed due to its limitation of uncertainties about the complete structure of mRNA. Here we combined single-molecule real-time sequencing, Illumina RNA-seq and DNA methylation data to characterize the landscapes of DNA methylation on AS, fusion isoforms formation and lncRNA feature and further to unveil the transcriptome complexity of pig. Our analysis identified an unprecedented scale of high-quality full-length isoforms with over 28,127 novel isoforms from 26,881 novel genes. More than 92,000 novel AS events were detected and intron retention predominated in AS model, followed by exon skipping. Interestingly, we found that DNA methylation played an important role in generating various AS isoforms by regulating splicing sites, promoter regions and first exons. Furthermore, we identified a large of fusion transcripts and novel lncRNAs, and found that DNA methylation of the promoter and gene body could regulate lncRNA expression. Our results significantly improved existed gene models of pig and unveiled that pig AS and epigenetic modify were more complex than previously thought.


September 22, 2019

Transcriptional diversity during lineage commitment of human blood progenitors.

Blood cells derive from hematopoietic stem cells through stepwise fating events. To characterize gene expression programs driving lineage choice, we sequenced RNA from eight primary human hematopoietic progenitor populations representing the major myeloid commitment stages and the main lymphoid stage. We identified extensive cell type-specific expression changes: 6711 genes and 10,724 transcripts, enriched in non-protein-coding elements at early stages of differentiation. In addition, we found 7881 novel splice junctions and 2301 differentially used alternative splicing events, enriched in genes involved in regulatory processes. We demonstrated experimentally cell-specific isoform usage, identifying nuclear factor I/B (NFIB) as a regulator of megakaryocyte maturation-the platelet precursor. Our data highlight the complexity of fating events in closely related progenitor populations, the understanding of which is essential for the advancement of transplantation and regenerative medicine. Copyright © 2014, American Association for the Advancement of Science.


September 22, 2019

Characterization of the human ESC transcriptome by hybrid sequencing.

Although transcriptional and posttranscriptional events are detected in RNA-Seq data from second-generation sequencing, full-length mRNA isoforms are not captured. On the other hand, third-generation sequencing, which yields much longer reads, has current limitations of lower raw accuracy and throughput. Here, we combine second-generation sequencing and third-generation sequencing with a custom-designed method for isoform identification and quantification to generate a high-confidence isoform dataset for human embryonic stem cells (hESCs). We report 8,084 RefSeq-annotated isoforms detected as full-length and an additional 5,459 isoforms predicted through statistical inference. Over one-third of these are novel isoforms, including 273 RNAs from gene loci that have not previously been identified. Further characterization of the novel loci indicates that a subset is expressed in pluripotent cells but not in diverse fetal and adult tissues; moreover, their reduced expression perturbs the network of pluripotency-associated genes. Results suggest that gene identification, even in well-characterized human cell lines and tissues, is likely far from complete.


September 22, 2019

Bypassing the Restriction System To Improve Transformation of Staphylococcus epidermidis.

Staphylococcus epidermidis is the leading cause of infections on indwelling medical devices worldwide. Intrinsic antibiotic resistance and vigorous biofilm production have rendered these infections difficult to treat and, in some cases, require the removal of the offending medical prosthesis. With the exception of two widely passaged isolates, RP62A and 1457, the pathogenesis of infections caused by clinical S. epidermidis strains is poorly understood due to the strong genetic barrier that precludes the efficient transformation of foreign DNA into clinical isolates. The difficulty in transforming clinical S. epidermidis isolates is primarily due to the type I and IV restriction-modification systems, which act as genetic barriers. Here, we show that efficient plasmid transformation of clinical S. epidermidis isolates from clonal complexes 2, 10, and 89 can be realized by employing a plasmid artificial modification (PAM) in Escherichia coli DC10B containing a ?dcm mutation. This transformative technique should facilitate our ability to genetically modify clinical isolates of S. epidermidis and hence improve our understanding of their pathogenesis in human infections.IMPORTANCEStaphylococcus epidermidis is a source of considerable morbidity worldwide. The underlying mechanisms contributing to the commensal and pathogenic lifestyles of S. epidermidis are poorly understood. Genetic manipulations of clinically relevant strains of S. epidermidis are largely prohibited due to the presence of a strong restriction barrier. With the introductions of the tools presented here, genetic manipulation of clinically relevant S. epidermidis isolates has now become possible, thus improving our understanding of S. epidermidis as a pathogen. Copyright © 2017 American Society for Microbiology.


September 22, 2019

Indoleacrylic acid produced by commensal Peptostreptococcus species suppresses inflammation.

Host factors in the intestine help select for bacteria that promote health. Certain commensals can utilize mucins as an energy source, thus promoting their colonization. However, health conditions such as inflammatory bowel disease (IBD) are associated with a reduced mucus layer, potentially leading to dysbiosis associated with this disease. We characterize the capability of commensal species to cleave and transport mucin-associated monosaccharides and identify several Clostridiales members that utilize intestinal mucins. One such mucin utilizer, Peptostreptococcus russellii, reduces susceptibility to epithelial injury in mice. Several Peptostreptococcus species contain a gene cluster enabling production of the tryptophan metabolite indoleacrylic acid (IA), which promotes intestinal epithelial barrier function and mitigates inflammatory responses. Furthermore, metagenomic analysis of human stool samples reveals that the genetic capability of microbes to utilize mucins and metabolize tryptophan is diminished in IBD patients. Our data suggest that stimulating IA production could promote anti-inflammatory responses and have therapeutic benefits. Copyright © 2017 Elsevier Inc. All rights reserved.


September 22, 2019

Altered expression of the FMR1 splicing variants landscape in premutation carriers.

FMR1 premutation carriers (55-200 CGG repeats) are at risk for developing Fragile X-associated Tremor/Ataxia Syndrome (FXTAS), an adult onset neurodegenerative disorder. Approximately 20% of female carriers will develop Fragile X-associated Primary Ovarian Insufficiency (FXPOI), in addition to a number of clinical problems affecting premutation carriers throughout their life span. Marked elevation in FMR1 mRNA levels have been observed with premutation alleles resulting in RNA toxicity, the leading molecular mechanism proposed for the FMR1 associated disorders observed in premutation carriers. The FMR1 gene undergoes alternative splicing and we have recently reported that the relative abundance of all FMR1 mRNA isoforms is significantly increased in premutation carriers. In this study, we characterized the transcriptional FMR1 isoforms distribution pattern in different tissues and identified a total of 49 isoforms, some of which observed only in premutation carriers and which might play a role in the pathogenesis of FXTAS. Further, we investigated the distribution pattern and expression levels of the FMR1 isoforms in asymptomatic premutation carriers and in those with FXTAS and found no significant differences between the two groups. Our findings suggest that the characterization of the expression levels of the different FMR1 isoforms is fundamental for understanding the regulation of the FMR1 gene as imbalance in their expression could lead to an altered functional diversity with neurotoxic consequences. Their characterization will also help to elucidating the mechanism(s) by which “toxic gain of function” of the FMR1 mRNA may play a role in FXTAS and/or in the other FMR1-associated conditions. Copyright © 2017. Published by Elsevier B.V.


September 22, 2019

Lactobacillus fermentum FTDC 8312 combats hypercholesterolemia via alteration of gut microbiota.

In this study, hypercholesterolemic mice fed with Lactobacillus fermentum FTDC 8312 after a seven-week feeding trial showed a reduction in serum total cholesterol (TC) levels, accompanied by a decrease in serum low-density lipoprotein cholesterol (LDL-C) levels, an increase in serum high-density lipoprotein cholesterol (HDL-C) levels, and a decreased ratio of apoB100:apoA1 when compared to those fed with control or a type strain, L. fermentum JCM 1173. These have contributed to a decrease in atherogenic indices (TC/HDL-C) of mice on the FTDC 8312 diet. Serum triglyceride (TG) levels of mice fed with FTDC 8312 and JCM 1173 were comparable to those of the controls. A decreased ratio of cholesterol and phospholipids (C/P) was also observed for mice fed with FTDC 8312, leading to a decreased number of spur red blood cells (RBC) formation in mice. Additionally, there was an increase in fecal TC, TG, and total bile acid levels in mice on FTDC 8312 diet compared to those with JCM 1173 and controls. The administration of FTDC 8312 also altered the gut microbiota population such as an increase in the members of genera Akkermansia and Oscillospira, affecting lipid metabolism and fecal bile excretion in the mice. Overall, we demonstrated that FTDC 8312 exerted a cholesterol lowering effect that may be attributed to gut microbiota modulation. Copyright © 2017 Elsevier B.V. All rights reserved.


Talk with an expert

If you have a question, need to check the status of an order, or are interested in purchasing an instrument, we're here to help.