Menu
September 22, 2019

Novel full-length major histocompatibility complex class I allele discovery and haplotype definition in pig-tailed macaques.

Pig-tailed macaques (Macaca nemestrina, Mane) are important models for human immunodeficiency virus (HIV) studies. Their infectability with minimally modified HIV makes them a uniquely valuable animal model to mimic human infection with HIV and progression to acquired immunodeficiency syndrome (AIDS). However, variation in the pig-tailed macaque major histocompatibility complex (MHC) and the impact of individual transcripts on the pathogenesis of HIV and other infectious diseases is understudied compared to that of rhesus and cynomolgus macaques. In this study, we used Pacific Biosciences single-molecule real-time circular consensus sequencing to describe full-length MHC class I (MHC-I) transcripts for 194 pig-tailed macaques from three breeding centers. We then used the full-length sequences to infer Mane-A and Mane-B haplotypes containing groups of MHC-I transcripts that co-segregate due to physical linkage. In total, we characterized full-length open reading frames (ORFs) for 313 Mane-A, Mane-B, and Mane-I sequences that defined 86 Mane-A and 106 Mane-B MHC-I haplotypes. Pacific Biosciences technology allows us to resolve these Mane-A and Mane-B haplotypes to the level of synonymous allelic variants. The newly defined haplotypes and transcript sequences containing full-length ORFs provide an important resource for infectious disease researchers as certain MHC haplotypes have been shown to provide exceptional control of simian immunodeficiency virus (SIV) replication and prevention of AIDS-like disease in nonhuman primates. The increased allelic resolution provided by Pacific Biosciences sequencing also benefits transplant research by allowing researchers to more specifically match haplotypes between donors and recipients to the level of nonsynonymous allelic variation, thus reducing the risk of graft-versus-host disease.


September 22, 2019

The genomic and functional landscapes of developmental plasticity in the American cockroach.

Many cockroach species have adapted to urban environments, and some have been serious pests of public health in the tropics and subtropics. Here, we present the 3.38-Gb genome and a consensus gene set of the American cockroach, Periplaneta americana. We report insights from both genomic and functional investigations into the underlying basis of its adaptation to urban environments and developmental plasticity. In comparison with other insects, expansions of gene families in P. americana exist for most core gene families likely associated with environmental adaptation, such as chemoreception and detoxification. Multiple pathways regulating metamorphic development are well conserved, and RNAi experiments inform on key roles of 20-hydroxyecdysone, juvenile hormone, insulin, and decapentaplegic signals in regulating plasticity. Our analyses reveal a high level of sequence identity in genes between the American cockroach and two termite species, advancing it as a valuable model to study the evolutionary relationships between cockroaches and termites.


September 22, 2019

High-throughput annotation of full-length long noncoding RNAs with capture long-read sequencing.

Accurate annotation of genes and their transcripts is a foundation of genomics, but currently no annotation technique combines throughput and accuracy. As a result, reference gene collections remain incomplete-many gene models are fragmentary, and thousands more remain uncataloged, particularly for long noncoding RNAs (lncRNAs). To accelerate lncRNA annotation, the GENCODE consortium has developed RNA Capture Long Seq (CLS), which combines targeted RNA capture with third-generation long-read sequencing. Here we present an experimental reannotation of the GENCODE intergenic lncRNA populations in matched human and mouse tissues that resulted in novel transcript models for 3,574 and 561 gene loci, respectively. CLS approximately doubled the annotated complexity of targeted loci, outperforming existing short-read techniques. Full-length transcript models produced by CLS enabled us to definitively characterize the genomic features of lncRNAs, including promoter and gene structure, and protein-coding potential. Thus, CLS removes a long-standing bottleneck in transcriptome annotation and generates manual-quality full-length transcript models at high-throughput scales.


September 22, 2019

Metasecretome phage display.

Metasecretome is a collection of cell-surface and secreted proteins that mediate interactions between microbial communities and their environment. These include adhesins, enzymes, surface structures such as pili or flagella, vaccine targets or proteins responsible for immune evasion. Traditional approaches to exploring matasecretome of complex microbial communities via cultivation of microorganisms and screening of individual strains fail to sample extraordinary diversity in these communities, since only a limited fraction of microorganisms are represented by cultures. Advances in culture-independent sequence analysis methods, collectively referred to as metagenomics, offer an alternative approach that enables the direct analysis of collective microbial genomes (metagenome) recovered from environmental samples. This protocol describes a method, metasecretome phage display, which selectively displays the metasecretome portion of the metagenome. The metasecretome library can then be used for two purposes: (1) to sequence the entire metasecretome (using PacBio technology); (2) to identify metasecretome proteins that have a specific function of interest by affinity-screening (bio-panning) using a variety of methods described in other chapters of this volume.


September 22, 2019

Combination of novel and public RNA-seq datasets to generate an mRNA expression atlas for the domestic chicken.

The domestic chicken (Gallus gallus) is widely used as a model in developmental biology and is also an important livestock species. We describe a novel approach to data integration to generate an mRNA expression atlas for the chicken spanning major tissue types and developmental stages, using a diverse range of publicly-archived RNA-seq datasets and new data derived from immune cells and tissues.Randomly down-sampling RNA-seq datasets to a common depth and quantifying expression against a reference transcriptome using the mRNA quantitation tool Kallisto ensured that disparate datasets explored comparable transcriptomic space. The network analysis tool Graphia was used to extract clusters of co-expressed genes from the resulting expression atlas, many of which were tissue or cell-type restricted, contained transcription factors that have previously been implicated in their regulation, or were otherwise associated with biological processes, such as the cell cycle. The atlas provides a resource for the functional annotation of genes that currently have only a locus ID. We cross-referenced the RNA-seq atlas to a publicly available embryonic Cap Analysis of Gene Expression (CAGE) dataset to infer the developmental time course of organ systems, and to identify a signature of the expansion of tissue macrophage populations during development.Expression profiles obtained from public RNA-seq datasets – despite being generated by different laboratories using different methodologies – can be made comparable to each other. This meta-analytic approach to RNA-seq can be extended with new datasets from novel tissues, and is applicable to any species.


September 22, 2019

Novel molecules lncRNAs, tRFs and circRNAs deciphered from next-generation sequencing/RNA sequencing: computational databases and tools.

Powerful next-generation sequencing (NGS) technologies, more specifically RNA sequencing (RNA-seq), have been pivotal toward the detection and analysis and hypotheses generation of novel biomolecules, long noncoding RNAs (lncRNAs), tRNA-derived fragments (tRFs) and circular RNAs (circRNAs). Experimental validation of the occurrence of these biomolecules inside the cell has been reported. Their differential expression and functionally important role in several cancers types as well as other diseases such as Alzheimer’s and cardiovascular diseases have garnered interest toward further studies in this research arena. In this review, starting from a brief relevant introduction to NGS and RNA-seq and the expression and role of lncRNAs, tRFs and circRNAs in cancer, we have comprehensively analyzed the current landscape of databases developed and computational software used for analyses and visualization for this emerging and highly interesting field of these novel biomolecules. Our review will help the end users and research investigators gain information on the existing databases and tools as well as an understanding of the specific features which these offer. This will be useful for the researchers in their proper usage thereby guiding them toward novel hypotheses generation and saving time and costs involved in extensive experimental processes in these three different novel functional RNAs.© The Author 2017. Published by Oxford University Press. All rights reserved. For permissions, please email: journals.permissions@oup.com.


September 22, 2019

Multiplatform next-generation sequencing identifies novel RNA molecules and transcript isoforms of the endogenous retrovirus isolated from cultured cells.

In this study, we applied short- and long-read RNA sequencing techniques, as well as PCR analysis to investigate the transcriptome of the porcine endogenous retrovirus (PERV) expressed from cultured porcine kidney cell line PK-15. This analysis has revealed six novel transcripts and eight transcript isoforms, including five length and three splice variants. We were able to establish whether a deletion in a transcript is the result of the splicing of mRNAs or of genomic deletion in one of the PERV clones. Additionally, we re-annotated the formerly identified RNA molecules. Our analysis revealed a higher complexity of PERV transcriptome than it was earlier believed.© FEMS 2018. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.


September 22, 2019

PHACTR1 splicing isoforms and eQTLs in atherosclerosis-relevant human cells.

Genome-wide association studies (GWAS) have identified a variant (rs9349379) at the phosphatase and actin regulator 1 (PHACTR1) locus that is associated with coronary artery disease (CAD). The same variant is also an expression quantitative trait locus (eQTL) for PHACTR1 in human coronary arteries (hCA). Here, we sought to characterize PHACTR1 splicing pattern in atherosclerosis-relevant human cells. We also explored how rs9349379 modulates the expression of the different PHACTR1 splicing isoforms.We combined rapid amplification of cDNA ends (RACE) with next-generation long-read DNA sequencing to discover all PHACTR1 transcripts in many human tissues and cell types. We measured PHACTR1 transcripts by qPCR to identify transcript-specific eQTLs.We confirmed a brain-specific long transcript, a short transcript expressed in monocytes and four intermediate transcripts that are different due to alternative splicing of two in-frame exons. In contrast to a previous report, we confirmed that the PHACTR1 protein is present in vascular smooth muscle cells. In 158 hCA from our collection and the GTEx dataset, rs9349379 was only associated with the expression levels of the intermediate PHACTR1 transcripts.Our comprehensive transcriptomic profiling of PHACTR1 indicates that this gene encodes six main transcripts. Five of them are expressed in hCA, where atherosclerotic plaques develop. In this tissue, genotypes at rs9349379 are associated with the expression of the intermediate transcripts, but not the immune-specific short transcript. This result suggests that rs9349379 may in part influence CAD by modulating the expression of intermediate PHACTR1 transcripts in endothelial or vascular smooth muscle cells found in hCA.


September 22, 2019

A survey of the sorghum transcriptome using single-molecule long reads.

Alternative splicing and alternative polyadenylation (APA) of pre-mRNAs greatly contribute to transcriptome diversity, coding capacity of a genome and gene regulatory mechanisms in eukaryotes. Second-generation sequencing technologies have been extensively used to analyse transcriptomes. However, a major limitation of short-read data is that it is difficult to accurately predict full-length splice isoforms. Here we sequenced the sorghum transcriptome using Pacific Biosciences single-molecule real-time long-read isoform sequencing and developed a pipeline called TAPIS (Transcriptome Analysis Pipeline for Isoform Sequencing) to identify full-length splice isoforms and APA sites. Our analysis reveals transcriptome-wide full-length isoforms at an unprecedented scale with over 11,000 novel splice isoforms. Additionally, we uncover APA of ~11,000 expressed genes and more than 2,100 novel genes. These results greatly enhance sorghum gene annotations and aid in studying gene regulation in this important bioenergy crop. The TAPIS pipeline will serve as a useful tool to analyse Iso-Seq data from any organism.


September 22, 2019

Functional mitochondria in health and disease.

The ability to rapidly adapt cellular bioenergetic capabilities to meet rapidly changing environmental conditions is mandatory for normal cellular function and for cancer progression. Any loss of this adaptive response has the potential to compromise cellular function and render the cell more susceptible to external stressors such as oxidative stress, radiation, chemotherapeutic drugs, and hypoxia. Mitochondria play a vital role in bioenergetic and biosynthetic pathways and can rapidly adjust to meet the metabolic needs of the cell. Increased demand is met by mitochondrial biogenesis and fusion of individual mitochondria into dynamic networks, whereas a decrease in demand results in the removal of superfluous mitochondria through fission and mitophagy. Effective communication between nucleus and mitochondria (mito-nuclear cross talk), involving the generation of different mitochondrial stress signals as well as the nuclear stress response pathways to deal with these stressors, maintains bioenergetic homeostasis under most conditions. However, when mitochondrial DNA (mtDNA) mutations accumulate and mito-nuclear cross talk falters, mitochondria fail to deliver critical functional outputs. Mutations in mtDNA have been implicated in neuromuscular and neurodegenerative mitochondriopathies and complex diseases such as diabetes, cardiovascular diseases, gastrointestinal disorders, skin disorders, aging, and cancer. In some cases, drastic measures such as acquisition of new mitochondria from donor cells occurs to ensure cell survival. This review starts with a brief discussion of the evolutionary origin of mitochondria and summarizes how mutations in mtDNA lead to mitochondriopathies and other degenerative diseases. Mito-nuclear cross talk, including various stress signals generated by mitochondria and corresponding stress response pathways activated by the nucleus are summarized. We also introduce and discuss a small family of recently discovered hormone-like mitopeptides that modulate body metabolism. Under conditions of severe mitochondrial stress, mitochondria have been shown to traffic between cells, replacing mitochondria in cells with damaged and malfunctional mtDNA. Understanding the processes involved in cellular bioenergetics and metabolic adaptation has the potential to generate new knowledge that will lead to improved treatment of many of the metabolic, degenerative, and age-related inflammatory diseases that characterize modern societies.


September 22, 2019

Cell-type-specific splicing of Piezo2 regulates mechanotransduction.

Piezo2 is a mechanically activated ion channel required for touch discrimination, vibration detection, and proprioception. Here, we discovered that Piezo2 is extensively spliced, producing different Piezo2 isoforms with distinct properties. Sensory neurons from both mice and humans express a large repertoire of Piezo2 variants, whereas non-neuronal tissues express predominantly a single isoform. Notably, even within sensory ganglia, we demonstrate the splicing of Piezo2 to be cell type specific. Biophysical characterization revealed substantial differences in ion permeability, sensitivity to calcium modulation, and inactivation kinetics among Piezo2 splice variants. Together, our results describe, at the molecular level, a potential mechanism by which transduction is tuned, permitting the detection of a variety of mechanosensory stimuli. Published by Elsevier Inc.


September 22, 2019

Isoform evolution in primates through independent combination of alternative RNA processing events.

Recent RNA-seq technology revealed thousands of splicing events that are under rapid evolution in primates, whereas the reliability of these events, as well as their combination on the isoform level, have not been adequately addressed due to its limited sequencing length. Here, we performed comparative transcriptome analyses in human and rhesus macaque cerebellum using single molecule long-read sequencing (Iso-seq) and matched RNA-seq. Besides 359 million RNA-seq reads, 4,165,527 Iso-seq reads were generated with a mean length of 14,875?bp, covering 11,466 human genes, and 10,159 macaque genes. With Iso-seq data, we substantially expanded the repertoire of alternative RNA processing events in primates, and found that intron retention and alternative polyadenylation are surprisingly more prevalent in primates than previously estimated. We then investigated the combinatorial mode of these alternative events at the whole-transcript level, and found that the combination of these events is largely independent along the transcript, leading to thousands of novel isoforms missed by current annotations. Notably, these novel isoforms are selectively constrained in general, and 1,119 isoforms have even higher expression than the previously annotated major isoforms in human, indicating that the complexity of the human transcriptome is still significantly underestimated. Comparative transcriptome analysis further revealed 502 genes encoding selectively constrained, lineage-specific isoforms in human but not in rhesus macaque, linking them to some lineage-specific functions. Overall, we propose that the independent combination of alternative RNA processing events has contributed to complex isoform evolution in primates, which provides a new foundation for the study of phenotypic difference among primates.© The Author 2017. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.


September 22, 2019

Single molecule RNA sequencing uncovers trans-splicing and improves annotations in Anopheles stephensi.

Single molecule real-time (SMRT) sequencing has recently been used to obtain full-length cDNA sequences that improve genome annotation and reveal RNA isoforms. Here, we used one such method called isoform sequencing from Pacific Biosciences (PacBio) to sequence a cDNA library from the Asian malaria mosquito Anopheles stephensi. More than 600 000 full-length cDNAs, referred to as reads of insert, were identified. Owing to the inherently high error rate of PacBio sequencing, we tested different approaches for error correction. We found that error correction using Illumina RNA sequencing (RNA-seq) generated more data than using the default SMRT pipeline. The full-length error-corrected PacBio reads greatly improved the gene annotation of Anopheles stephensi: 4867 gene models were updated and 1785 alternatively spliced isoforms were added to the annotation. In addition, six trans-splicing events, where exons from different primary transcripts were joined together, were identified in An. stephensi. All six trans-splicing events appear to be conserved in Culicidae, as they are also found in Anopheles gambiae and Aedes aegypti. The proteins encoded by trans-splicing events are also highly conserved and the orthologues of these proteins are cis-spliced in outgroup species, indicating that trans-splicing may arise as a mechanism to rescue genes that broke up during evolution.© 2017 The Royal Entomological Society.


September 22, 2019

A human-specific switch of alternatively spliced AFMID isoforms contributes to TP53 mutations and tumor recurrence in hepatocellular carcinoma.

Pre-mRNA splicing can contribute to the switch of cell identity that occurs in carcinogenesis. Here, we analyze a large collection of RNA-seq data sets and report that splicing changes in hepatocyte-specific enzymes, such as AFMID and KHK, are associated with HCC patients’ survival and relapse. The switch of AFMID isoforms is an early event in HCC development and is associated with driver mutations in TP53 and ARID1A The switch of AFMID isoforms is human-specific and not detectable in other species, including primates. Finally, we show that overexpression of the full-length AFMID isoform leads to a higher NAD+ level, lower DNA-damage response, and slower cell growth in HepG2 cells. The integrative analysis uncovered a mechanistic link between splicing switches, de novo NAD+ biosynthesis, driver mutations, and HCC recurrence.© 2018 Lin et al.; Published by Cold Spring Harbor Laboratory Press.


Talk with an expert

If you have a question, need to check the status of an order, or are interested in purchasing an instrument, we're here to help.