Menu
September 22, 2019  |  

Searching for convergent pathways in autism spectrum disorders: insights from human brain transcriptome studies.

Autism spectrum disorder (ASD) is one of the most heritable neuropsychiatric conditions. The complex genetic landscape of the disorder includes both common and rare variants at hundreds of genetic loci. This marked heterogeneity has thus far hampered efforts to develop genetic diagnostic panels and targeted pharmacological therapies. Here, we give an overview of the current literature on the genetic basis of ASD, and review recent human brain transcriptome studies and their role in identifying convergent pathways downstream of the heterogeneous genetic variants. We also discuss emerging evidence on the involvement of non-coding genomic regions and non-coding RNAs in ASD.


September 22, 2019  |  

TACO produces robust multisample transcriptome assemblies from RNA-seq.

Accurate transcript structure and abundance inference from RNA sequencing (RNA-seq) data is foundational for molecular discovery. Here we present TACO, a computational method to reconstruct a consensus transcriptome from multiple RNA-seq data sets. TACO employs novel change-point detection to demarcate transcript start and end sites, leading to improved reconstruction accuracy compared with other tools in its class. The tool is available at http://tacorna.github.io and can be readily incorporated into RNA-seq analysis workflows.


September 22, 2019  |  

The transcriptome of human pluripotent stem cells.

Human Embryonic Stem Cells (hESCs) are in vitro derivatives of the inner cell mass of the blastocyst and are characterized by an undifferentiated and pluripotent state that can be perpetuated in time, indefinitely. hESCs provide a unique opportunity to both dissect the molecular mechanisms that are predisposed to the maintenance of pluripotency and model the ability to initiate differentiation and cell commitment within the developing embryo. To fully understand these mechanisms, it is necessary to accurately identify the specific transcriptome of hESCs. Many distinct gene annotation methods, such as cDNA and EST sequencing and RNA-Seq, have been used to identify the transcriptome of hESCs. Lately, we developed a new tool (IDP) to integrate the hybrid sequencing data to characterize a more reliable and comprehensive hESC transcriptome with discoveries of many novel transcripts. Copyright © 2014 The Authors. Published by Elsevier Ltd.. All rights reserved.


September 22, 2019  |  

Transcriptome profiling in the spathe of Anthurium andraeanum ‘Albama’ and its anthocyanin-loss mutant ‘Xueyu’.

Anthurium andraeanum is a popular tropical ornamental plant. Its spathes are brilliantly coloured due to variable anthocyanin contents. To examine the mechanisms that control anthocyanin biosynthesis, we sequenced the spathe transcriptomes of ‘Albama’, a red-spathed cultivar of A. andraeanum, and ‘Xueyu’, its anthocyanin-loss mutant. Both long reads and short reads were sequenced. Long read sequencing produced 805,869 raw reads, resulting in 83,073 high-quality transcripts. Short read sequencing produced 347.79?M reads, and the subsequent assembly resulted in 111,674 unigenes. High-quality transcripts and unigenes were quantified using the short reads, and differential expression analysis was performed between ‘Albama’ and ‘Xueyu’. Obtaining high-quality, full-length transcripts enabled the detection of long transcript structures and transcript variants. These data provide a foundation to elucidate the mechanisms regulating the biosynthesis of anthocyanin in A. andraeanum.


September 22, 2019  |  

Membrane attack complex-associated molecules from redlip mullet (Liza haematocheila): Molecular characterization and transcriptional evidence of C6, C7, C8ß, and C9 in innate immunity.

The redlip mullet (Liza haematocheila) is one of the most economically important fish in Korea and other East Asian countries; it is susceptible to infections by pathogens such as Lactococcus garvieae, Argulus spp., Trichodina spp., and Vibrio spp. Learning about the mechanisms of the complement system of the innate immunity of redlip mullet is important for efforts towards eradicating pathogens. Here, we report a comprehensive study of the terminal complement complex (TCC) components that form the membrane attack complex (MAC) through in-silico characterization and comparative spatial and temporal expression profiling. Five conserved domains (TSP1, LDLa, MACPF, CCP, and FIMAC) were detected in the TCC components, but the CCP and FIMAC domains were absent in MuC8ß and MuC9. Expression analysis of four TCC genes from healthy redlip mullets showed the highest expression levels in the liver, whereas limited expression was observed in other tissues; immune-induced expression in the head kidney and spleen revealed significant responses against Lactococcus garvieae and poly I:C injection, suggesting their involvement in MAC formation in response to harmful pathogenic infections. Furthermore, the response to poly I:C may suggest the role of TCC components in the breakdown of the membrane of enveloped viruses. These findings may help to elucidate the mechanisms behind the complement system of the teleosts innate immunity. Copyright © 2018 Elsevier Ltd. All rights reserved.


September 22, 2019  |  

Leveraging multiple transcriptome assembly methods for improved gene structure annotation.

The performance of RNA sequencing (RNA-seq) aligners and assemblers varies greatly across different organisms and experiments, and often the optimal approach is not known beforehand.Here, we show that the accuracy of transcript reconstruction can be boosted by combining multiple methods, and we present a novel algorithm to integrate multiple RNA-seq assemblies into a coherent transcript annotation. Our algorithm can remove redundancies and select the best transcript models according to user-specified metrics, while solving common artifacts such as erroneous transcript chimerisms.We have implemented this method in an open-source Python3 and Cython program, Mikado, available on GitHub.


September 22, 2019  |  

A global survey of alternative splicing in allopolyploid cotton: landscape, complexity and regulation.

Alternative splicing (AS) is a crucial regulatory mechanism in eukaryotes, which acts by greatly increasing transcriptome diversity. The extent and complexity of AS has been revealed in model plants using high-throughput next-generation sequencing. However, this technique is less effective in accurately identifying transcript isoforms in polyploid species because of the high sequence similarity between coexisting subgenomes. Here we characterize AS in the polyploid species cotton. Using Pacific Biosciences single-molecule long-read isoform sequencing (Iso-Seq), we developed an integrated pipeline for Iso-Seq transcriptome data analysis (https://github.com/Nextomics/pipeline-for-isoseq). We identified 176 849 full-length transcript isoforms from 44 968 gene models and updated gene annotation. These data led us to identify 15 102 fibre-specific AS events and estimate that c. 51.4% of homoeologous genes produce divergent isoforms in each subgenome. We reveal that AS allows differential regulation of the same gene by miRNAs at the isoform level. We also show that nucleosome occupancy and DNA methylation play a role in defining exons at the chromatin level. This study provides new insights into the complexity and regulation of AS, and will enhance our understanding of AS in polyploid species. Our methodology for Iso-Seq data analysis will be a useful reference for the study of AS in other species.© 2017 The Authors. New Phytologist © 2017 New Phytologist Trust.


September 22, 2019  |  

Transcriptional adaptations during long-term persistence of Staphylococcus aureus in the airways of a cystic fibrosis patient.

The lungs of Cystic fibrosis (CF) patients are often colonized and/or infected by Staphylococcus aureus for years, mostly by one predominant clone. For long-term survival in this environment, S. aureus needs to adapt during its interactions with host factors, antibiotics, and other pathogens. Here, we study long-term transcriptional as well as genomic adaptations of an isogenic pair of S. aureus isolates from a single patient using RNA sequencing (RNA-Seq) and whole genome sequencing (WGS). Mimicking in vivo conditions, we cultivated the S. aureus isolates using artificial sputum medium before harvesting RNA for subsequent analysis. We confirmed our RNA-Seq data using quantitative real-time (qRT)-PCR and additionally investigated intermediate isolates from the same patient representing in total 13.2 years of persistence in the CF airways. Comparative RNA-Seq analysis of the first and the last (“late”) isolate revealed significant differences in the late isolate after 13.2 years of persistence. Of the 2545 genes expressed in both isolates that were cultivated aerobically, 256 genes were up- and 161 were down-regulated with a minimum 2-fold change (2f). Focusing on 25 highly (=8f) up- (n=9) or down- (n=16) regulated genes, we identified several genes encoding for virulence factors involved in immune evasion, bacterial spread or secretion (e.g. spa, sak, and esxA). Moreover, these genes displayed similar expression trends under aerobic, microaerophilic and anaerobic conditions. Further qRT-PCR-experiments of highly up- or down-regulated genes within intermediate S. aureus isolates resulted in different gene expression patterns over the years. Using sequencing analysis of the differently expressed genes and their upstream regions in the late S. aureus isolate resulted in only few genomic alterations. Comparative transcriptomic analysis revealed adaptive changes affecting mainly genes involved in host-pathogen interaction. Although the underlying mechanisms were not known, our results suggest adaptive processes beyond genomic mutations triggered by local factors rather than by activation of global regulators. Copyright © 2014 The Authors. Published by Elsevier GmbH.. All rights reserved.


September 22, 2019  |  

Evaluating approaches to find exon chains based on long reads.

Transcript prediction can be modeled as a graph problem where exons are modeled as nodes and reads spanning two or more exons are modeled as exon chains. Pacific Biosciences third-generation sequencing technology produces significantly longer reads than earlier second-generation sequencing technologies, which gives valuable information about longer exon chains in a graph. However, with the high error rates of third-generation sequencing, aligning long reads correctly around the splice sites is a challenging task. Incorrect alignments lead to spurious nodes and arcs in the graph, which in turn lead to incorrect transcript predictions. We survey several approaches to find the exon chains corresponding to long reads in a splicing graph, and experimentally study the performance of these methods using simulated data to allow for sensitivity/precision analysis. Our experiments show that short reads from second-generation sequencing can be used to significantly improve exon chain correctness either by error-correcting the long reads before splicing graph creation, or by using them to create a splicing graph on which the long-read alignments are then projected. We also study the memory and time consumption of various modules, and show that accurate exon chains lead to significantly increased transcript prediction accuracy.The simulated data and in-house scripts used for this article are available at http://www.cs.helsinki.fi/group/gsa/exon-chains/exon-chains-bib.tar.bz2.


September 22, 2019  |  

Comparison of the mitochondrial genomes and steady state transcriptomes of two strains of the trypanosomatid parasite, Leishmania tarentolae.

U-insertion/deletion RNA editing is a post-transcriptional mitochondrial RNA modification phenomenon required for viability of trypanosomatid parasites. Small guide RNAs encoded mainly by the thousands of catenated minicircles contain the information for this editing. We analyzed by NGS technology the mitochondrial genomes and transcriptomes of two strains, the old lab UC strain and the recently isolated LEM125 strain. PacBio sequencing provided complete minicircle sequences which avoided the assembly problem of short reads caused by the conserved regions. Minicircles were identified by a characteristic size, the presence of three short conserved sequences, a region of inherently bent DNA and the presence of single gRNA genes at a fairly defined location. The LEM125 strain contained over 114 minicircles encoding different gRNAs and the UC strain only ~24 minicircles. Some LEM125 minicircles contained no identifiable gRNAs. Approximate copy numbers of the different minicircle classes in the network were determined by the number of PacBio CCS reads that assembled to each class. Mitochondrial RNA libraries from both strains were mapped against the minicircle and maxicircle sequences. Small RNA reads mapped to the putative gRNA genes but also to multiple regions outside the genes on both strands and large RNA reads mapped in many cases over almost the entire minicircle on both strands. These data suggest that minicircle transcription is complete and bidirectional, with 3′ processing yielding the mature gRNAs. Steady state RNAs in varying abundances are derived from all maxicircle genes, including portions of the repetitive divergent region. The relative extents of editing in both strains correlated with the presence of a cascade of cognate gRNAs. These data should provide the foundation for a deeper understanding of this dynamic genetic system as well as the evolutionary variation of editing in different strains.


September 22, 2019  |  

Two phospholipid scramblase 1-related proteins (PLSCR1like-a & -b) from Liza haematocheila: Molecular and transcriptional features and expression analysis after immune stimulation.

Phospholipid scramblases (PLSCRs) are a family of transmembrane proteins known to be responsible for Ca2+-mediated bidirectional phospholipid translocation in the plasma membrane. Apart from the scrambling activity of PLSCRs, recent studies revealed their diverse other roles, including antiviral defense, tumorigenesis, protein-DNA interactions, apoptosis regulation, and cell activation. Nonetheless, the biological and transcriptional functions of PLSCRs in fish have not been discovered to date. Therefore, in this study, two new members related to the PLSCR1 family were identified in the red lip mullet (Liza haematocheila) as MuPLSCR1like-a and MuPLSCR1like-b, and their characteristics were studied at molecular and transcriptional levels. Sequence analysis revealed that MuPLSCR1like-a and MuPLSCR1like-b are composed of 245 and 228 amino acid residues (aa) with the predicted molecular weights of 27.82 and 25.74?kDa, respectively. A constructed phylogenetic tree showed that MuPLSCR1like-a and MuPLSCR1like-b are clustered together with other known PLSCR1 and -2 orthologues, thus pointing to the relatedness to both PLSCR1 and PLSCR2 families. Two-dimensional (2D) and 3D graphical representations illustrated the well-known 12-stranded ß-barrel structure of MuPLSCR1like-a and MuPLSCR1like-b with transmembrane orientation toward the phospholipid bilayer. In analysis of tissue-specific expression, the highest expression of MuPLSCR1like-a was observed in the intestine, whereas MuPLSCR1like-b was highly expressed in the brain, indicating isoform specificity. Of note, we found that the transcription of MuPLSCR1like-a and MuPLSCR1like-b was significantly upregulated when the fish were stimulated with poly(I:C), suggesting that such immune responses target viral infections. Overall, this study provides the first experimental insight into the characteristics and immune-system relevance of PLSCR1-related genes in red lip mullets. Copyright © 2018 Elsevier Ltd. All rights reserved.


September 22, 2019  |  

HIV-1 infection of primary CD4(+) T cells regulates the expression of specific HERV-K (HML-2) elements.

Endogenous retroviruses (ERVs) occupy extensive regions of the human genome. Although many of these retroviral elements have lost their ability to replicate, those whose insertion took place more recently, such as the HML-2 group of HERV-K elements, still retain intact open reading frames and the capacity to produce certain viral RNA and/or proteins. Transcription of these ERVs is, however, tightly regulated by dedicated epigenetic control mechanisms. Nonetheless, it has been reported that some pathologic states, such as viral infections and certain cancers, coincide with ERV expression suggesting transcriptional reawakening is possible. HML-2 elements are reportedly induced during HIV-1 infection, but the conserved nature of these elements has, until recently, rendered their expression profiling problematic.Here, we provide comprehensive HERV-K HML-2 expression profiles specific for productively HIV-1 infected primary human CD4(+) T cells. We combined enrichment of HIV-1 infected cells using a reporter virus expressing a surface reporter for gentle and efficient purification with long-read Single Molecule Real-Time sequencing. We show that three HML-2 proviruses, 6q25.1, 8q24.3, and 19q13.42 are up-regulated on average between 3- and 5-fold in HIV-1 infected CD4(+) T cells. One provirus, HML-2 12q24.33, in contrast, was repressed in the presence of active HIV replication.In conclusion, this report identifies the HERV-K HML-2 loci whose expression profiles differ upon HIV-1 infection in primary human CD4(+) T cells. These data will help pave the way for further studies on the influence of endogenous retroviruses on HIV-1 replication.Importance Endogenous retroviruses inhabit big portions of our genome. And although they are mainly inert some of the evolutionarily younger members maintain the ability to express both RNA as well as proteins. We have developed an approach using long-read SMRT sequencing that produces long reads, that provides us with ability to obtain detailed and accurate HERV-K HML-2 expression profiles. We have now applied this approach to study HERV-K expression in the presence and absence of productive HIV-1 infection of primary human CD4(+) T cells. In addition to using SMRT sequencing, our strategy also includes the magnetic selection of the infected cells so that levels of background expression due to uninfected cells are kept at a minimum. The results in this manuscript provide the blueprint for in-depth studies of the interactions of the authentic upregulated HERV-K HML-2 elements and HIV-1. Copyright © 2017 American Society for Microbiology.


Talk with an expert

If you have a question, need to check the status of an order, or are interested in purchasing an instrument, we're here to help.