Menu
September 22, 2019  |  

The state of long non-coding RNA biology.

Transcriptomic studies have demonstrated that the vast majority of the genomes of mammals and other complex organisms is expressed in highly dynamic and cell-specific patterns to produce large numbers of intergenic, antisense and intronic long non-protein-coding RNAs (lncRNAs). Despite well characterized examples, their scaling with developmental complexity, and many demonstrations of their association with cellular processes, development and diseases, lncRNAs are still to be widely accepted as major players in gene regulation. This may reflect an underappreciation of the extent and precision of the epigenetic control of differentiation and development, where lncRNAs appear to have a central role, likely as organizational and guide molecules: most lncRNAs are nuclear-localized and chromatin-associated, with some involved in the formation of specialized subcellular domains. I suggest that a reassessment of the conceptual framework of genetic information and gene expression in the 4-dimensional ontogeny of spatially organized multicellular organisms is required. Together with this and further studies on their biology, the key challenges now are to determine the structure?function relationships of lncRNAs, which may be aided by emerging evidence of their modular structure, the role of RNA editing and modification in enabling epigenetic plasticity, and the role of RNA signaling in transgenerational inheritance of experience.


September 22, 2019  |  

Metagenomic binning and association of plasmids with bacterial host genomes using DNA methylation.

Shotgun metagenomics methods enable characterization of microbial communities in human microbiome and environmental samples. Assembly of metagenome sequences does not output whole genomes, so computational binning methods have been developed to cluster sequences into genome ‘bins’. These methods exploit sequence composition, species abundance, or chromosome organization but cannot fully distinguish closely related species and strains. We present a binning method that incorporates bacterial DNA methylation signatures, which are detected using single-molecule real-time sequencing. Our method takes advantage of these endogenous epigenetic barcodes to resolve individual reads and assembled contigs into species- and strain-level bins. We validate our method using synthetic and real microbiome sequences. In addition to genome binning, we show that our method links plasmids and other mobile genetic elements to their host species in a real microbiome sample. Incorporation of DNA methylation information into shotgun metagenomics analyses will complement existing methods to enable more accurate sequence binning.


September 22, 2019  |  

Characterization of four C1q/TNF-related proteins (CTRPs) from red-lip mullet (Liza haematocheila) and their transcriptional modulation in response to bacterial and pathogen-associated molecular pattern stimuli.

The structural and evolutionary linkage between tumor necrosis factor (TNF) and the globular C1q (gC1q) domain defines the C1q and TNF-related proteins (CTRPs), which are involved in diverse functions such as immune defense, inflammation, apoptosis, autoimmunity, and cell differentiation. In this study, red-lip mullet (Liza haematocheila) CTRP4-like (MuCTRP4-like), CTRP5 (MuCTRP5), CTRP6 (MuCTRP6), and CTRP7 (MuCTRP7) were identified from the red-lip mullet transcriptome database and molecularly characterized. According to in silico analysis, coding sequences of MuCTRP4-like, MuCTRP5, MuCTRP6, and MuCTRP7 consisted of 1128, 753, 729, and 888 bp open reading frames (ORF), respectively and encoded 375, 250, 242, and 295 amino acids, respectively. All CTRPs possessed a putative C1q domain. Additionally, MuCTRP5, MuCTRP6, and MuCTRP7 consisted of a collagen region. Phylogenetic analysis exemplified that MuCTRPs were distinctly clustered with the respective CTRP orthologs. Tissue-specific expression analysis demonstrated that MuCTRP4-like was mostly expressed in the blood and intestine. Moreover, MuCTRP6 was highly expressed in the blood, whereas MuCTRP5 and MuCTRP7 were predominantly expressed in the muscle and stomach, respectively. According to the temporal expression in blood, all MuCTRPs exhibited significant modulations in response to polyinosinic:polycytidylic acid (poly I:C) and Lactococcus garvieae (L. garvieae). MuCTRP4-like, MuCTRP5, and MuCTRP6 showed significant upregulation in response to lipopolysaccharides (LPS). The results of this study suggest the potential involvement of Mullet CTRPs in post-immune responses. Copyright © 2018. Published by Elsevier Ltd.


September 22, 2019  |  

Researches on transcriptome sequencing in the study of traditional Chinese medicine

Due to its incomparable advantages, the application of transcriptome sequencing in the study of traditional Chinese medicine attracts more and more attention of researchers, which greatly promote the development of traditional Chinese medicine. In this paper, the applications of transcriptome sequencing in traditional Chinese medicine were summarized by reviewing recent related papers.


September 22, 2019  |  

Alternative polyadenylation: methods, findings, and impacts.

Alternative polyadenylation (APA), a phenomenon that RNA molecules with different 3′ ends originate from distinct polyadenylation sites of a single gene, is emerging as a mechanism widely used to regulate gene expression. In the present review, we first summarized various methods prevalently adopted in APA study, mainly focused on the next-generation sequencing (NGS)-based techniques specially designed for APA identification, the related bioinformatics methods, and the strategies for APA study in single cells. Then we summarized the main findings and advances so far based on these methods, including the preferences of alternative polyA (pA) site, the biological processes involved, and the corresponding consequences. We especially categorized the APA changes discovered so far and discussed their potential functions under given conditions, along with the possible underlying molecular mechanisms. With more in-depth studies on extensive samples, more signatures and functions of APA will be revealed, and its diverse roles will gradually heave in sight. Copyright © 2017 The Authors. Production and hosting by Elsevier B.V. All rights reserved.


September 22, 2019  |  

Differential TGFß pathway targeting by miR-122 in humans and mice affects liver cancer metastasis.

Downregulation of a predominantly hepatocyte-specific miR-122 is associated with human liver cancer metastasis, whereas miR-122-deficient mice display normal liver function. Here we show a functional conservation of miR-122 in the TGFß pathway: miR-122 target site is present in the mouse but not human TGFßR1, whereas a noncanonical target site is present in the TGFß1 5’UTR in humans and other primates. Experimental switch of the miR-122 target between the receptor TGFßR1 and the ligand TGFß1 changes the metastatic properties of mouse and human liver cancer cells. High expression of TGFß1 in human primary liver tumours is associated with poor survival. We identify over 50 other miRNAs orthogonally targeting ligand/receptor pairs in humans and mice, suggesting that these are evolutionarily common events. These results reveal an evolutionary mechanism for miRNA-mediated gene regulation underlying species-specific physiological or pathological phenotype and provide a potentially valuable strategy for treating liver-associated diseases.


September 22, 2019  |  

Identification and characterization of a carboxypeptidase N1 from red lip mullet (Liza haematocheila); revealing its immune relevance.

Complement system orchestrates the innate and adaptive immunity via the activation, recruitment, and regulation of immune molecules to destroy pathogens. However, regulation of the complement is essential to avoid injuries to the autologous tissues. The present study unveils the characteristic features of an important complement component, anaphylatoxin inactivator from red lip mullet at its molecular and functional level. Mullet carboxypeptidase N1 (MuCPN1) cDNA sequence possessed an open reading frame of 1347 bp, which encoded a protein of 449 amino acids with a predicted molecular weight of 51?kDa. In silico analysis discovered two domains of PM14-Zn carboxypeptidase and a C-terminal domain of M14 N/E carboxypeptidase, two zinc-binding signature motifs, and an N-glycosylation site in the MuCPN1 sequence. Homology analysis revealed that most of the residues in the sequence are conserved among the other selected homologs. Phylogeny analysis showed that MuCPN1 closely cladded with the Maylandia zebra CPN1 and clustered together with the teleostean counterparts. A challenge experiment showed modulated expression of MuCPN1 upon polyinosinic:polycytidylic acid and Lactococcus garviae in head kidney, spleen, gill, and liver tissues. The highest upregulation of MuCPN1 was observed 24?h post infection against poly I:C in each tissue. Moreover, the highest relative expressions upon L. garviae challenge were observed at 24?h post infection in head kidney tissue and 48?h post infection in spleen, gill, and liver tissues. MuCPN1 transfected cells triggered a 2.2-fold increase of nitric oxide (NO) production upon LPS stimulation compared to the un-transfected controls suggesting that MuCPN1 is an active protease which releases arginine from complement C3a, C4a, and C5a. These results have driven certain way towards enhancing the understanding of immune role of MuCPN1 in the complement defense mechanism of red lip mullet. Copyright © 2018 Elsevier Ltd. All rights reserved.


September 22, 2019  |  

The third revolution in sequencing technology.

Forty years ago the advent of Sanger sequencing was revolutionary as it allowed complete genome sequences to be deciphered for the first time. A second revolution came when next-generation sequencing (NGS) technologies appeared, which made genome sequencing much cheaper and faster. However, NGS methods have several drawbacks and pitfalls, most notably their short reads. Recently, third-generation/long-read methods appeared, which can produce genome assemblies of unprecedented quality. Moreover, these technologies can directly detect epigenetic modifications on native DNA and allow whole-transcript sequencing without the need for assembly. This marks the third revolution in sequencing technology. Here we review and compare the various long-read methods. We discuss their applications and their respective strengths and weaknesses and provide future perspectives. Copyright © 2018 Elsevier Ltd. All rights reserved.


September 22, 2019  |  

Bayesian nonparametric discovery of isoforms and individual specific quantification.

Most human protein-coding genes can be transcribed into multiple distinct mRNA isoforms. These alternative splicing patterns encourage molecular diversity, and dysregulation of isoform expression plays an important role in disease etiology. However, isoforms are difficult to characterize from short-read RNA-seq data because they share identical subsequences and occur in different frequencies across tissues and samples. Here, we develop BIISQ, a Bayesian nonparametric model for isoform discovery and individual specific quantification from short-read RNA-seq data. BIISQ does not require isoform reference sequences but instead estimates an isoform catalog shared across samples. We use stochastic variational inference for efficient posterior estimates and demonstrate superior precision and recall for simulations compared to state-of-the-art isoform reconstruction methods. BIISQ shows the most gains for low abundance isoforms, with 36% more isoforms correctly inferred at low coverage versus a multi-sample method and 170% more versus single-sample methods. We estimate isoforms in the GEUVADIS RNA-seq data and validate inferred isoforms by associating genetic variants with isoform ratios.


September 22, 2019  |  

Proteomic detection of immunoglobulin light chain variable region peptides from amyloidosis patient biopsies.

Immunoglobulin light chain (LC) amyloidosis (AL) is caused by deposition of clonal LCs produced by an underlying plasma cell neoplasm. The clonotypic LC sequences are unique to each patient, and they cannot be reliably detected by either immunoassays or standard proteomic workflows that target the constant regions of LCs. We addressed this issue by developing a novel sequence template-based workflow to detect LC variable (LCV) region peptides directly from AL amyloid deposits. The workflow was implemented in a CAP/CLIA compliant clinical laboratory dedicated to proteomic subtyping of amyloid deposits extracted from either formalin-fixed paraffin-embedded tissues or subcutaneous fat aspirates. We evaluated the performance of the workflow on a validation cohort of 30 AL patients, whose amyloidogenic clone was identified using a novel proteogenomics method, and 30 controls. The recall and negative predictive values of the workflow, when identifying the gene family of the AL clone, were 93 and 98%, respectively. Application of the workflow on a clinical cohort of 500 AL amyloidosis samples highlighted a bias in the LCV gene families used by the AL clones. We also detected similarity between AL clones deposited in multiple organs of systemic AL patients. In summary, AL proteomic data sets are rich in LCV region peptides of potential clinical significance that are recoverable with advanced bioinformatics.


September 22, 2019  |  

Rodent papillomaviruses.

Preclinical infection model systems are extremely valuable tools to aid in our understanding of Human Papillomavirus (HPV) biology, disease progression, prevention, and treatments. In this context, rodent papillomaviruses and their respective infection models are useful tools but remain underutilized resources in the field of papillomavirus biology. Two rodent papillomaviruses, MnPV1, which infects the Mastomys species of multimammate rats, and MmuPV1, which infects laboratory mice, are currently the most studied rodent PVs. Both of these viruses cause malignancy in the skin and can provide attractive infection models to study the lesser understood cutaneous papillomaviruses that have been frequently associated with HPV-related skin cancers. Of these, MmuPV1 is the first reported rodent papillomavirus that can naturally infect the laboratory strain of mice. MmuPV1 is an attractive model virus to study papillomavirus pathogenesis because of the ubiquitous availability of lab mice and the fact that this mouse species is genetically modifiable. In this review, we have summarized the knowledge we have gained about PV biology from the study of rodent papillomaviruses and point out the remaining gaps that can provide new research opportunities.


September 22, 2019  |  

Influenza virus infection causes global RNAPII termination defects.

Viral infection perturbs host cells and can be used to uncover regulatory mechanisms controlling cellular responses and susceptibility to infections. Using cell biological, biochemical, and genetic tools, we reveal that influenza A virus (IAV) infection induces global transcriptional defects at the 3′ ends of active host genes and RNA polymerase II (RNAPII) run-through into extragenic regions. Deregulated RNAPII leads to expression of aberrant RNAs (3′ extensions and host-gene fusions) that ultimately cause global transcriptional downregulation of physiological transcripts, an effect influencing antiviral response and virulence. This phenomenon occurs with multiple strains of IAV, is dependent on influenza NS1 protein, and can be modulated by SUMOylation of an intrinsically disordered region (IDR) of NS1 expressed by the 1918 pandemic IAV strain. Our data identify a strategy used by IAV to suppress host gene expression and indicate that polymorphisms in IDRs of viral proteins can affect the outcome of an infection.


September 22, 2019  |  

Long-read sequencing revealed an extensive transcript complexity in herpesviruses.

Long-read sequencing (LRS) techniques are very recent advancements, but they have already been used for transcriptome research in all of the three subfamilies of herpesviruses. These techniques have multiplied the number of known transcripts in each of the examined viruses. Meanwhile, they have revealed a so far hidden complexity of the herpesvirus transcriptome with the discovery of a large number of novel RNA molecules, including coding and non-coding RNAs, as well as transcript isoforms, and polycistronic RNAs. Additionally, LRS techniques have uncovered an intricate meshwork of transcriptional overlaps between adjacent and distally located genes. Here, we review the contribution of LRS to herpesvirus transcriptomics and present the complexity revealed by this technology, while also discussing the functional significance of this phenomenon.


September 22, 2019  |  

Lipoprotein lipase reaches the capillary lumen in chickens despite an apparent absence of GPIHBP1.

In mammals, GPIHBP1 is absolutely essential for transporting lipoprotein lipase (LPL) to the lumen of capillaries, where it hydrolyzes the triglycerides in triglyceride-rich lipoproteins. In all lower vertebrate species (e.g., birds, amphibians, reptiles, fish), a gene for LPL can be found easily, but a gene for GPIHBP1 has never been found. The obvious question is whether the LPL in lower vertebrates is able to reach the capillary lumen. Using purified antibodies against chicken LPL, we showed that LPL is present on capillary endothelial cells of chicken heart and adipose tissue, colocalizing with von Willebrand factor. When the antibodies against chicken LPL were injected intravenously into chickens, they bound to LPL on the luminal surface of capillaries in heart and adipose tissue. LPL was released rapidly from chicken hearts with an infusion of heparin, consistent with LPL being located inside blood vessels. Remarkably, chicken LPL bound in a specific fashion to mammalian GPIHBP1. However, we could not identify a gene for GPIHBP1 in the chicken genome, nor could we identify a transcript for GPIHBP1 in a large chicken RNA-seq data set. We conclude that LPL reaches the capillary lumen in chickens – as it does in mammals – despite an apparent absence of GPIHBP1.


September 22, 2019  |  

Cataloguing over-expressed genes in Epstein Barr Virus immortalized lymphoblastoid cell lines through consensus analysis of PacBio transcriptomes corroborates hypomethylation of chromosome 1

The ability of Epstein Barr Virus (EBV) to transform resting cell B-cells into immortalized lymphoblastoid cell lines (LCL) provides a continuous source of peripheral blood lymphocytes that are used to model conditions in which these lymphocytes play a key role. Here, the PacBio generated transcriptome of three LCLs from a parent-daughter trio (SRAid:SRP036136) provided by a previous study [1] were analyzed using a kmer-based version of YeATS (KEATS). The set of over-expressed genes in these cell lines were determined based on a comparison with the PacBio transcriptome of twenty tissues pro- vided by another study (hOPTRS) [2]. MIR155 long non-coding RNA (MIR155HG), Fc fragment of IgE receptor II (FCER2), T-cell leukemia/lymphoma 1A (TCL1A), and germinal center associated signaling and motility (GCSAM) were genes having the highest expression counts in the three LCLs with no expression in hOPTRS. Other over-expressed genes, having low expression in hOPTRS, were membrane spanning 4-domains A1 (MS4A1) and ribosomal protein S2 pseudogene 55 (RPS2P55). While some of these genes are known to be over-expressed in LCLs, this study provides a comprehensive cataloguing of such genes. A recent work involving a patient with EBV-positive large B-cell lymphoma was “unusually lacking various B-cell markers”, but over-expressing CD30 [3] – a gene ranked 79 among uniquely expressed genes here. Hypomethylation of chromosome 1 observed in EBV immortalized LCLs [4, 5] is also corroborated here by mapping the genes to chromosomes. Extending previous work identifying un-annotated genes [6], 80 genes were identified which are expressed in the three LCLs, not in hOPTRS, and missing in the GENCODE, RefSeq and RefSeqGene databases. KEATS introduces a method of determining expression counts based on a partitioning of the known annotated genes, has runtimes of a few hours on a personal workstation and provides detailed reports enabling proper debugging.


Talk with an expert

If you have a question, need to check the status of an order, or are interested in purchasing an instrument, we're here to help.