September 22, 2019  |  

An improved assembly and annotation of the allohexaploid wheat genome identifies complete families of agronomic genes and provides genomic evidence for chromosomal translocations.

Advances in genome sequencing and assembly technologies are generating many high-quality genome sequences, but assemblies of large, repeat-rich polyploid genomes, such as that of bread wheat, remain fragmented and incomplete. We have generated a new wheat whole-genome shotgun sequence assembly using a combination of optimized data types and an assembly algorithm designed to deal with large and complex genomes. The new assembly represents >78% of the genome with a scaffold N50 of 88.8 kb that has a high fidelity to the input data. Our new annotation combines strand-specific Illumina RNA-seq and Pacific Biosciences (PacBio) full-length cDNAs to identify 104,091 high-confidence protein-coding genes and 10,156 noncoding RNA genes. We confirmed three known genome rearrangements and identified one novel rearrangement. Our approach enables the rapid and scalable assembly of wheat genomes, the identification of structural variants, and the definition of complete gene models, all powerful resources for trait analysis and breeding of this key global crop. © 2017 Clavijo et al.; Published by Cold Spring Harbor Laboratory Press.
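
The scaffold N50 quoted above is a standard contiguity statistic: the length L such that scaffolds of length L or longer together cover at least half of the assembled bases. As a minimal sketch (the function name `n50` and the toy lengths are illustrative assumptions, not taken from the paper), it could be computed like this:

```python
def n50(lengths):
    """Return the N50 of a collection of sequence lengths.

    N50 is the length of the shortest sequence in the smallest set of
    longest sequences that together cover at least half of the total.
    """
    lengths = list(lengths)
    if not lengths:
        raise ValueError("n50() requires at least one length")
    total = sum(lengths)
    running = 0
    # Walk from the longest sequence down, accumulating covered bases.
    for length in sorted(lengths, reverse=True):
        running += length
        if 2 * running >= total:  # reached at least half of the total
            return length


# Toy example with hypothetical scaffold lengths (not real data).
toy_scaffolds = [2, 3, 4, 5, 6]
toy_n50 = n50(toy_scaffolds)
```

For the toy input the total is 20, and the two longest scaffolds (6 and 5) are the first to cover at least 10 bases, so the sketch returns 5.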


September 22, 2019  |  

Dynamic transcriptome profiling dataset of vaccinia virus obtained from long-read sequencing techniques.

Poxviruses are large DNA viruses that infect humans and animals. Vaccinia virus (VACV) has been applied as a live vaccine for immunization against smallpox, which was eradicated by 1980 as a result of worldwide vaccination. VACV is the prototype of poxviruses in the investigation of the molecular pathogenesis of the virus. Short-read sequencing methods have revolutionized transcriptomics; however, they are not efficient in distinguishing between RNA isoforms and transcript overlaps. Long-read sequencing (LRS) is much better suited to solving these problems and also allows direct RNA sequencing. Despite the scientific relevance of VACV, no LRS data have been generated for the viral transcriptome to date. For the deep characterization of the VACV RNA profile, various LRS platforms and library preparation approaches were applied. The raw reads were mapped to the VACV reference genome and also to the host (Chlorocebus sabaeus) genome. In this study, we applied the Pacific Biosciences RSII and Sequel platforms, which altogether resulted in 937,531 mapped reads of inserts (1.42 Gb), while we obtained 2,160,348 aligned reads (1.75 Gb) from the different library preparation methods using the MinION device from Oxford Nanopore Technologies. By applying cutting-edge technologies, we were able to generate a large dataset that can serve as a valuable resource for the investigation of the dynamic VACV transcriptome, the virus-host interactions, and RNA base modifications. These data can provide useful information for novel gene annotations in the VACV genome. Our dataset can also be used to analyze the currently available LRS platforms, library preparation methods, and bioinformatics pipelines.


September 22, 2019  |  

Complete genome sequence of multidrug-resistant Staphylococcus cohnii ssp. urealyticus strain SNUDS-2 isolated from farmed duck, Republic of Korea.

Staphylococcus cohnii has become increasingly recognized as a potential pathogen of clinically significant nosocomial and farm animal infections. This study was designed to determine the genome of a multidrug-resistant S. cohnii subsp. urealyticus strain SNUDS-2 isolated from a farmed duck in Korea. Genomic DNA was sequenced using the PacBio RS II system. The complete genome was annotated, and antimicrobial resistance and virulence genes were identified. The annotated 2,625,703 bp genome contained various antimicrobial resistance genes conferring resistance to β-lactams, aminoglycosides, fluoroquinolones, phenicols and trimethoprim. Three virulence-associated synergistic hemolysins were also identified in the strain. To the best of our knowledge, this is the first complete genome of S. cohnii, and it will provide important insights into the biodiversity of CoNS and valuable information for the control of this emerging pathogen. Copyright © 2017 International Society for Chemotherapy of Infection and Cancer. Published by Elsevier Ltd. All rights reserved.


September 22, 2019  |  

Characterization of four C1q/TNF-related proteins (CTRPs) from red-lip mullet (Liza haematocheila) and their transcriptional modulation in response to bacterial and pathogen-associated molecular pattern stimuli.

The structural and evolutionary linkage between tumor necrosis factor (TNF) and the globular C1q (gC1q) domain defines the C1q and TNF-related proteins (CTRPs), which are involved in diverse functions such as immune defense, inflammation, apoptosis, autoimmunity, and cell differentiation. In this study, red-lip mullet (Liza haematocheila) CTRP4-like (MuCTRP4-like), CTRP5 (MuCTRP5), CTRP6 (MuCTRP6), and CTRP7 (MuCTRP7) were identified from the red-lip mullet transcriptome database and molecularly characterized. According to in silico analysis, the coding sequences of MuCTRP4-like, MuCTRP5, MuCTRP6, and MuCTRP7 consisted of 1128, 753, 729, and 888 bp open reading frames (ORFs), respectively, and encoded 375, 250, 242, and 295 amino acids, respectively. All CTRPs possessed a putative C1q domain. Additionally, MuCTRP5, MuCTRP6, and MuCTRP7 contained a collagen region. Phylogenetic analysis showed that the MuCTRPs clustered distinctly with their respective CTRP orthologs. Tissue-specific expression analysis demonstrated that MuCTRP4-like was mostly expressed in the blood and intestine. Moreover, MuCTRP6 was highly expressed in the blood, whereas MuCTRP5 and MuCTRP7 were predominantly expressed in the muscle and stomach, respectively. According to the temporal expression in blood, all MuCTRPs exhibited significant modulations in response to polyinosinic:polycytidylic acid (poly I:C) and Lactococcus garvieae (L. garvieae). MuCTRP4-like, MuCTRP5, and MuCTRP6 showed significant upregulation in response to lipopolysaccharides (LPS). The results of this study suggest the potential involvement of mullet CTRPs in post-immune responses. Copyright © 2018. Published by Elsevier Ltd.
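
The ORF and protein lengths reported above are consistent with the usual arithmetic of three nucleotides per codon, with the final codon being a stop. A small sketch of that check follows; the helper name `protein_length` and the assumption that each reported ORF length includes the stop codon are ours, not stated in the abstract.

```python
def protein_length(orf_bp, includes_stop=True):
    """Return the number of amino acids encoded by an ORF of orf_bp bases.

    Assumes the ORF length is a multiple of three. If the length includes
    the terminal stop codon, that codon does not add an amino acid.
    """
    if orf_bp <= 0 or orf_bp % 3 != 0:
        raise ValueError("ORF length must be a positive multiple of 3")
    codons = orf_bp // 3
    return codons - 1 if includes_stop else codons


# ORF lengths (bp) as reported in the abstract, keyed by gene name.
reported_orfs = {
    "MuCTRP4-like": 1128,
    "MuCTRP5": 753,
    "MuCTRP6": 729,
    "MuCTRP7": 888,
}
predicted = {name: protein_length(bp) for name, bp in reported_orfs.items()}
```

Under that assumption the predicted lengths are 375, 250, 242, and 295 residues, matching the values given in the abstract.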


September 22, 2019  |  

Researches on transcriptome sequencing in the study of traditional Chinese medicine

Because of its distinctive advantages, transcriptome sequencing is attracting growing attention from researchers studying traditional Chinese medicine, and it has greatly promoted the development of the field. This paper summarizes the applications of transcriptome sequencing in traditional Chinese medicine by reviewing recent related papers.


September 22, 2019  |  

Differential increases of specific FMR1 mRNA isoforms in premutation carriers.

Over 40% of male and ~16% of female carriers of a premutation FMR1 allele (55-200 CGG repeats) will develop fragile X-associated tremor/ataxia syndrome, an adult onset neurodegenerative disorder, while about 20% of female carriers will develop fragile X-associated primary ovarian insufficiency. Marked elevation in FMR1 mRNA transcript levels has been observed with premutation alleles, and RNA toxicity due to increased mRNA levels is the leading molecular mechanism proposed for these disorders. However, although the FMR1 gene undergoes alternative splicing, it is unknown whether all or only some of the isoforms are overexpressed in premutation carriers and which isoforms may contribute to the premutation pathology. To address this question, we have applied a long-read sequencing approach using single-molecule real-time (SMRT) sequencing and qRT-PCR. Our SMRT sequencing analysis performed on peripheral blood mononuclear cells, fibroblasts and brain tissue samples derived from premutation carriers and controls revealed the existence of 16 isoforms of 24 predicted variants. Although the relative abundance of all mRNA isoforms was significantly increased in the premutation group, as expected based on the bulk increase in mRNA levels, there was a disproportionate (fourfold to sixfold) increase, relative to the overall increase in mRNA, in the abundance of isoforms spliced at both exons 12 and 14, specifically Iso10 and Iso10b, containing the complete exon 15 and differing only in splicing in exon 17. These findings suggest that RNA toxicity may arise from a relative increase of all FMR1 mRNA isoforms. Interestingly, the Iso10 and Iso10b mRNA isoforms, lacking the C-terminal functional sites for fragile X mental retardation protein function, are the most increased in premutation carriers relative to normal, suggesting a functional relevance in the pathology of FMR1-associated disorders. Published by the BMJ Publishing Group Limited.


September 22, 2019  |  

Alternative polyadenylation: methods, findings, and impacts.

Alternative polyadenylation (APA), a phenomenon in which RNA molecules with different 3′ ends originate from distinct polyadenylation sites of a single gene, is emerging as a mechanism widely used to regulate gene expression. In the present review, we first summarize the methods most commonly adopted in APA studies, focusing mainly on the next-generation sequencing (NGS)-based techniques specially designed for APA identification, the related bioinformatics methods, and the strategies for studying APA in single cells. We then summarize the main findings and advances obtained so far with these methods, including the preferences of alternative polyA (pA) sites, the biological processes involved, and the corresponding consequences. In particular, we categorize the APA changes discovered so far and discuss their potential functions under given conditions, along with the possible underlying molecular mechanisms. With more in-depth studies on extensive samples, more signatures and functions of APA will be revealed, and its diverse roles will gradually come into view. Copyright © 2017 The Authors. Production and hosting by Elsevier B.V. All rights reserved.


September 22, 2019  |  

Differential TGFβ pathway targeting by miR-122 in humans and mice affects liver cancer metastasis.

Downregulation of a predominantly hepatocyte-specific miR-122 is associated with human liver cancer metastasis, whereas miR-122-deficient mice display normal liver function. Here we show a functional conservation of miR-122 in the TGFβ pathway: miR-122 target site is present in the mouse but not human TGFβR1, whereas a noncanonical target site is present in the TGFβ1 5′ UTR in humans and other primates. Experimental switch of the miR-122 target between the receptor TGFβR1 and the ligand TGFβ1 changes the metastatic properties of mouse and human liver cancer cells. High expression of TGFβ1 in human primary liver tumours is associated with poor survival. We identify over 50 other miRNAs orthogonally targeting ligand/receptor pairs in humans and mice, suggesting that these are evolutionarily common events. These results reveal an evolutionary mechanism for miRNA-mediated gene regulation underlying species-specific physiological or pathological phenotype and provide a potentially valuable strategy for treating liver-associated diseases.


September 22, 2019  |  

Identification and characterization of a carboxypeptidase N1 from red lip mullet (Liza haematocheila); revealing its immune relevance.

The complement system orchestrates innate and adaptive immunity via the activation, recruitment, and regulation of immune molecules to destroy pathogens. However, regulation of complement is essential to avoid injury to autologous tissues. The present study describes the characteristic features of an important complement component, the anaphylatoxin inactivator from red lip mullet, at the molecular and functional levels. The mullet carboxypeptidase N1 (MuCPN1) cDNA sequence possessed an open reading frame of 1347 bp, which encoded a protein of 449 amino acids with a predicted molecular weight of 51 kDa. In silico analysis identified two domains of PM14-Zn carboxypeptidase and a C-terminal domain of M14 N/E carboxypeptidase, two zinc-binding signature motifs, and an N-glycosylation site in the MuCPN1 sequence. Homology analysis revealed that most of the residues in the sequence are conserved among the other selected homologs. Phylogenetic analysis showed that MuCPN1 grouped most closely with Maylandia zebra CPN1 and clustered together with its teleostean counterparts. A challenge experiment showed modulated expression of MuCPN1 in response to polyinosinic:polycytidylic acid (poly I:C) and Lactococcus garvieae in head kidney, spleen, gill, and liver tissues. The highest upregulation of MuCPN1 in response to poly I:C was observed at 24 h post infection in each tissue. Moreover, the highest relative expression upon L. garvieae challenge was observed at 24 h post infection in head kidney tissue and at 48 h post infection in spleen, gill, and liver tissues. MuCPN1-transfected cells showed a 2.2-fold increase in nitric oxide (NO) production upon LPS stimulation compared to the untransfected controls, suggesting that MuCPN1 is an active protease that releases arginine from complement C3a, C4a, and C5a. These results advance the understanding of the immune role of MuCPN1 in the complement defense mechanism of red lip mullet. Copyright © 2018 Elsevier Ltd. All rights reserved.


September 22, 2019  |  

The third revolution in sequencing technology.

Forty years ago the advent of Sanger sequencing was revolutionary as it allowed complete genome sequences to be deciphered for the first time. A second revolution came when next-generation sequencing (NGS) technologies appeared, which made genome sequencing much cheaper and faster. However, NGS methods have several drawbacks and pitfalls, most notably their short reads. Recently, third-generation/long-read methods appeared, which can produce genome assemblies of unprecedented quality. Moreover, these technologies can directly detect epigenetic modifications on native DNA and allow whole-transcript sequencing without the need for assembly. This marks the third revolution in sequencing technology. Here we review and compare the various long-read methods. We discuss their applications and their respective strengths and weaknesses and provide future perspectives. Copyright © 2018 Elsevier Ltd. All rights reserved.


September 22, 2019  |  

CLK-dependent exon recognition and conjoined gene formation revealed with a novel small molecule inhibitor.

CDC-like kinase phosphorylation of serine/arginine-rich proteins is central to RNA splicing reactions. Yet, the genomic network of CDC-like kinase-dependent RNA processing events remains poorly defined. Here, we explore the connectivity of genomic CDC-like kinase splicing functions by applying graduated, short-exposure, pharmacological CDC-like kinase inhibition using a novel small molecule (T3) with very high potency, selectivity, and cell-based stability. Using RNA-Seq, we define CDC-like kinase-responsive alternative splicing events, the large majority of which monotonically increase or decrease with increasing CDC-like kinase inhibition. We show that distinct RNA-binding motifs are associated with T3 response in skipped exons. Unexpectedly, we observe dose-dependent conjoined gene transcription, which is associated with motif enrichment in the last and second exons of upstream and downstream partners, respectively. siRNA knockdown of CLK2-associated genes significantly increases conjoined gene formation. Collectively, our results reveal an unexpected role for CDC-like kinase in conjoined gene formation, via regulation of 3′-end processing and associated splicing factors. The phosphorylation of serine/arginine-rich proteins by CDC-like kinase is a central regulatory mechanism for RNA splicing reactions. Here, the authors synthesize a novel small molecule CLK inhibitor and map CLK-responsive alternative splicing events and discover an effect on conjoined gene transcription.


September 22, 2019  |  

Separation and parallel sequencing of the genomes and transcriptomes of single cells using G&T-seq.

Parallel sequencing of a single cell’s genome and transcriptome provides a powerful tool for dissecting genetic variation and its relationship with gene expression. Here we present a detailed protocol for G&T-seq, a method for separation and parallel sequencing of genomic DNA and full-length polyA(+) mRNA from single cells. We provide step-by-step instructions for the isolation and lysis of single cells; the physical separation of polyA(+) mRNA from genomic DNA using a modified oligo-dT bead capture and the respective whole-transcriptome and whole-genome amplifications; and library preparation and sequence analyses of these amplification products. The method allows the detection of thousands of transcripts in parallel with the genetic variants captured by the DNA-seq data from the same single cell. G&T-seq differs from other currently available methods for parallel DNA and RNA sequencing from single cells, as it involves physical separation of the DNA and RNA and does not require bespoke microfluidics platforms. The process can be implemented manually or through automation. When performed manually, paired genome and transcriptome sequencing libraries from eight single cells can be produced in ~3 d by researchers experienced in molecular laboratory work. For users with experience in the programming and operation of liquid-handling robots, paired DNA and RNA libraries from 96 single cells can be produced in the same time frame. Sequence analysis and integration of single-cell G&T-seq DNA and RNA data requires a high level of bioinformatics expertise and familiarity with a wide range of informatics tools.


September 22, 2019  |  

Androgen receptor variant AR-V9 is co-expressed with AR-V7 in prostate cancer metastases and predicts abiraterone resistance.

Purpose: Androgen receptor (AR) variant AR-V7 is a ligand-independent transcription factor that promotes prostate cancer resistance to AR-targeted therapies. Accordingly, efforts are underway to develop strategies for monitoring and inhibiting AR-V7 in castration-resistant prostate cancer (CRPC). The purpose of this study was to understand whether other AR variants may be co-expressed with AR-V7 and promote resistance to AR-targeted therapies. Experimental Design: We utilized complementary short- and long-read sequencing of intact AR mRNA isoforms to characterize AR expression in CRPC models. Co-expression of AR-V7 and AR-V9 mRNA in CRPC metastases and circulating tumor cells was assessed by RNA-seq and RT-PCR, respectively. Expression of AR-V9 protein in CRPC models was evaluated with polyclonal antisera. Multivariate analysis was performed to test whether AR variant mRNA expression in metastatic tissues was associated with a 12-week progression-free survival endpoint in a prospective clinical trial of 78 CRPC-stage patients initiating therapy with the androgen synthesis inhibitor, abiraterone acetate. Results: AR-V9 was frequently co-expressed with AR-V7. Both AR variant species were found to share a common 3′ terminal cryptic exon, which rendered AR-V9 susceptible to experimental manipulations that were previously thought to target AR-V7 uniquely. AR-V9 promoted ligand-independent growth of prostate cancer cells. High AR-V9 mRNA expression in CRPC metastases was predictive of primary resistance to abiraterone acetate (HR = 4.0, 95% CI = 1.31-12.2, P = 0.02). Conclusions: AR-V9 may be an important component of therapeutic resistance in CRPC. Copyright ©2017, American Association for Cancer Research.


September 22, 2019  |  

L_RNA_scaffolder: scaffolding genomes with transcripts.

Generation of large mate-pair libraries is necessary for de novo genome assembly but the procedure is complex and time-consuming. Furthermore, in some complex genomes, it is hard to increase the N50 length even with large mate-pair libraries, which leads to low transcript coverage. Thus, it is necessary to develop other simple scaffolding approaches, to at least solve the elongation of transcribed fragments. We describe L_RNA_scaffolder, a novel genome scaffolding method that uses long transcriptome reads to order, orient and combine genomic fragments into larger sequences. To demonstrate the accuracy of the method, the zebrafish genome was scaffolded. With expanded human transcriptome data, the N50 of the human genome was doubled, and L_RNA_scaffolder outperformed most scaffolding results from existing scaffolders that employ mate-pair libraries. In these two examples, the transcript coverage was almost complete, especially for long transcripts. We applied L_RNA_scaffolder to the highly polymorphic pearl oyster draft genome, and the gene model length increased significantly. The simplicity and high throughput of RNA-seq data make this approach suitable for genome scaffolding. L_RNA_scaffolder is available at http://www.fishbrowser.org/software/L_RNA_scaffolder.


September 22, 2019  |  

Bayesian nonparametric discovery of isoforms and individual specific quantification.

Most human protein-coding genes can be transcribed into multiple distinct mRNA isoforms. These alternative splicing patterns encourage molecular diversity, and dysregulation of isoform expression plays an important role in disease etiology. However, isoforms are difficult to characterize from short-read RNA-seq data because they share identical subsequences and occur in different frequencies across tissues and samples. Here, we develop BIISQ, a Bayesian nonparametric model for isoform discovery and individual specific quantification from short-read RNA-seq data. BIISQ does not require isoform reference sequences but instead estimates an isoform catalog shared across samples. We use stochastic variational inference for efficient posterior estimates and demonstrate superior precision and recall for simulations compared to state-of-the-art isoform reconstruction methods. BIISQ shows the most gains for low abundance isoforms, with 36% more isoforms correctly inferred at low coverage versus a multi-sample method and 170% more versus single-sample methods. We estimate isoforms in the GEUVADIS RNA-seq data and validate inferred isoforms by associating genetic variants with isoform ratios.

