Menu
September 22, 2019

Analyses of alternative polyadenylation: from old school biochemistry to high-throughput technologies.

Alternations in usage of polyadenylation sites during transcription termination yield transcript isoforms from a gene. Recent findings of transcriptome-wide alternative polyadenylation (APA) as a molecular response to changes in biology position APA not only as a molecular event of early transcriptional termination but also as a cellular regulatory step affecting various biological pathways. With the development of high-throughput profiling technologies at a single nucleotide level and their applications targeted to the 3′-end of mRNAs, dynamics in the landscape of mRNA 3′-end is measureable at a global scale. In this review, methods and technologies that have been adopted to study APA events are discussed. In addition, various bioinformatics algorithms for APA isoform analysis using publicly available RNA-seq datasets are introduced. [BMB Reports 2017; 50(4): 201-207].


September 22, 2019

Targeted sequencing of ampliconic gene transcripts from total human male testis RNA

The protocol summarizes all the steps towards the construction of PacBio and Illumina sequencing libraries of transcripts from nine Y chromosome ampliconic gene families starting from total human male testis RNA from two human males. This method is based on PacBio’s Isoform Sequencing method (Iso-Seq) using the Clontech SMARTer PCR cDNA Synthesis Kit and No Size Selection with some modifications.


September 22, 2019

Single-cell multiomics: multiple measurements from single cells.

Single-cell sequencing provides information that is not confounded by genotypic or phenotypic heterogeneity of bulk samples. Sequencing of one molecular type (RNA, methylated DNA or open chromatin) in a single cell, furthermore, provides insights into the cell’s phenotype and links to its genotype. Nevertheless, only by taking measurements of these phenotypes and genotypes from the same single cells can such inferences be made unambiguously. In this review, we survey the first experimental approaches that assay, in parallel, multiple molecular types from the same single cell, before considering the challenges and opportunities afforded by these and future technologies. Copyright © 2016. Published by Elsevier Ltd.


September 22, 2019

Different next generation sequencing platforms produce different microbial profiles and diversity in cystic fibrosis sputum.

Cystic fibrosis (CF) is an autosomal recessive disease characterized by recurrent lung infections. Studies of the lung microbiome have shown an association between decreasing diversity and progressive disease. 454 pyrosequencing has frequently been used to study the lung microbiome in CF, but will no longer be supported. We sought to identify the benefits and drawbacks of using two state-of-the-art next generation sequencing (NGS) platforms, MiSeq and PacBio RSII, to characterize the CF lung microbiome. Each has its advantages and limitations.Twelve samples of extracted bacterial DNA were sequenced on both MiSeq and PacBio NGS platforms. DNA was amplified for the V4 region of the 16S rRNA gene and libraries were sequenced on the MiSeq sequencing platform, while the full 16S rRNA gene was sequenced on the PacBio RSII sequencing platform. Raw FASTQ files generated by the MiSeq and PacBio platforms were processed in mothur v1.35.1.There was extreme discordance in alpha-diversity of the CF lung microbiome when using the two platforms. Because of its depth of coverage, sequencing of the 16S rRNA V4 gene region using MiSeq allowed for the observation of many more operational taxonomic units (OTUs) and higher Chao1 and Shannon indices than the PacBio RSII. Interestingly, several patients in our cohort had Escherichia, an unusual pathogen in CF. Also, likely because of its coverage of the complete 16S rRNA gene, only PacBio RSII was able to identify Burkholderia, an important CF pathogen.When comparing microbiome diversity in clinical samples from CF patients using 16S sequences, MiSeq and PacBio NGS platforms may generate different results in microbial community composition and structure. It may be necessary to use different platforms when trying to correctly identify dominant pathogens versus measuring alpha-diversity estimates, and it would be important to use the same platform for comparisons to minimize errors in interpretation. Copyright © 2016 Elsevier B.V. All rights reserved.


September 22, 2019

Characterization of fusion genes and the significantly expressed fusion isoforms in breast cancer by hybrid sequencing.

We developed an innovative hybrid sequencing approach, IDP-fusion, to detect fusion genes, determine fusion sites and identify and quantify fusion isoforms. IDP-fusion is the first method to study gene fusion events by integrating Third Generation Sequencing long reads and Second Generation Sequencing short reads. We applied IDP-fusion to PacBio data and Illumina data from the MCF-7 breast cancer cells. Compared with the existing tools, IDP-fusion detects fusion genes at higher precision and a very low false positive rate. The results show that IDP-fusion will be useful for unraveling the complexity of multiple fusion splices and fusion isoforms within tumorigenesis-relevant fusion genes. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.


September 22, 2019

Long-read sequencing of human cytomegalovirus transcriptome reveals RNA isoforms carrying distinct coding potentials.

The human cytomegalovirus (HCMV) is a ubiquitous, human pathogenic herpesvirus. The complete viral genome is transcriptionally active during infection; however, a large part of its transcriptome has yet to be annotated. In this work, we applied the amplified isoform sequencing technique from Pacific Biosciences to characterize the lytic transcriptome of HCMV strain Towne varS. We developed a pipeline for transcript annotation using long-read sequencing data. We identified 248 transcriptional start sites, 116 transcriptional termination sites and 80 splicing events. Using this information, we have annotated 291 previously undescribed or only partially annotated transcript isoforms, including eight novel antisense transcripts and their isoforms, as well as a novel transcript (RS2) in the short repeat region, partially antisense to RS1. Similarly to other organisms, we discovered a high transcriptional diversity in HCMV, with many transcripts only slightly differing from one another. Comparing our transcriptome profiling results to an earlier ribosome footprint analysis, we have concluded that the majority of the transcripts contain multiple translationally active ORFs, and also that most isoforms contain unique combinations of ORFs. Based on these results, we propose that one important function of this transcriptional diversity may be to provide a regulatory mechanism at the level of translation.


September 22, 2019

Improved full-length killer cell immunoglobulin-like receptor transcript discovery in Mauritian cynomolgus macaques.

Killer cell immunoglobulin-like receptors (KIRs) modulate disease progression of pathogens including HIV, malaria, and hepatitis C. Cynomolgus and rhesus macaques are widely used as nonhuman primate models to study human pathogens, and so, considerable effort has been put into characterizing their KIR genetics. However, previous studies have relied on cDNA cloning and Sanger sequencing that lack the throughput of current sequencing platforms. In this study, we present a high throughput, full-length allele discovery method utilizing Pacific Biosciences circular consensus sequencing (CCS). We also describe a new approach to Macaque Exome Sequencing (MES) and the development of the Rhexome1.0, an adapted target capture reagent that includes macaque-specific capture probe sets. By using sequence reads generated by whole genome sequencing (WGS) and MES to inform primer design, we were able to increase the sensitivity of KIR allele discovery. We demonstrate this increased sensitivity by defining nine novel alleles within a cohort of Mauritian cynomolgus macaques (MCM), a geographically isolated population with restricted KIR genetics that was thought to be completely characterized. Finally, we describe an approach to genotyping KIRs directly from sequence reads generated using WGS/MES reads. The findings presented here expand our understanding of KIR genetics in MCM by associating new genes with all eight KIR haplotypes and demonstrating the existence of at least one KIR3DS gene associated with every haplotype.


September 22, 2019

Quantitative isoform-profiling of highly diversified recognition molecules.

Complex biological systems rely on cell surface cues that govern cellular self-recognition and selective interactions with appropriate partners. Molecular diversification of cell surface recognition molecules through DNA recombination and complex alternative splicing has emerged as an important principle for encoding such interactions. However, the lack of tools to specifically detect and quantify receptor protein isoforms is a major impediment to functional studies. We here developed a workflow for targeted mass spectrometry by selected reaction monitoring (SRM) that permits quantitative assessment of highly diversified protein families. We apply this workflow to dissecting the molecular diversity of the neuronal neurexin receptors and uncover an alternative splicing-dependent recognition code for synaptic ligands.


September 22, 2019

KIR3DL01 upregulation on gut natural killer cells in response to SIV infection of KIR- and MHC class I-defined rhesus macaques.

Natural killer cells provide an important early defense against viral pathogens and are regulated in part by interactions between highly polymorphic killer-cell immunoglobulin-like receptors (KIRs) on NK cells and their MHC class I ligands on target cells. We previously identified MHC class I ligands for two rhesus macaque KIRs: KIR3DL01 recognizes Mamu-Bw4 molecules and KIR3DL05 recognizes Mamu-A1*002. To determine how these interactions influence NK cell responses, we infected KIR3DL01+ and KIR3DL05+ macaques with and without defined ligands for these receptors with SIVmac239, and monitored NK cell responses in peripheral blood and lymphoid tissues. NK cell responses in blood were broadly stimulated, as indicated by rapid increases in the CD16+ population during acute infection and sustained increases in the CD16+ and CD16-CD56- populations during chronic infection. Markers of proliferation (Ki-67), activation (CD69 & HLA-DR) and antiviral activity (CD107a & TNFa) were also widely expressed, but began to diverge during chronic infection, as reflected by sustained CD107a and TNFa upregulation by KIR3DL01+, but not by KIR3DL05+ NK cells. Significant increases in the frequency of KIR3DL01+ (but not KIR3DL05+) NK cells were also observed in tissues, particularly in the gut-associated lymphoid tissues, where this receptor was preferentially upregulated on CD56+ and CD16-CD56- subsets. These results reveal broad NK cell activation and dynamic changes in the phenotypic properties of NK cells in response to SIV infection, including the enrichment of KIR3DL01+ NK cells in tissues that support high levels of virus replication.


September 22, 2019

Predominant contribution of cis-regulatory divergence in the evolution of mouse alternative splicing.

Divergence of alternative splicing represents one of the major driving forces to shape phenotypic diversity during evolution. However, the extent to which these divergences could be explained by the evolving cis-regulatory versus trans-acting factors remains unresolved. To globally investigate the relative contributions of the two factors for the first time in mammals, we measured splicing difference between C57BL/6J and SPRET/EiJ mouse strains and allele-specific splicing pattern in their F1 hybrid. Out of 11,818 alternative splicing events expressed in the cultured fibroblast cells, we identified 796 with significant difference between the parental strains. After integrating allele-specific data from F1 hybrid, we demonstrated that these events could be predominately attributed to cis-regulatory variants, including those residing at and beyond canonical splicing sites. Contrary to previous observations in Drosophila, such predominant contribution was consistently observed across different types of alternative splicing. Further analysis of liver tissues from the same mouse strains and reanalysis of published datasets on other strains showed similar trends, implying in general the predominant contribution of cis-regulatory changes in the evolution of mouse alternative splicing. © 2015 The Authors. Published under the terms of the CC BY 4.0 license.


September 22, 2019

Long reads: their purpose and place.

In recent years long-read technologies have moved from being a niche and specialist field to a point of relative maturity likely to feature frequently in the genomic landscape. Analogous to next generation sequencing, the cost of sequencing using long-read technologies has materially dropped whilst the instrument throughput continues to increase. Together these changes present the prospect of sequencing large numbers of individuals with the aim of fully characterizing genomes at high resolution. In this article, we will endeavour to present an introduction to long-read technologies showing: what long reads are; how they are distinct from short reads; why long reads are useful and how they are being used. We will highlight the recent developments in this field, and the applications and potential of these technologies in medical research, and clinical diagnostics and therapeutics.


September 22, 2019

Identification of a novel fusion transcript between human relaxin-1 (RLN1) and human relaxin-2 (RLN2) in prostate cancer.

Simultaneous expression of highly homologous RLN1 and RLN2 genes in prostate impairs their accurate delineation. We used PacBio SMRT sequencing and RNA-Seq in LNCaP cells in order to dissect the expression of RLN1 and RLN2 variants. We identified a novel fusion transcript comprising the RLN1 and RLN2 genes and found evidence of its expression in the normal and prostate cancer tissues. The RLN1-RLN2 fusion putatively encodes RLN2 isoform with the deleted secretory signal peptide. The identification of the fusion transcript provided information to determine unique RLN1-RLN2 fusion and RLN1 regions. The RLN1-RLN2 fusion was co-expressed with RLN1 in LNCaP cells, but the two gene products were inversely regulated by androgens. We showed that RLN1 is underrepresented in common PCa cell lines in comparison to normal and PCa tissue. The current study brings a highly relevant update to the relaxin field, and will encourage further studies of RLN1 and RLN2 in PCa and broader. Copyright © 2015 The Authors. Published by Elsevier Ireland Ltd.. All rights reserved.


September 22, 2019

Mutational landscape of antibody variable domains reveals a switch modulating the interdomain conformational dynamics and antigen binding.

Somatic mutations within the antibody variable domains are critical to the immense capacity of the immune repertoire. Here, via a deep mutational scan, we dissect how mutations at all positions of the variable domains of a high-affinity anti-VEGF antibody G6.31 impact its antigen-binding function. The resulting mutational landscape demonstrates that large portions of antibody variable domain positions are open to mutation, and that beneficial mutations can be found throughout the variable domains. We determine the role of one antigen-distal light chain position 83, demonstrating that mutation at this site optimizes both antigen affinity and thermostability by modulating the interdomain conformational dynamics of the antigen-binding fragment. Furthermore, by analyzing a large number of human antibody sequences and structures, we demonstrate that somatic mutations occur frequently at position 83, with corresponding domain conformations observed for G6.31. Therefore, the modulation of interdomain dynamics represents an important mechanism during antibody maturation in vivo.


September 22, 2019

Diverse antibiotic resistance genes in dairy cow manure.

Application of manure from antibiotic-treated animals to crops facilitates the dissemination of antibiotic resistance determinants into the environment. However, our knowledge of the identity, diversity, and patterns of distribution of these antibiotic resistance determinants remains limited. We used a new combination of methods to examine the resistome of dairy cow manure, a common soil amendment. Metagenomic libraries constructed with DNA extracted from manure were screened for resistance to beta-lactams, phenicols, aminoglycosides, and tetracyclines. Functional screening of fosmid and small-insert libraries identified 80 different antibiotic resistance genes whose deduced protein sequences were on average 50 to 60% identical to sequences deposited in GenBank. The resistance genes were frequently found in clusters and originated from a taxonomically diverse set of species, suggesting that some microorganisms in manure harbor multiple resistance genes. Furthermore, amid the great genetic diversity in manure, we discovered a novel clade of chloramphenicol acetyltransferases. Our study combined functional metagenomics with third-generation PacBio sequencing to significantly extend the roster of functional antibiotic resistance genes found in animal gut bacteria, providing a particularly broad resource for understanding the origins and dispersal of antibiotic resistance genes in agriculture and clinical settings. IMPORTANCE The increasing prevalence of antibiotic resistance among bacteria is one of the most intractable challenges in 21st-century public health. The origins of resistance are complex, and a better understanding of the impacts of antibiotics used on farms would produce a more robust platform for public policy. Microbiomes of farm animals are reservoirs of antibiotic resistance genes, which may affect distribution of antibiotic resistance genes in human pathogens. Previous studies have focused on antibiotic resistance genes in manures of animals subjected to intensive antibiotic use, such as pigs and chickens. Cow manure has received less attention, although it is commonly used in crop production. Here, we report the discovery of novel and diverse antibiotic resistance genes in the cow microbiome, demonstrating that it is a significant reservoir of antibiotic resistance genes. The genomic resource presented here lays the groundwork for understanding the dispersal of antibiotic resistance from the agroecosystem to other settings.


September 22, 2019

High-throughput annotation of full-length long noncoding RNAs with capture long-read sequencing.

Accurate annotation of genes and their transcripts is a foundation of genomics, but currently no annotation technique combines throughput and accuracy. As a result, reference gene collections remain incomplete-many gene models are fragmentary, and thousands more remain uncataloged, particularly for long noncoding RNAs (lncRNAs). To accelerate lncRNA annotation, the GENCODE consortium has developed RNA Capture Long Seq (CLS), which combines targeted RNA capture with third-generation long-read sequencing. Here we present an experimental reannotation of the GENCODE intergenic lncRNA populations in matched human and mouse tissues that resulted in novel transcript models for 3,574 and 561 gene loci, respectively. CLS approximately doubled the annotated complexity of targeted loci, outperforming existing short-read techniques. Full-length transcript models produced by CLS enabled us to definitively characterize the genomic features of lncRNAs, including promoter and gene structure, and protein-coding potential. Thus, CLS removes a long-standing bottleneck in transcriptome annotation and generates manual-quality full-length transcript models at high-throughput scales.


Talk with an expert

If you have a question, need to check the status of an order, or are interested in purchasing an instrument, we're here to help.