September 22, 2019  |  

Next generation sequencing technology: Advances and applications.

Impressive progress has been made in the field of Next Generation Sequencing (NGS). Through advancements in the fields of molecular biology and technical engineering, parallelization of the sequencing reaction has profoundly increased the total number of produced sequence reads per run. Current sequencing platforms allow for an unprecedented view into complex mixtures of RNA and DNA samples. NGS is currently evolving into a molecular microscope finding its way into virtually every field of biomedical research. In this chapter we review the technical background of the different commercially available NGS platforms with respect to template generation and the sequencing reaction and take a small step towards what the upcoming NGS technologies will bring. We close with an overview of different implementations of NGS into biomedical research. This article is part of a Special Issue entitled: From Genome to Function. Copyright © 2014 Elsevier B.V. All rights reserved.


September 22, 2019  |  

A survey of transcriptome complexity in Sus scrofa using single-molecule long-read sequencing.

Alternative splicing (AS) and fusion transcripts greatly expand transcriptome and proteome diversity. However, the reliability of these events and the extent of epigenetic mechanisms have not been adequately addressed, owing to uncertainty about the complete structure of mRNA. Here we combined single-molecule real-time sequencing, Illumina RNA-seq and DNA methylation data to characterize the landscape of DNA methylation in relation to AS, fusion isoform formation and lncRNA features, and to further unveil the transcriptome complexity of the pig. Our analysis identified an unprecedented scale of high-quality full-length isoforms, with over 28,127 novel isoforms from 26,881 novel genes. More than 92,000 novel AS events were detected; intron retention was the predominant AS type, followed by exon skipping. Interestingly, we found that DNA methylation played an important role in generating various AS isoforms by regulating splicing sites, promoter regions and first exons. Furthermore, we identified a large number of fusion transcripts and novel lncRNAs, and found that DNA methylation of the promoter and gene body could regulate lncRNA expression. Our results significantly improved existing pig gene models and revealed that pig AS and epigenetic modification are more complex than previously thought.


September 22, 2019  |  

Somatic APP gene recombination in Alzheimer’s disease and normal neurons.

The diversity and complexity of the human brain are widely assumed to be encoded within a constant genome. Somatic gene recombination, which changes germline DNA sequences to increase molecular diversity, could theoretically alter this code but has not been documented in the brain, to our knowledge. Here we describe recombination of the Alzheimer’s disease-related gene APP, which encodes amyloid precursor protein, in human neurons, occurring mosaically as thousands of variant ‘genomic cDNAs’ (gencDNAs). gencDNAs lacked introns and ranged from full-length cDNA copies of expressed, brain-specific RNA splice variants to myriad smaller forms that contained intra-exonic junctions, insertions, deletions, and/or single nucleotide variations. DNA in situ hybridization identified gencDNAs within single neurons that were distinct from wild-type loci and absent from non-neuronal cells. Mechanistic studies supported neuronal ‘retro-insertion’ of RNA to produce gencDNAs; this process involved transcription, DNA breaks, reverse transcriptase activity, and age. Neurons from individuals with sporadic Alzheimer’s disease showed increased gencDNA diversity, including eleven mutations known to be associated with familial Alzheimer’s disease that were absent from healthy neurons. Neuronal gene recombination may allow ‘recording’ of neural activity for selective ‘playback’ of preferred gene variants whose expression bypasses splicing; this has implications for cellular diversity, learning and memory, plasticity, and diseases of the human brain.


September 22, 2019  |  

Single-cell mRNA isoform diversity in the mouse brain.

Alternative mRNA isoform usage is an important source of protein diversity in mammalian cells. This phenomenon has been extensively studied in bulk tissues; however, it remains unclear how this diversity is reflected in single cells. Here we use long-read sequencing technology combined with unique molecular identifiers (UMIs) to reveal patterns of alternative full-length isoform expression in single cells from the mouse brain. We found a surprising amount of isoform diversity, even after applying a conservative definition of what constitutes an isoform. Genes tend to have one or a few isoforms highly expressed and a larger number of isoforms expressed at a low level. However, for many genes, nearly every sequenced mRNA molecule was unique, and many events affected coding regions, suggesting previously unknown protein diversity in single cells. Exon junctions in coding regions were less prone to splicing errors than those in non-coding regions, indicating purifying selection on splice donor and acceptor efficiency. Our findings indicate that mRNA isoform diversity is an important source of biological variability also in single cells.


September 22, 2019  |  

Improving eukaryotic genome annotation using single molecule mRNA sequencing.

The advantages of Pacific Biosciences (PacBio) single-molecule real-time (SMRT) technology include long reads, low systematic bias, and high consensus read accuracy. Here we use these attributes to improve on the genome annotation of the parasitic hookworm Ancylostoma ceylanicum using PacBio RNA-Seq. We sequenced 192,888 circular consensus sequences (CCS) derived from cDNAs generated using the Clontech SMARTer system. These SMARTer-SMRT libraries were normalized and size-selected, providing a robust population of expressed structural genes for subsequent genome annotation. We demonstrate that genome annotation based on PacBio mRNA sequences improves on annotation using conventional sequencing-by-synthesis alone, by identifying 1609 (9.2%) new genes, extending the length of 3965 (26.7%) genes and increasing the total genomic exon length by 1.9 Mb (12.4%). Non-coding sequence representation (primarily from UTRs based on dT reverse transcription priming) was particularly improved, increasing in total length by fifteen-fold through increases in both the length and number of UTR exons. In addition, the UTR data provided by these CCS allowed for the identification of a novel SL2 splice leader sequence for A. ceylanicum and an increase in the number and proportion of functionally annotated genes. RNA-seq data also confirmed some of the newly annotated genes and gene features. Overall, PacBio data has supported a significant improvement in gene annotation in this genome, and is an appealing alternative or complementary technique for genome annotation to the other transcript sequencing technologies.


September 22, 2019  |  

PacBio sequencing and its applications.

Single-molecule, real-time sequencing developed by Pacific Biosciences offers longer read lengths than the second-generation sequencing (SGS) technologies, making it well-suited for unsolved problems in genome, transcriptome, and epigenetics research. The highly contiguous de novo assemblies produced with PacBio sequencing can close gaps in current reference assemblies and characterize structural variation (SV) in personal genomes. With longer reads, we can sequence through extended repetitive regions and detect mutations, many of which are associated with diseases. Moreover, PacBio transcriptome sequencing is advantageous for the identification of gene isoforms and facilitates reliable discoveries of novel genes and novel isoforms of annotated genes, due to its ability to sequence full-length transcripts or fragments with significant lengths. Additionally, PacBio's sequencing technique provides information that is useful for the direct detection of base modifications, such as methylation. In addition to using PacBio sequencing alone, many hybrid sequencing strategies have been developed to make use of more accurate short reads in conjunction with PacBio long reads. In general, hybrid sequencing strategies are more affordable and scalable, especially for small laboratories, than using PacBio sequencing alone. The advent of PacBio sequencing has made available much information that could not be obtained via SGS alone. Copyright © 2015 The Authors. Production and hosting by Elsevier Ltd. All rights reserved.


September 22, 2019  |  

Analyses of alternative polyadenylation: from old school biochemistry to high-throughput technologies.

Alterations in usage of polyadenylation sites during transcription termination yield transcript isoforms from a gene. Recent findings of transcriptome-wide alternative polyadenylation (APA) as a molecular response to changes in biology position APA not only as a molecular event of early transcriptional termination but also as a cellular regulatory step affecting various biological pathways. With the development of high-throughput profiling technologies at single-nucleotide resolution and their applications targeted to the 3′-end of mRNAs, the dynamics of the mRNA 3′-end landscape are measurable at a global scale. In this review, methods and technologies that have been adopted to study APA events are discussed. In addition, various bioinformatics algorithms for APA isoform analysis using publicly available RNA-seq datasets are introduced. [BMB Reports 2017; 50(4): 201-207].


September 22, 2019  |  

Long reads: their purpose and place.

In recent years long-read technologies have moved from being a niche and specialist field to a point of relative maturity likely to feature frequently in the genomic landscape. Analogous to next generation sequencing, the cost of sequencing using long-read technologies has materially dropped whilst the instrument throughput continues to increase. Together these changes present the prospect of sequencing large numbers of individuals with the aim of fully characterizing genomes at high resolution. In this article, we will endeavour to present an introduction to long-read technologies showing: what long reads are; how they are distinct from short reads; why long reads are useful and how they are being used. We will highlight the recent developments in this field, and the applications and potential of these technologies in medical research, and clinical diagnostics and therapeutics.


September 22, 2019  |  

2′-O-methylation in mRNA disrupts tRNA decoding during translation elongation.

Chemical modifications of mRNA may regulate many aspects of mRNA processing and protein synthesis. Recently, 2′-O-methylation of nucleotides was identified as a frequent modification in translated regions of human mRNA, showing enrichment in codons for certain amino acids. Here, using single-molecule, bulk kinetics and structural methods, we show that 2′-O-methylation within coding regions of mRNA disrupts key steps in codon reading during cognate tRNA selection. Our results suggest that 2′-O-methylation sterically perturbs interactions of ribosomal-monitoring bases (G530, A1492 and A1493) with cognate codon-anticodon helices, thereby inhibiting downstream GTP hydrolysis by elongation factor Tu (EF-Tu) and A-site tRNA accommodation, leading to excessive rejection of cognate aminoacylated tRNAs in initial selection and proofreading. Our current and prior findings highlight how chemical modifications of mRNA tune the dynamics of protein synthesis at different steps of translation elongation.


September 22, 2019  |  

The plant growth-promoting rhizobacterium Variovorax boronicumulans CGMCC 4969 regulates the level of indole-3-acetic acid synthesized from indole-3-acetonitrile.

Variovorax is a metabolically diverse genus of plant growth-promoting rhizobacteria (PGPR) that engages in mutually beneficial interactions between plants and microbes. Unlike most PGPR, Variovorax cannot synthesize the phytohormone indole-3-acetic acid (IAA) via tryptophan. However, we found that V. boronicumulans strain CGMCC 4969 could produce IAA using indole-3-acetonitrile (IAN) as the precursor. Thus, in the present study, the IAA synthesis mechanism of V. boronicumulans CGMCC 4969 was investigated. V. boronicumulans CGMCC 4969 metabolized IAN to IAA through both a nitrilase-dependent pathway and a nitrile hydratase (NHase) and amidase-dependent pathway. Cobalt enhanced the metabolic flux via the NHase/amidase, by which IAN was rapidly converted to indole-3-acetamide (IAM) and in turn to IAA. IAN stimulated the metabolic flux via the nitrilase, by which IAN was rapidly converted to IAA. Subsequently, the IAA was degraded. V. boronicumulans CGMCC 4969 could use IAN as the sole carbon and nitrogen source for growth. Genome sequencing confirmed the IAA synthesis pathways. Gene cloning and overexpression in Escherichia coli indicated that NitA has nitrilase activity and IamA has amidase activity, transforming IAN and IAM, respectively, to IAA. Interestingly, NitA showed a close genetic relationship with the nitrilase of the phytopathogen Pseudomonas syringae. Quantitative PCR analysis indicated that the NHase/amidase system is constitutively expressed, whereas the nitrilase is inducible. The present study helps our understanding of the versatile functions of Variovorax nitrile-converting enzymes that mediate IAA synthesis and the interactions between plants and these bacteria. IMPORTANCE: We demonstrated that Variovorax boronicumulans CGMCC 4969 has two enzymatic systems (nitrilase and nitrile hydratase/amidase) that convert indole-3-acetonitrile (IAN) to the important plant hormone indole-3-acetic acid (IAA). The two IAA synthesis systems have very different regulatory mechanisms, affecting the IAA synthesis rate and duration. The nitrilase was induced by IAN, which was rapidly converted to IAA; subsequently IAA was rapidly consumed for cell growth. The NHase and amidase system was constitutively expressed and slowly but continuously synthesized IAA. In addition to synthesizing IAA from IAN, CGMCC 4969 has a rapid IAA degradation system, which would be helpful for a host plant to eliminate redundant IAA. This study indicates that the plant growth-promoting rhizobacterium V. boronicumulans CGMCC 4969 has the potential to be used by host plants to regulate the IAA level. Copyright © 2018 American Society for Microbiology.


September 22, 2019  |  

Comparative genome analysis of jujube witches’-broom Phytoplasma, an obligate pathogen that causes jujube witches’-broom disease.

JWB phytoplasma is an insect-transmitted, uncultivable bacterial plant pathogen that causes a destructive jujube disease. To date, no genome information about JWB phytoplasma has been published, which has hindered its characterization at the genomic level. To understand its pathogenicity and ecology, the genome of a JWB phytoplasma isolate, jwb-nky, was sequenced and compared with those of other phytoplasmas, which enabled us to explore the mechanisms of genomic rearrangement. The complete genome sequence of JWB phytoplasma (jwb-nky) was determined; it consists of one circular chromosome of 750,803 bp with a GC content of 23.3%. A total of 694 protein-encoding genes, 2 operons for rRNA genes and 31 tRNA genes, as well as 4 potential mobile units (PMUs) containing clusters of DNA repeats, were identified. Based on PHI-base analysis, a large number of genes were genome-specific and approximately 13% of JWB phytoplasma genes were predicted to be associated with virulence. Although transporters for maltose, dipeptides/oligopeptides, spermidine/putrescine, cobalt, Mn/Zn and methionine were identified, KEGG pathway analysis revealed the reduced metabolic capabilities of JWB phytoplasma. Comparative genome analyses between JWB phytoplasma and other phytoplasmas show the occurrence of large-scale gene rearrangements. The low synteny with other phytoplasmas indicated that the expansion of multiple gene families/duplication probably occurred separately after differentiation. In this study, the complete genome sequence of a JWB phytoplasma isolate, jwb-nky, which causes JWB disease, was reported for the first time, and a number of species-specific genes were identified in the genome. The study enhanced our understanding of the genomic basis and the pathogenicity mechanism of this pathogen, which will aid in the development of improved strategies for efficient management of JWB disease.
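
For readers unfamiliar with the GC content figure quoted above (23.3% for the jwb-nky chromosome), the following is a minimal sketch of how such a percentage is commonly computed from a sequence string. The function name `gc_content` and the choice to ignore ambiguous bases such as N are assumptions made for illustration; this is not the pipeline used in the study.

```python
def gc_content(sequence: str) -> float:
    """Return the percentage of G and C among unambiguous bases (A, C, G, T).

    Illustrative helper only: ambiguous symbols such as N are skipped,
    which is an assumption rather than a universal convention.
    """
    bases = [b for b in sequence.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no unambiguous bases")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


if __name__ == "__main__":
    # Toy sequence, not real phytoplasma data.
    print(gc_content("ATATGCATAT"))
```

A low value such as 23.3% indicates an AT-rich genome.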


September 22, 2019  |  

Characterization and genomic analyses of Pseudomonas aeruginosa podovirus TC6: establishment of genus Pa11virus.

Phages have attracted renewed interest as an alternative to chemical antibiotics. Although the number of phages is 10-fold higher than that of bacteria, the number of genomically characterized phages is far smaller than that of bacteria. In this study, phage TC6, a novel lytic virus of Pseudomonas aeruginosa, was isolated and characterized. TC6 consists of an icosahedral head with a diameter of approximately 54 nm and a short tail with a length of about 17 nm, which are characteristics of the family Podoviridae. TC6 can lyse 86 out of 233 clinically isolated P. aeruginosa strains, thus showing potential for application in phage therapy. The linear double-stranded genomic DNA of TC6 consisted of 49,796 base pairs and was predicted to contain 71 protein-coding genes. A total of 11 TC6 structural proteins were identified by mass spectrometry. Comparative analysis revealed that the P. aeruginosa phages TC6, O4, PA11, and IME180 shared high similarity at the DNA sequence and proteome levels, among which PA11 was the first phage discovered and published. Moreover, these phages contain 54 core genes and have very close phylogenetic relationships, which distinguish them from other known phage genera. We therefore proposed that these four phages be classified as Pa11virus, a new phage genus of Podoviridae that infects Pseudomonas spp. The results of this work advance our understanding of phage biology, classification, and diversity.


September 22, 2019  |  

Real-time assembly of ribonucleoprotein complexes on nascent RNA transcripts.

Cellular protein-RNA complexes assemble on nascent transcripts, but methods to observe transcription and protein binding in real time and at physiological concentrations are not available. Here, we report a single-molecule approach based on zero-mode waveguides that simultaneously tracks transcription progress and the binding of ribosomal protein S15 to nascent RNA transcripts during early ribosome biogenesis. We observe stable binding of S15 to single RNAs immediately after transcription for the majority of the transcripts at 35 °C but for less than half at 20 °C. The remaining transcripts exhibit either rapid and transient binding or are unable to bind S15, likely due to RNA misfolding. Our work establishes the foundation for studying transcription and its coupled co-transcriptional processes, including RNA folding, ligand binding, and enzymatic activity such as in coupling of transcription to splicing, ribosome assembly or translation.


September 22, 2019  |  

Cryptocurrencies and Zero Mode Wave guides: An unclouded path to a more contiguous Cannabis sativa L. genome assembly

We describe the use of a Decentralized Autonomous Organization (DAO) to crypto-fund the single molecule sequencing and publication of a Type II Cannabis plant. This resulted in the construction of the most contiguous Cannabis genome assembly to date. The combined use of the Dash cryptocurrency, DAOs, and Pacific Biosciences sequencing delivered a 1.03 Gb genome with an N50 of 665 Kb in 77 days from funding to public upload. This represents a 230-fold improvement in the contiguity of the first cannabis assemblies in 2011 and a 4-fold improvement over all cannabis assemblies to date. 34 Gb of additional sequencing pushed the assembly to an N50 of 3.8 Mb. Hi-C data from Phase Genomics further scaffolded the assembly to 35 contigs at an N50 of 74 Mb but requires additional curation. The genome is partially phased and larger than previously reported (2N: 1.33 Gb). The CBCA, THCA and CBDA synthase gene clusters have been phased onto respective contigs demonstrating tandem repeat expansions.
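
The contiguity figures in the abstract above are reported as N50 values. As a minimal sketch of the usual definition (the length of the contig at which the running total of contigs, sorted longest first, reaches half of the assembly size), the statistic can be computed as follows. The function name `n50` and the toy contig lengths are assumptions for illustration and are not taken from the assembly described.

```python
def n50(contig_lengths: list[int]) -> int:
    """Return the N50 of a set of contig lengths.

    Contigs are sorted from longest to shortest and accumulated; the N50 is
    the length of the contig at which the running total first reaches at
    least half of the total assembly length.
    """
    if not contig_lengths:
        raise ValueError("no contig lengths given")
    ordered = sorted(contig_lengths, reverse=True)
    half_total = sum(ordered) / 2
    running = 0
    for length in ordered:
        running += length
        if running >= half_total:
            return length
    raise AssertionError("unreachable for non-empty positive lengths")


if __name__ == "__main__":
    # Hypothetical contig lengths in base pairs, not data from the paper.
    print(n50([2, 3, 4, 5, 6]))
```

A larger N50 means that more of the assembly sits in long contigs, which is why the abstract uses it as its measure of contiguity.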


September 22, 2019  |  

Identification of DNA base modifications by means of Pacific Biosciences RS Sequencing technology.

Whole phage genomes can be sequenced readily using one or a combination of next generation sequencing (NGS) technologies. One of the most recently developed NGS platforms, the so-called Single-Molecule Real-Time (SMRT) sequencing approach provided by the PacBio RS platform, is particularly useful in providing complete (i.e., un-gapped) genome sequences, but differs from other technologies in that the platform also allows for downstream analysis to identify nucleotides that have been modified by DNA methylation. Here, we describe the methodological approach for the detection of genomic methylation motifs by means of SMRT sequencing.

