Menu
July 7, 2019  |  

Innovations and challenges in detecting long read overlaps: an evaluation of the state-of-the-art.

Identifying overlaps between error-prone long reads, specifically those from Oxford Nanopore Technologies (ONT) and Pacific Biosciences (PB), is essential for certain downstream applications, including error correction and de novo assembly. Though akin to the read-to-reference alignment problem, read-to-read overlap detection is a distinct problem that can benefit from specialized algorithms that perform efficiently and robustly on high error rate long reads. Here, we review the current state-of-the-art read-to-read overlap tools for error-prone long reads, including BLASR, DALIGNER, MHAP, GraphMap and Minimap. These specialized bioinformatics tools differ not just in their algorithmic designs and methodology, but also in their robustness of performance on a variety of datasets, time and memory efficiency and scalability. We highlight the algorithmic features of these tools, as well as their potential issues and biases when utilizing any particular method. To supplement our review of the algorithms, we benchmarked these tools, tracking their resource needs and computational performance, and assessed the specificity and precision of each. In the versions of the tools tested, we observed that Minimap is the most computationally efficient, specific and sensitive method on the ONT datasets tested; whereas GraphMap and DALIGNER are the most specific and sensitive methods on the tested PB datasets. The concepts surveyed may apply to future sequencing technologies, as scalability is becoming more relevant with increased sequencing throughput.cjustin@bcgsc.ca , ibirol@bcgsc.ca.Supplementary data are available at Bioinformatics online.


July 7, 2019  |  

Long-read sequencing offers path to more accurate drug metabolism profiles

In the complex drug discovery process, one of the looming questions for any new compound is how it will be metabolised in a human bodyWhi|e there are several methods for evaluating this, one of the most common involves CYP2D6,the enzyme encoded by the cytochrome P450—2D6 gene.This enzyme is involved in metabolising a quarter of all commonly used medications, making it an important target for ADME and pharmacogenomics studies. It is known to activate some drugs and to play a role in the deactivation or excretion of others.


July 7, 2019  |  

Post-termination ribosome intermediate acts as the gateway to ribosome recycling.

During termination of translation, the nascent peptide is first released from the ribosome, which must be subsequently disassembled into subunits in a process known as ribosome recycling. In bacteria, termination and recycling are mediated by the translation factors RF, RRF, EF-G, and IF3, but their precise roles have remained unclear. Here, we use single-molecule fluorescence to track the conformation and composition of the ribosome in real time during termination and recycling. Our results show that peptide release by RF induces a rotated ribosomal conformation. RRF binds to this rotated intermediate to form the substrate for EF-G that, in turn, catalyzes GTP-dependent subunit disassembly. After the 50S subunit departs, IF3 releases the deacylated tRNA from the 30S subunit, thus preventing reassembly of the 70S ribosome. Our findings reveal the post-termination rotated state as the crucial intermediate in the transition from termination to recycling. Copyright © 2017 The Authors. Published by Elsevier Inc. All rights reserved.


July 7, 2019  |  

Length-independent DNA packing into nanopore zero-mode waveguides for low-input DNA sequencing.

Compared with conventional methods, single-molecule real-time (SMRT) DNA sequencing exhibits longer read lengths than conventional methods, less GC bias, and the ability to read DNA base modifications. However, reading DNA sequence from sub-nanogram quantities is impractical owing to inefficient delivery of DNA molecules into the confines of zero-mode waveguides-zeptolitre optical cavities in which DNA sequencing proceeds. Here, we show that the efficiency of voltage-induced DNA loading into waveguides equipped with nanopores at their floors is five orders of magnitude greater than existing methods. In addition, we find that DNA loading is nearly length-independent, unlike diffusive loading, which is biased towards shorter fragments. We demonstrate here loading and proof-of-principle four-colour sequence readout of a polymerase-bound 20,000-base-pair-long DNA template within seconds from a sub-nanogram input quantity, a step towards low-input DNA sequencing and mammalian epigenomic mapping of native DNA samples.


July 7, 2019  |  

Research Highlights: Packing, trapping and sequencing

Ultralow concentrations of DNA can be optically sequenced with SMRT DNA sequencing. In principle, optical DNA-sequencing protocols have the advantage of reading long strands of DNA in real time and at high speeds. In practice, however, reading long DNA strands is a challenge with current methods, which require high concentrations and suffer from short- chain loading bias. To overcome these limitations, a research team led by Meni Wanunu at Northeastern University in Boston has now developed an efficient voltage-controlled DNA- loading technology that enables single molecule, real time (SMRT) sequencing of long DNA strands at ultralow concentrations.


July 7, 2019  |  

Probing the translation dynamics of ribosomes using Zero-Mode Waveguides

In order to coordinate the complex biochemical and structural feat of converting triple-nucleotide codons into their corresponding amino acids, the ribosome must physically manipulate numerous macromolecules including the mRNA, tRNAs, and numerous translation factors. The ribosome choreographs binding, dissociation, physical movements, and structural rearrangements so that they synergistically harness the energy from biochemical processes, including numerous GTP hydrolysis steps and peptide bond formation. Due to the dynamic and complex nature of translation, the large cast of ligands involved, and the large number of possible configurations, tracking the global time evolution or dynamics of the ribosome complex in translation has proven to be challenging for bulk methods. Conventional single-molecule fluorescence experiments on the other hand require low concentrations of fluorescent ligands to reduce background noise. The significantly reduced bimolecular association rates under those conditions limit the number of steps that can be observed within the time window available to a fluorophore. The advent of zero-mode waveguide (ZMW) technology has allowed the study of translation at near-physiological concentrations of labeled ligands, moving single-molecule fluorescence microscopy beyond focused model systems into studying the global dynamics of translation in realistic setups. This chapter reviews the recent works using the ZMW technology to dissect the mechanism of translation initiation and elongation in prokaryotes, including complex processes such as translational stalling and frameshifting. Given the success of the technology, similarly complex biological processes could be studied in near-physiological conditions with the controllability of conventional in vitro experiments. Copyright © 2016 Elsevier Inc. All rights reserved.


July 7, 2019  |  

Exocytotic fusion pores are composed of both lipids and proteins.

During exocytosis, fusion pores form the first aqueous connection that allows escape of neurotransmitters and hormones from secretory vesicles. Although it is well established that SNARE proteins catalyze fusion, the structure and composition of fusion pores remain unknown. Here, we exploited the rigid framework and defined size of nanodiscs to interrogate the properties of reconstituted fusion pores, using the neurotransmitter glutamate as a content-mixing marker. Efficient Ca(2+)-stimulated bilayer fusion, and glutamate release, occurred with approximately two molecules of mouse synaptobrevin 2 reconstituted into ~6-nm nanodiscs. The transmembrane domains of SNARE proteins assumed distinct roles in lipid mixing versus content release and were exposed to polar solvent during fusion. Additionally, tryptophan substitutions at specific positions in these transmembrane domains decreased glutamate flux. Together, these findings indicate that the fusion pore is a hybrid structure composed of both lipids and proteins.


July 7, 2019  |  

Single-molecule DNA hybridisation studied by using a modified DNA sequencer: a comparison with surface plasmon resonance data

Current methods for the determination of molecular interactions are widely used in the analytical sciences. To identify new methods, we investigated as a model system the hybridisation of a short 7 nt oligonucleotide labelled with, structurally, very similar cyanine dyes CY3 and DY-547, respectively, to a 34 nt oligonucleotide probe immobilised in a zero-mode waveguide (ZMW) nanostructure. Using a modified commercial off-the-shelf DNA sequencer, we established the principles to measure biomolecular interactions at the single-molecule level. Kinetic data were obtained from trains of fluorescence pulses, allowing the calculation of association and dissociation rate constants (k on, k off). For the 7mer labelled with the positively charged CY3 dye, k on and k off are ~3 larger and ~2 times smaller, respectively, compared with the oligonucleotide labelled with negatively charged DY-547 dye. The effect of neighbouring molecules lacking the 7nt binding sequence on single-molecule rate constants is small. The association rate constants is reduced by only 20–35%. Hybrid dissociation is not affected, since as a consequence of the experimental design, rebinding cannot take place. Results of single-molecule experiments were compared with data obtained from surface plasmon resonance (SPR) performed under comparable conditions. A good correlation for the association rate constants within a factor of 1.5 was found. Dissociation rate constants are smaller by a factor of 2–3 which we interpreted as a result of rebinding to neighbouring probes. Results of SPR measurements tend to systematically underestimate dissociation rate constants. The amount of this deviation depends on the association rate constant and the surface probe density. As a consequence, it is recommended to work at low probe densities to keep this effect small.


July 7, 2019  |  

Refined Pichia pastoris reference genome sequence.

Strains of the species Komagataella phaffii are the most frequently used “Pichia pastoris” strains employed for recombinant protein production as well as studies on peroxisome biogenesis, autophagy and secretory pathway analyses. Genome sequencing of several different P. pastoris strains has provided the foundation for understanding these cellular functions in recent genomics, transcriptomics and proteomics experiments. This experimentation has identified mistakes, gaps and incorrectly annotated open reading frames in the previously published draft genome sequences. Here, a refined reference genome is presented, generated with genome and transcriptome sequencing data from multiple P. pastoris strains. Twelve major sequence gaps from 20 to 6000 base pairs were closed and 5111 out of 5256 putative open reading frames were manually curated and confirmed by RNA-seq and published LC-MS/MS data, including the addition of new open reading frames (ORFs) and a reduction in the number of spliced genes from 797 to 571. One chromosomal fragment of 76kbp between two previous gaps on chromosome 1 and another 134kbp fragment at the end of chromosome 4, as well as several shorter fragments needed re-orientation. In total more than 500 positions in the genome have been corrected. This reference genome is presented with new chromosomal numbering, positioning ribosomal repeats at the distal ends of the four chromosomes, and includes predicted chromosomal centromeres as well as the sequence of two linear cytoplasmic plasmids of 13.1 and 9.5kbp found in some strains of P. pastoris. Copyright © 2016. Published by Elsevier B.V.


July 7, 2019  |  

N(6)-methyladenosine in mRNA disrupts tRNA selection and translation-elongation dynamics.

N(6)-methylation of adenosine (forming m(6)A) is the most abundant post-transcriptional modification within the coding region of mRNA, but its role during translation remains unknown. Here, we used bulk kinetic and single-molecule methods to probe the effect of m(6)A in mRNA decoding. Although m(6)A base-pairs with uridine during decoding, as shown by X-ray crystallographic analyses of Thermus thermophilus ribosomal complexes, our measurements in an Escherichia coli translation system revealed that m(6)A modification of mRNA acts as a barrier to tRNA accommodation and translation elongation. The interaction between an m(6)A-modified codon and cognate tRNA echoes the interaction between a near-cognate codon and tRNA, because delay in tRNA accommodation depends on the position and context of m(6)A within codons and on the accuracy level of translation. Overall, our results demonstrate that chemical modification of mRNA can change translational dynamics.


July 7, 2019  |  

Multiple parallel pathways of translation initiation on the CrPV IRES.

The complexity of eukaryotic translation allows fine-tuned regulation of protein synthesis. Viruses use internal ribosome entry sites (IRESs) to minimize or, like the CrPV IRES, eliminate the need for initiation factors. Here, by exploiting the CrPV IRES, we observed the entire process of initiation and transition to elongation in real time. We directly tracked the CrPV IRES, 40S and 60S ribosomal subunits, and tRNA using single-molecule fluorescence spectroscopy and identified multiple parallel initiation pathways within the system. Our results distinguished two pathways of 80S:CrPV IRES complex assembly that produce elongation-competent complexes. Following 80S assembly, the requisite eEF2-mediated translocation results in an unstable intermediate that is captured by binding of the elongator tRNA. Whereas initiation can occur in the 0 and +1 frames, the arrival of the first tRNA defines the reading frame and strongly favors 0 frame initiation. Overall, even in the simplest system, an intricate reaction network regulates translation initiation. Copyright © 2016 Elsevier Inc. All rights reserved.


July 7, 2019  |  

Amino acid sequence repertoire of the bacterial proteome and the occurrence of untranslatable sequences.

Bioinformatic analysis of Escherichia coli proteomes revealed that all possible amino acid triplet sequences occur at their expected frequencies, with four exceptions. Two of the four underrepresented sequences (URSs) were shown to interfere with translation in vivo and in vitro. Enlarging the URS by a single amino acid resulted in increased translational inhibition. Single-molecule methods revealed stalling of translation at the entrance of the peptide exit tunnel of the ribosome, adjacent to ribosomal nucleotides A2062 and U2585. Interaction with these same ribosomal residues is involved in regulation of translation by longer, naturally occurring protein sequences. The E. coli exit tunnel has evidently evolved to minimize interaction with the exit tunnel and maximize the sequence diversity of the proteome, although allowing some interactions for regulatory purposes. Bioinformatic analysis of the human proteome revealed no underrepresented triplet sequences, possibly reflecting an absence of regulation by interaction with the exit tunnel.


July 7, 2019  |  

Strategies for sequence assembly of plant genomes

The field of plant genome assembly has greatly benefited from the development and widespread adoption of next-generation DNA sequencing platforms. Very high sequencing throughputs and low costs per nucleotide have considerably reduced the technical and budgetary constraints associated with early assembly projects done primarily with a traditional Sanger-based approach. Those improvements led to a sharp increase in the number of plant genomes being sequenced, including large and complex genomes of economically important crops. Although next-generation DNA sequencing has considerably improved our understanding of the overall structure and dynamics of many plant genomes, severe limitations still remain because next-generation DNA sequencing reads typically are shorter than Sanger reads. In addition, the software tools used to de novo assemble sequences are not necessarily designed to optimize the use of short reads. These cause challenges, common to many plant species with large genome sizes, high repeat contents, polyploidy and genome-wide duplications. This chapter provides an overview of historical and current methods used to sequence and assemble plant genomes, along with new solutions offered by the emergence of technologies such as single molecule sequencing and optical mapping to address the limitations of current sequence assemblies.


July 7, 2019  |  

Genomic analyses of multidrug resistant Pseudomonas aeruginosa PA1 resequenced by single-molecule real-time sequencing.

As a third-generation sequencing (TGS) method, single-molecule real-time (SMRT) technology provides long read length, and it is well suited for resequencing projects and de novo assembly. In the present study, Pseudomonas aeruginosa PA1 was characterized and resequenced using SMRT technology. PA1 was also subjected to genomic, comparative and pan-genomic analyses. The multidrug resistant strain PA1 possesses a 6,498,072 bp genome and a sequence type of ST-782. The genome of PA1 was also visualized, and the results revealed the details of general genome annotations, virulence factors, regulatory proteins (RPs), secretion system proteins, type II toxin-antitoxin (T-A) pairs and genomic islands. Whole genome comparison analysis suggested that PA1 exhibits similarity to other P. aeruginosa strains but differs in terms of horizontal gene transfer (HGT) regions, such as prophages and genomic islands. Phylogenetic analyses based on 16S rRNA sequences demonstrated that PA1 is closely related to PAO1, and P. aeruginosa strains can be divided into two main groups. The pan-genome of P. aeruginosa consists of a core genome of approximately 4,000 genes and an accessory genome of at least 6,600 genes. The present study presented a detailed, visualized and comparative analysis of the PA1 genome, to enhance our understanding of this notorious pathogen. © 2016 The Author(s).


July 7, 2019  |  

Crystal structures of the TRIC trimeric intracellular cation channel orthologues.

Ca(2+) release from the sarcoplasmic reticulum (SR) and endoplasmic reticulum (ER) is crucial for muscle contraction, cell growth, apoptosis, learning and memory. The trimeric intracellular cation (TRIC) channels were recently identified as cation channels balancing the SR and ER membrane potentials, and are implicated in Ca(2+) signaling and homeostasis. Here we present the crystal structures of prokaryotic TRIC channels in the closed state and structure-based functional analyses of prokaryotic and eukaryotic TRIC channels. Each trimer subunit consists of seven transmembrane (TM) helices with two inverted repeated regions. The electrophysiological, biochemical and biophysical analyses revealed that TRIC channels possess an ion-conducting pore within each subunit, and that the trimer formation contributes to the stability of the protein. The symmetrically related TM2 and TM5 helices are kinked at the conserved glycine clusters, and these kinks are important for the channel activity. Furthermore, the kinks of the TM2 and TM5 helices generate lateral fenestrations at each subunit interface. Unexpectedly, these lateral fenestrations are occupied with lipid molecules. This study provides the structural and functional framework for the molecular mechanism of this ion channel superfamily.


Talk with an expert

If you have a question, need to check the status of an order, or are interested in purchasing an instrument, we're here to help.