Menu
July 7, 2019

Next-generation sequencing and large genome assemblies.

The next-generation sequencing (NGS) revolution has drastically reduced time and cost requirements for sequencing of large genomes, and also qualitatively changed the problem of assembly. This article reviews the state of the art in de novo genome assembly, paying particular attention to mammalian-sized genomes. The strengths and weaknesses of the main sequencing platforms are highlighted, leading to a discussion of assembly and the new challenges associated with NGS data. Current approaches to assembly are outlined and the various software packages available are introduced and compared. The question of whether quality assemblies can be produced using short-read NGS data alone, or whether it must be combined with more expensive sequencing techniques, is considered. Prospects for future assemblers and tests of assembly performance are also discussed.


July 7, 2019

Bacteriophage P70: unique morphology and unrelatedness to other Listeria bacteriophages.

Listeria monocytogenes is an important food-borne pathogen, and its bacteriophages find many uses in detection and biocontrol of its host. The novel broad-host-range virulent phage P70 has a unique morphology with an elongated capsid. Its genome sequence was determined by a hybrid sequencing strategy employing Sanger and PacBio techniques. The P70 genome contains 67,170 bp and 119 open reading frames (ORFs). Our analyses suggest that P70 represents an archetype of virus unrelated to other known Listeria bacteriophages.


July 7, 2019

An Inv(16)(p13.3q24.3)-encoded CBFA2T3-GLIS2 fusion protein defines an aggressive subtype of pediatric acute megakaryoblastic leukemia.

To define the mutation spectrum in non-Down syndrome acute megakaryoblastic leukemia (non-DS-AMKL), we performed transcriptome sequencing on diagnostic blasts from 14 pediatric patients and validated our findings in a recurrency/validation cohort consisting of 34 pediatric and 28 adult AMKL samples. Our analysis identified a cryptic chromosome 16 inversion (inv(16)(p13.3q24.3)) in 27% of pediatric cases, which encodes a CBFA2T3-GLIS2 fusion protein. Expression of CBFA2T3-GLIS2 in Drosophila and murine hematopoietic cells induced bone morphogenic protein (BMP) signaling and resulted in a marked increase in the self-renewal capacity of hematopoietic progenitors. These data suggest that expression of CBFA2T3-GLIS2 directly contributes to leukemogenesis. Copyright © 2012 Elsevier Inc. All rights reserved.


July 7, 2019

Real-time sequencing.

This month’s Genome Watch describes the impact of next-generation sequencing on the ‘real-time’ analysis of pathogen genomes during outbreaks.


July 7, 2019

Real-time tRNA transit on single translating ribosomes at codon resolution.

Translation by the ribosome occurs by a complex mechanism involving the coordinated interaction of multiple nucleic acid and protein ligands. Here we use zero-mode waveguides (ZMWs) and sophisticated detection instrumentation to allow real-time observation of translation at physiologically relevant micromolar ligand concentrations. Translation at each codon is monitored by stable binding of transfer RNAs (tRNAs)-labelled with distinct fluorophores-to translating ribosomes, which allows direct detection of the identity of tRNA molecules bound to the ribosome and therefore the underlying messenger RNA (mRNA) sequence. We observe the transit of tRNAs on single translating ribosomes and determine the number of tRNA molecules simultaneously bound to the ribosome, at each codon of an mRNA molecule. Our results show that ribosomes are only briefly occupied by two tRNA molecules and that release of deacylated tRNA from the exit (E) site is uncoupled from binding of aminoacyl-tRNA site (A-site) tRNA and occurs rapidly after translocation. The methods outlined here have broad application to the study of mRNA sequences, and the mechanism and regulation of translation.


July 7, 2019

Computational solutions to large-scale data management and analysis.

Today we can generate hundreds of gigabases of DNA and RNA sequencing data in a week for less than US$5,000. The astonishing rate of data generation by these low-cost, high-throughput technologies in genomics is being matched by that of other technologies, such as real-time imaging and mass spectrometry-based flow cytometry. Success in the life sciences will depend on our ability to properly interpret the large-scale, high-dimensional data sets that are generated by these technologies, which in turn requires us to adopt advances in informatics. Here we discuss how we can master the different types of computational environments that exist – such as cloud and heterogeneous computing – to successfully tackle our big data problems.


July 7, 2019

Absence of genome reduction in diverse, facultative endohyphal bacteria.

Fungi interact closely with bacteria, both on the surfaces of the hyphae and within their living tissues (i.e. endohyphal bacteria, EHB). These EHB can be obligate or facultative symbionts and can mediate diverse phenotypic traits in their hosts. Although EHB have been observed in many lineages of fungi, it remains unclear how widespread and general these associations are, and whether there are unifying ecological and genomic features can be found across EHB strains as a whole. We cultured 11 bacterial strains after they emerged from the hyphae of diverse Ascomycota that were isolated as foliar endophytes of cupressaceous trees, and generated nearly complete genome sequences for all. Unlike the genomes of largely obligate EHB, the genomes of these facultative EHB resembled those of closely related strains isolated from environmental sources. Although all analysed genomes encoded structures that could be used to interact with eukaryotic hosts, pathways previously implicated in maintenance and establishment of EHB symbiosis were not universally present across all strains. Independent isolation of two nearly identical pairs of strains from different classes of fungi, coupled with recent experimental evidence, suggests horizontal transfer of EHB across endophytic hosts. Given the potential for EHB to influence fungal phenotypes, these genomes could shed light on the mechanisms of plant growth promotion or stress mitigation by fungal endophytes during the symbiotic phase, as well as degradation of plant material during the saprotrophic phase. As such, these findings contribute to the illumination of a new dimension of functional biodiversity in fungi.


July 7, 2019

Next-generation polyploid phylogenetics: rapid resolution of hybrid polyploid complexes using PacBio single-molecule sequencing.

Difficulties in generating nuclear data for polyploids have impeded phylogenetic study of these groups. We describe a high-throughput protocol and an associated bioinformatics pipeline (Pipeline for Untangling Reticulate Complexes (Purc)) that is able to generate these data quickly and conveniently, and demonstrate its efficacy on accessions from the fern family Cystopteridaceae. We conclude with a demonstration of the downstream utility of these data by inferring a multi-labeled species tree for a subset of our accessions. We amplified four c. 1-kb-long nuclear loci and sequenced them in a parallel-tagged amplicon sequencing approach using the PacBio platform. Purc infers the final sequences from the raw reads via an iterative approach that corrects PCR and sequencing errors and removes PCR-mediated recombinant sequences (chimeras). We generated data for all gene copies (homeologs, paralogs, and segregating alleles) present in each of three sets of 50 mostly polyploid accessions, for four loci, in three PacBio runs (one run per set). From the raw sequencing reads, Purc was able to accurately infer the underlying sequences. This approach makes it easy and economical to study the phylogenetics of polyploids, and, in conjunction with recent analytical advances, facilitates investigation of broad patterns of polyploid evolution.© 2016 The Authors. New Phytologist © 2016 New Phytologist Trust.


July 7, 2019

Cultivation of a chemoautotroph from the SUP05 clade of marine bacteria that produces nitrite and consumes ammonium.

Marine oxygen minimum zones (OMZs) are expanding regions of intense nitrogen cycling. Up to half of the nitrogen available for marine organisms is removed from the ocean in these regions. Metagenomic studies have identified an abundant group of sulfur-oxidizing bacteria (SUP05) with the genetic potential for nitrogen cycling and loss in OMZs. However, SUP05 have defied cultivation and their physiology remains untested. We cultured, sequenced and tested the physiology of an isolate from the SUP05 clade. We describe a facultatively anaerobic sulfur-oxidizing chemolithoautotroph that produces nitrite and consumes ammonium under anaerobic conditions. Genetic evidence that closely related strains are abundant at nitrite maxima in OMZs suggests that sulfur-oxidizing chemoautotrophs from the SUP05 clade are a potential source of nitrite, fueling competing nitrogen removal processes in the ocean.


July 7, 2019

Enabling the democratization of the genomics revolution with a fully integrated web-based bioinformatics platform.

Continued advancements in sequencing technologies have fueled the development of new sequencing applications and promise to flood current databases with raw data. A number of factors prevent the seamless and easy use of these data, including the breadth of project goals, the wide array of tools that individually perform fractions of any given analysis, the large number of associated software/hardware dependencies, and the detailed expertise required to perform these analyses. To address these issues, we have developed an intuitive web-based environment with a wide assortment of integrated and cutting-edge bioinformatics tools in pre-configured workflows. These workflows, coupled with the ease of use of the environment, provide even novice next-generation sequencing users with the ability to perform many complex analyses with only a few mouse clicks and, within the context of the same environment, to visualize and further interrogate their results. This bioinformatics platform is an initial attempt at Empowering the Development of Genomics Expertise (EDGE) in a wide range of applications for microbial research.© The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.


July 7, 2019

Quantum changes in Helicobacter pylori gene expression accompany host-adaptation.

Helicobacter pylori is a highly successful gastric pathogen. High genomic plasticity allows its adaptation to changing host environments. Complete genomes of H. pylori clinical isolate UM032 and its mice-adapted serial derivatives 298 and 299, generated using both PacBio RS and Illumina MiSeq sequencing technologies, were compared to identify novel elements responsible for host-adaptation. The acquisition of a jhp0562-like allele, which encodes for a galactosyltransferase, was identified in the mice-adapted strains. Our analysis implies a new ß-1,4-galactosyltransferase role for this enzyme, essential for Ley antigen expression. Intragenomic recombination between babA and babB genes was also observed. Further, we expanded on the list of candidate genes whose expression patterns have been mediated by upstream homopolymer-length alterations to facilitate host adaption. Importantly, greater than four-fold reduction of mRNA levels was demonstrated in five genes. Among the down-regulated genes, three encode for outer membrane proteins, including BabA, BabB and HopD. As expected, a substantial reduction in BabA protein abundance was detected in mice-adapted strains 298 and 299 via Western analysis. Our results suggest that the expression of Ley antigen and reduced outer membrane protein expressions may facilitate H. pylori colonisation of mouse gastric epithelium.© The Author 2016. Published by Oxford University Press on behalf of Kazusa DNA Research Institute.


July 7, 2019

Implementation and data analysis of Tn-seq, whole genome resequencing, and single-molecule real time sequencing for bacterial genetics.

Few discoveries have been more transformative to the biological sciences than the development of DNA sequencing technologies. The rapid advancement of sequencing and bioinformatics tools has revolutionized bacterial genetics, deepening our understanding of model and clinically relevant organisms. Although application of newer sequencing technologies to studies in bacterial genetics is increasing, the implementation of DNA sequencing technologies and development of the bioinformatics tools required for analyzing the large data sets generated remains a challenge for many. In this minireview, we have chosen to summarize three sequencing approaches that are particularly useful for bacterial genetics. We provide resources for scientists new to and interested in their application. Herein, we discuss the analysis of Tn-seq data to determine gene disruptions differentially represented in a mutant population, Illumina sequencing for identification of suppressor or other mutations, and we summarize single-molecule real time (SMRT) sequencing for de novo genome assembly and the use of the output data for detection of DNA base modifications. Copyright © 2016, American Society for Microbiology. All Rights Reserved.


July 7, 2019

Development of new methods for the quantitative detection and typing of Lactobacillus parabuchneri in dairy products

Thirty-one isolates of Lactobacillus parabuchneri were obtained from cheese containing histamine; of these, 26 were found to possess the hdcA gene encoding histidine decarboxylase. By analysing the genome data of 13 isolates, specific targets for the development of PCR-based detection and typing systems for L. parabuchneri were identified. The real-time PCR for detection showed a linear quantification over a range of 7 logs and a detection limit of 10 gene equivalents per reaction. The strain typing method utilised the amplification of repeat sequences and showed discrimination comparable with a phylogenetic tree, based on genome comparisons. The method was suitable for detecting and monitoring the development of L. parabuchneri in raw milk and cheese.


July 7, 2019

Genomic analysis of the multi-drug-resistant clinical isolate Myroides odoratimimus PR63039.

Myroides odoratimimus (M. odoratimimus) has been gradually implicated as an important nosocomial pathogen that poses a serious health threat to immunocompromised patients owing to its multi-drug resistance. However, the resistance mechanism is currently unclear. To clarify the antibiotic resistance and infectivity mechanisms of M. odoratimimus, whole genome sequencing was performed on the multi-drug-resistant M. odoratimimus strain PR63039. The genome sequence was completed with single molecule real-time (SMRT) technologies. Then, annotation was performed using RAST and IMG-ER. A number of databases and software programs were used to analyze the genomic characteristics, including GC-Profile, ISfinder, CG viewer, ARDB, CARD, ResFinder, the VFDB database, PHAST and Progressive Mauve. The M. odoratimimus PR63039 genome consisted of a chromosome and a plasmid. The genome contained a large number of resistance genes and virulence factors. The distribution of the resistance genes was distinctive, and a resistance region named MY63039-RR was found. The subsystem features generated by RAST indicated that the annotated genome had 108 genes that were potentially involved in virulence, disease and defense, all of which had strong associations with resistance and pathogenicity. The prophage analysis showed two incomplete prophages in the genome. The genomic analysis of M. odoratimimus PR63039 partially clarified its antibiotic resistance mechanisms and virulence factors. Obtaining a clear understanding of its genomic characteristics will be conducive to the management of multidrug-resistant M. odoratimimus.


Talk with an expert

If you have a question, need to check the status of an order, or are interested in purchasing an instrument, we're here to help.