Garth Ehrlich from the Center for Genomic Sciences at Allegheny Singer Research Institute reports on new studies of pneumococcal epigenetics. Streptococcus pneumonia, which causes more than 1.6 million deaths annually, has a highly plastic genome. Methylation analysis with SMRT Sequencing found a novel modification in addition to the expected epigenetic changes.
Peter Evans from the US FDA shares insights on whole-genome sequencing for bacteria of importance to public health. Comparing data across PacBio, 454, and MiSeq sequencers, he says having closed genomes, long reads, and methylation patterns are critical for gleaning comprehensive information about a microbe.
Jonas Korlach, CSO of PacBio, discusses the revival of finished genomes the microbial community will see with long read data, emphasizing that for certain organisms such as rapidly evolving microbes, having a de novo finished genome will be more useful than creating a draft based on a previous related reference genome. Korlach describes two bioinformatic methods from PacBio, a hierarchical genome assembly process (HGAP) and an consensus caller (Quiver), which are used to generate finished genomes from just long-read PacBio data, with final genome sequence accuracies over 99.999%. Korlach demonstrates the ability of PacBio data to generate closed, high-quality de…
Bart Weimer, a professor at the University of California, Davis, who is leading the 100K Foodborne Pathogen Genome Project, talks about using PacBio sequencing to produce long reads for microbial genomes as well as to study how bacteria use epigenetics to regulate gene expression.
Brian Anton from New England BioLabs presents data on methylation analysis using SMRT Sequencing. He describes both restriction-modification systems and orphan methylases, noting that the number of methylases characterized has more than tripled since the introduction of SMRT Sequencing. The presentation includes a phylogenetic analysis of methyltransferase genes
In this AGBT plenary talk, Jonas Korlach presented a number of collaborative studies between PacBio and other institutions to make use of highly accurate, long-read sequence data, which has led to a revival of finished genomes. Examples from the infectious disease or pathogen realm included Pertussis, Salmonella, and Listeria, all of which now have closed genomes from PacBio-generated data. Korlach also reported on epigenomic information in Salmonella and Listeria, indicating potential new forms of DNA modifications.
In this presentation, Greg Harhay from the USDA offers data on pathogens involved in bovine respiratory disease complex, known as “shipping fever.” His team used PacBio sequencing to analyze several isolates from two different pathogens, looking at their DNA sequence and methylation patterns.
Harold Swerdlow, who formerly ran the R&D department at Wellcome Trust Sanger Institute, discusses the Sanger team’s use of the PacBio RS sequencer. He says the system is uniquely suited for de novo sequencing and genome assembly, methylation pattern identification, and low-level variant detection because of its long reads and high-accuracy, single-molecule sequencing. At Sanger, that makes a real difference for the large-scale projects they have in cancer biology, pathogen sequencing, and human genetics.
PacBio scientists Kristi Kim and Michael Weiand present on high-throughput solutions for sample preparation including 10 kb template preparation, high-throughput PacBio RS II sample prep kits, and compatible automated liquid handling platforms.
Jonas Korlach, CSO of PacBio, discusses the scientific value of >10 kb, unbiased sequencing reads for an expanding range of applications. Single molecule, real-time (SMRT) Sequencing technology has rapidly advanced in read lengths, throughput and adoption in the past year – a review of published work from a variety of researchers utilizing these new capabilities is also conducted.
Keith Robison, from Warp Drive Bio, discusses his experiences using PacBio for antibiotic drug discovery in GC-rich Streptomyces genomes
This seminar features great hands-on information and best practices for analyzing SMRT Sequencing data for eukaryotic genome assembly. Michael Schatz provides an overview of the assembly tools, provides recommendations for when to use each one, and discusses the challenges of short-read assemblies. James Gurtowski gives an in-depth overview of hybrid assemblies methods, where short read data are used used to correct errors in longer reads. Finally, Sergey Koren presents on chromosome-scale assembly, including the MinHash Alignment Process (MHAP) he developed to dramatically reduce the computational processing power required for genome assemblies.
Chongyuan Luo from the Salk Institute for Biological Studies describes sequencing three strains of Arabidopsis thaliana using PacBio technology. The goal: uncover structural variants that have been missed by short-read and other sequencers. Luo notes that PacBio sequencing provides highly accurate SNP detection and also extends the mappability of reads beyond what is possible with short-read data, producing better and more accurate assemblies.
Allen Van Deynze from UC Davis presents the genome sequencing and assembly project for spinach, an organism of 980 Mb. Results indicate a high-accuracy assembly with significantly higher N50 contig length than a previous short-read assembly. The PacBio assembly has allowed for filling gaps in the prior assembly.
David Wheeler from Baylor’s Human Genome Sequencing Center presents data from matched tumor/normal pairs. His research uses SMRT Sequencing to identify structural rearrangements, like tandem duplications, finding that many of these were caused by repeat regions moving around the genome. Also: details of the new Honey-tails and Honey-spots algorithms.