Learn how Single Molecule, Real-Time (SMRT) Sequencing and the Sequel IIe System and will accelerate your research by delivering highly accurate long reads to provide the most comprehensive view of genomes, transcriptomes and epigenomes.
At Cold Spring Harbor Laboratory, scientists used SMRT Sequencing to decode one of the most challenging cancer genomes ever encountered. Along the way, they built a portfolio of open-access analysis tools that will help researchers everywhere make structural variation discoveries with long-read sequencing data.
Kim Worley from Baylor’s Human Genome Sequencing Center describes the improvement of the sooty mangabey primate genome. Sooty mangabey is a model organism for HIV research, since this particular primate can be infected with the immunodeficiency virus and never develop any symptoms. Worley and her team used PacBio long reads in conjunction with their own assembly tool, PBJelly, closing 64% and improving another 19% of the gaps.
Adam English, lead bioinformatics programmer at Baylor College of Medicine, discusses challenges with resolving gaps in high-quality draft genome assemblies. Sequencing biases, repetitive genomic features, genomic polymorphism, and other complicating factors all come together to make some regions difficult or impossible to assemble. For more facile assembly and automated finishing of draft genomes, he presents an automated approach to finishing using long reads from the PacBio System. The tool PBJelly automates the finishing process using long sequence reads in a reference-guided assembly process. Using PBJelly and SMRT Sequencing, they upgraded the draft genome sequences of a simulated Drosophila melanogaster, the version…
David Wheeler from Baylor’s Human Genome Sequencing Center presents data from matched tumor/normal pairs. His research uses SMRT Sequencing to identify structural rearrangements, like tandem duplications, finding that many of these were caused by repeat regions moving around the genome. Also: details of the new Honey-tails and Honey-spots algorithms.
This seminar features great hands-on information and best practices for analyzing SMRT Sequencing data for eukaryotic genome assembly. Michael Schatz provides an overview of the assembly tools, provides recommendations for when to use each one, and discusses the challenges of short-read assemblies. James Gurtowski gives an in-depth overview of hybrid assemblies methods, where short read data are used used to correct errors in longer reads. Finally, Sergey Koren presents on chromosome-scale assembly, including the MinHash Alignment Process (MHAP) he developed to dramatically reduce the computational processing power required for genome assemblies.
Jason Chin, senior director of bioinformatics at PacBio, talks about using long-read sequence data and string graph assembly for assembling diploid genomes. A major challenge for diploid genome assembly is in distinguishing homologous regions from repeats, so he discusses how long reads are essential for resolving repeat regions. In the presentation, Chin displays data from two inbred Arabidopsis strains used to create a synthetic diploid assembly.
Ali Bashir from the Icahn Institute for Genomics and Multiscale Biology at Mount Sinai describes a tool to detect tandem repeats (PACMonSTR), which he believes are dramatically underrepresented in the human genome reference but that can be discovered with PacBio sequencing. In a collaboration with Cold Spring Harbor Laboratory and Cornell, Bashir and his team generated shotgun, whole-genome sequence data from human genomic DNA using PacBio sequencing. Their goal was to find structural variation features that are not present in the existing reference. He shows numerous examples wherein the long PacBio reads were able to resolve inversions in the sample,…
In this webinar, Elizabeth Tseng from PacBio demonstrates how to run the Iso-Seq bioinformatics software pipeline that is part of PacBio’s SMRTAnalysis software suite. Both the web portal interface (SMRT Portal) and the command line version will be introduced. In addition, she reviews the community version of Iso-Seq (pbtranscript-tofu) and other community tools to perform additional analyses.
At the PacBio AGBT workshop, Gene Myers from the Max-Planck Institute said it will soon be possible to generate a near-perfect human assembly. He presents a portfolio of analysis and quality-control tools designed to work with SMRT Sequencing data.
Deanna Church, formerly of Personalis, describes the value of a finished human reference genome that fully represents genetic variation. In this talk from the PacBio workshop at AGBT 2015, she makes the case for generating high-quality sequence data and contributing it to databases for continued improvement of the human genome assembly.
Jason Chin, senior director of bioinformatics at PacBio, talks about using long-read sequence data to generate diploid genome assemblies to produce comprehensive haplotype sequence reconstructions. In the presentation, Chin describes the FALCON Unzip process that combines SNP phasing with the assembly process and allows for determination of the haplotype sequences and identification of structural variants. He presents an example of diploid assembly from inbred Arabidopsis strains.
Adam Ameur talks about a range of applications for which SMRT Sequencing had been useful in the SciLifeLab. Examples include analyzing a DNA translocation in chronic myeloid leukemia samples; studying the HPV genome; and sequencing the FADS region to understand fatty acid production.