Learn how highly accurate long-read sequencing from the Sequel IIe Systems delivers data you can trust for advanced biological insights across a range of applications.
PacBio customers discuss their applications of PacBio SMRT Sequencing and long reads, including Lemuel Racacho (Children’s Hospital of Eastern Ontario Research Institute), Matthew Blow (JGI), Yuta Suzuki (U. of Tokyo), Daniel Geraghty (Fred Hutchinson Cancer Center), and Mike Schatz (CSHL)
Jonas Korlach spoke about recent SMRT Sequencing updates, such as latest Sequel System chemistry release (1.2.1) and updates to the Integrative Genomics Viewer that’s now update optimized for PacBio data. He presented the recent data release of structural variation detected in the NA12878 genome, including many more insertions and deletions than short-read-based technologies were able to find.
In this AGBT 2017 talk, PacBio CSO Jonas Korlach provided a technology roadmap for the Sequel System, including plans the continue performance and throughput increases through early 2019. Per SMRT Cell throughput of the Sequel System is expected to double this year and again next year. Together with a new higher-capacity SMRT Cell expected to be released by the end of 2018, these improvements result in a ~30-fold increase or ~150 Gb / SMRT Cell allowing a real $1000 real de novo human genome assembly. Also discussed: Additional application protocol improvements, new chemistry and software updates, and a look at…
In this Webinar, we will give an introduction to Pacific Biosciences’ single molecule, real-time (SMRT) sequencing. After showing how the system works, we will discuss the main features of the technology with an emphasis on the difference between systematic error and random error and how SMRT sequencing produces better consensus accuracy than other systems. Following this, we will discuss several ground-breaking discoveries in medical science that were made possible by the longs reads and high accuracy of SMRT Sequencing.
SMRT Sequencing is a DNA sequencing technology characterized by long read lengths and high consensus accuracy, regardless of the sequence complexity or GC content of the DNA sample. These characteristics can be harnessed to address medically relevant genes, mRNA transcripts, and other genomic features that are otherwise difficult or impossible to resolve. I will describe examples for such new clinical research in diverse areas, including full-length gene sequencing with allelic haplotype phasing, gene/pseudogene discrimination, sequencing extreme DNA contexts, high-resolution pharmacogenomics, biomarker discovery, structural variant resolution, full-length mRNA isoform cataloging, and direct methylation detection.
In this PacBio User Group Meeting presentation, Tina Graves-Lindsay of the McDonnell Genome Institute and the Genome Reference Consortium speaks about the importance of phasing human reference genomes. Her team is now working on its fifteenth human genome assembly — part of a major effort to improve genomic representation of ethnic diversity — with a pipeline that generates 60-fold PacBio coverage for a de novo assembly, followed by scaffolding with other technologies. They are also using FALCON-Unzip to separate haplotypes, leading to reference-grade diploid assemblies. This approach has already helped resolve errors seen in other genomes and even the gold-standard…
In this presentation, Emily Hatas of PacBio offers a look a how SMRT Sequencing has changed over the years as well as the most common applications in human genome analysis: high-throughput structural variant detection; comprehensive variant detection; and de novo assembly of reference genomes.
De novo assemblies of human genomes from accurate (85-90%), continuous long reads (CLR) now approach the human reference genome in contiguity, but the assembly base pair accuracy is typically below QV40 (99.99%), an order-of-magnitude lower than the standard for finished references. The base pair errors complicate downstream interpretation, particularly false positive indels that lead to false gene loss through frameshifts. PacBio HiFi sequence data, which are both long (>10 kb) and very accurate (>99.9%) at the individual sequence read level, enable a new paradigm in human genome assembly. Haploid human assemblies using HiFi data achieve similar contiguity to those using…
The human reference sequence has provided a foundation for studies of genome structure, human variation, evolutionary biology, and disease. At the time the reference was originally completed there were some loci recalcitrant to closure; however, the degree to which structural variation and diversity affected our ability to produce a representative genome sequence at these loci was still unknown. Many of these regions in the genome are associated with large, repetitive sequences and exhibit complex allelic diversity such producing a single, haploid representation is not possible. To overcome this challenge, we have sequenced DNA from two hydatidiform moles (CHM1 and CHM13),…
The complex immune regions of the genome, including MHC and KIR, contain large copy number variants (CNVs), a high density of genes, hyper-polymorphic gene alleles, and conserved extended haplotypes (CEH) with enormous linkage disequilibrium (LDs). This level of complexity and inherent biases of short-read sequencing make it challenging for extracting immune region haplotype information from reference-reliant, shotgun sequencing and GWAS methods. As NGS based genome and exome sequencing and SNP arrays have become a routine for population studies, numerous efforts are being made for developing software to extract and or impute the immune gene information from these datasets. Despite these…
The long reads, random error, and unbiased sampling of SMRT Sequencing enables high quality, de novo assembly of the human genome. PacBio long reads are capable of resolving genomic variations at all size scales, including SNPs, insertions, deletions, inversions, translocations, and repeat expansions, all of which are important in understanding the genetic basis for human disease and difficult to access via other technologies. In demonstration of this, we report a new high-quality, diploid aware de novo assembly of Craig Venter’s well-studied genome.
Single-molecule sequencing is now routinely used to assemble complete, high-quality microbial genomes, but these assembly methods have not scaled well to large genomes. To address this problem, we previously introduced the MinHash Alignment Process (MHAP) for overlapping single-molecule reads using probabilistic, locality-sensitive hashing. Integrating MHAP with Celera Assembler (CA) has enabled reference-grade assemblies of model organisms, revealing novel heterochromatic sequences and filling low-complexity gap sequences in the GRCh38 human reference genome. We have applied our methods to assemble the San Clemente goat genome. Combining single-molecule sequencing from Pacific Biosciences and BioNano Genomics generates and assembly that is over 150-fold more…
The long reads, random error, and unbiased sampling of SMRT Sequencing enables high quality, de novo assembly of the human genome. PacBio long reads are capable of resolving genomic variations at all size scales, including SNPs, insertions, deletions, inversions, translocations, and repeat expansions, all of which are both important in understanding the genetic basis for human disease, and difficult to access via other technologies. In demonstration of this, we report a new high-quality, diploid-aware de novo assembly of Craig Venter’s well-studied genome.
After nearly two decades of improvements, the current human reference genome (GRCh38) is the most accurate and complete vertebrate genome ever produced. However, no one chromosome has been finished end to end, and hundreds of unresolved gaps persist. The remaining gaps include ribosomal rDNA arrays, large near-identical segmental duplications, and satellite DNA arrays. These regions harbor largely unexplored variation of unknown consequence, and their absence from the current reference genome can lead to experimental artifacts and hide true variants when re-sequencing additional human genomes. Here we present a de novo human genome assembly that surpasses the continuity of GRCh38, along…