In this webinar, the presenters describe a targeted sequencing workflow that combines Roche NimbleGen’s SeqCap EZ enrichment technology with PacBio’ SMRT Sequencing to provide a more comprehensive view of variants and haplotype information over multi-kilobase, contiguous regions. They demonstrate that 6 kb fragments can also be utilized to enrich for long fragments that extend beyond the targeted capture site and well into (and often across) the adjacent intronic regions. When combined with SMRT Sequencing, multi-kilobase genomic regions can be phased and variants, including complex structural variants, can be detected in exons, introns and intergenic regions.
In a talk at AGBT 2017, Histogenetics CEO Nezih Cereb reported on how SMRT Sequencing is allowing his team to produce full-length, phased sequences for HLA alleles, which are important for matching organ transplants to recipients. The company is typing thousands of samples per day on their PacBio RS II systems and their new Sequel System. Cereb noted that SMRT Sequencing is unique in its ability to reliably phase mutations in the HLA alleles without imputation. Cereb concluded with his plans to use this approach for other complex regions, such as KIR, and announced their continued increasing HLA typing capacity…
At AGBT 2017, Mike Schatz from Johns Hopkins University and Cold Spring Harbor Laboratory presented data from sequencing, assembling, and analyzing personalized, phased diploid genomes with either Illumina, 10x Genomics, and PacBio SMRT Sequencing. Compared to the short-read-based methods, PacBio data assembled in large, complete contigs and contained the broadest range of structural variants with the best resolution. Plus: unexpected translocation findings with SMRT Sequencing, validated in follow-up studies.
Early detection of colorectal cancer (CRC) and its precursor lesions (adenomas) is crucial to reduce mortality rates. The fecal immunochemical test (FIT) is a non-invasive CRC screening test that detects the blood-derived protein hemoglobin. However, FIT sensitivity is suboptimal especially in detection of CRC precursor lesions. As adenoma-to-carcinoma progression is accompanied by alternative splicing, tumor-specific proteins derived from alternatively spliced RNA transcripts might serve as candidate biomarkers for CRC detection.
Tremendous flexibility is maintained in the human proteome via alternative splicing, and cancer genomes often subvert this flexibility to promote survival. Identification and annotation of cancer-specific mRNA isoforms is critical to understanding how mutations in the genome affect the biology of cancer cells. While microarrays and other NGS-based methods have become useful for studying transcriptomes, these technologies yield short, fragmented transcripts that remain a challenge for accurate, complete reconstruction of splice variants. The Iso-Seq method developed at PacBio offers the only solution for direct sequencing of full-length, single-molecule cDNA sequences needed to discover biomarkers for early detection and cancer stratification,…
This webinar, presented by Nisha Pillai, provides an overview of amplicon sequencing to target specific regions of a genome using PacBio Single Molecule, Real-Time (SMRT) Sequencing. This session provides an overview of bioinformatics approaches for PacBio amplicon analysis including circular consensus sequencing and long amplicon analysis.
In a push to develop insect-based food sources for people, Brenda Oppert from the USDA has been sequencing bug genomes with PacBio technology. Long reads are essential because of the highly repetitive sequences and large genomes. On the Sequel II System, a single SMRT Cell is sufficient to generate 350-fold coverage and produce a high-quality assembly for some of the insects she’s studying.
The utility of new highly accurate long reads, or HiFi reads, was first demonstrated for calling all variant types in human genomes. It has since been shown that HiFi reads can be used to generate contiguous, complete, and accurate human genomes, even in repeat structures such as centromeres and telomeres. In this virtual workshop scientists from PacBio as well as Tina Graves-Lindsay from the McDonnell Genome Institute at Washington University share the many improvements we’ve made to HiFi sequencing in the past year, tools that take advantage of HiFi data for variant detection and assembly, and examples in numerous genomics…
Long-read mRNA sequencing such as PacBio’s Iso-Seq method offer high-throughput transcriptome profiling that circumvents the transcript assembly problem by sequencing full-length cDNA. The Iso-Seq method has emerged as the most reliable technology for fully characterizing isoforms and, in turn, help shed light on underlying disease mechanisms. Here, we have utilized the Iso-Seq method to sequence an Alzheimer’s disease whole brain?sample. This is a devastating neurodegenerative disease that affects ~44 million people worldwide, making it the most common form of dementia. Studies looking into disease mechanism have shown that changes in gene expression due to alternative splicing likely contribute to the…
In this ASHG 2020 CoLab presentation hear Principal Scientists, Aaron Wenger and Elizabeth Tseng share how highly accurate long reads (HiFi reads) provide comprehensive variant detection for both genomes and transcriptomes. Aaron Wenger describes how new improvements in protocols and analysis methods have increased scalability and accuracy of variant calling. As demonstrated in the precisionFDA Truth Challenge V2, HiFi reads (>99% accurate, 15 kb – 20 kb) now outperform short reads for single nucleotide and structural variant calling and match for small indels. This includes calling >30,000 small variants and >10,000 structural variants missed by short reads, many in medically…
The three classes of genes that comprise the MHC gene family are actively involved in determining donor-recipient compatibility for organ transplant, as well as susceptibility to autoimmune diseases via cross-reacting immunization. Specifically, Class I genes HLA-A, -B, -C, and class II genes HLA-DR, -DQ and -DP are considered medically important for genetic analysis to determine histocompatibility. They are highly polymorphic and have thousands of alleles implicated in disease resistance and susceptibility. The importance of full-length HLA gene sequencing for genotyping, detection of null alleles, and phasing is now widely acknowledged. While DNA-sequencing-based HLA genotyping has become routine, only 7% of…
Sequence based typing (SBT) is considered the gold standard method for HLA typing. Current SBT methods are rather laborious and are prone to phase ambiguity problems and genotyping uncertainties. As a result, the NGS community is rapidly seeking to remedy these challenges, to produce high resolution and high throughput HLA sequencing conducive to a clinical setting. Today, second generation NGS technologies are limited in their ability to yield full length HLA sequences required for adequate phasing and identification of novel alleles. Here we present the use of single molecule real time (SMRT) sequencing as a means of determining full length/long…
Alternative splicing of RNA is an important mechanism that increases protein diversity and is pervasive in the most complex biological functions. While advances in RNA sequencing methods have accelerated our understanding of the transcriptome, isoform discovery remains computationally challenging due to short read lengths. Here, we describe the Isoform Sequencing (Iso-Seq) method using long reads generated by the PacBio RS II. We sequenced rat heart and lung RNA using the Clontech® SMARTer® cDNA preparation kit followed by size selection using agarose gel. Additionally, we tested the BluePippin™ device from Sage Science for efficiently extracting longer transcripts = 3 kb. Post-sequencing,…
PacBio’s new Iso-Seq technology allows for rapid generation of full-length cDNA sequences without the need for assembly steps. The technology was tested on leaf mRNA from two model O. sativa ssp. indica cultivars – Minghui 63 and Zhenshan 97. Even though each transcriptome was not exhaustively sequenced, several thousand isoforms described genes over a wide size range, most of which are not present in any currently available FL cDNA collection. In addition, the lack of an assembly requirement provides direct and immediate access to complete mRNA sequences and rapid unraveling of biological novelties.
As a cost-effective alternative to whole genome human sequencing, targeted sequencing of specific regions, such as exomes or panels of relevant genes, has become increasingly common. These methods typically include direct PCR amplification of the genomic DNA of interest, or the capture of these targets via probe-based hybridization. Commonly, these approaches are designed to amplify or capture exonic regions and thereby result in amplicons or fragments that are a few hundred base pairs in length, a length that is well-addressed with short-read sequencing technologies. These approaches typically provide very good coverage and can identify SNPs in the targeted region, but…