In this ASHG 2016 poster video, Martin Pollard from the Wellcome Trust Sanger Institute and the University of Cambridge describes an ambitious project to better represent natural variation in the complex MHC region by sequencing the locus in thousands of people from various populations in Africa. A pilot project in five populations has already revealed a lot of diversity in the region, which is important for human disease, vaccine response, and organ transplantation. Pollard says SMRT Sequencing is the only technology that can deliver the full-length haplotypes necessary to identify complete variation in this highly polymorphic complex. Plus: plans to…
In this AGBT 2017 talk, PacBio CSO Jonas Korlach provided a technology roadmap for the Sequel System, including plans the continue performance and throughput increases through early 2019. Per SMRT Cell throughput of the Sequel System is expected to double this year and again next year. Together with a new higher-capacity SMRT Cell expected to be released by the end of 2018, these improvements result in a ~30-fold increase or ~150 Gb / SMRT Cell allowing a real $1000 real de novo human genome assembly. Also discussed: Additional application protocol improvements, new chemistry and software updates, and a look at…
Tremendous flexibility is maintained in the human proteome via alternative splicing, and cancer genomes often subvert this flexibility to promote survival. Identification and annotation of cancer-specific mRNA isoforms is critical to understanding how mutations in the genome affect the biology of cancer cells. While microarrays and other NGS-based methods have become useful for studying transcriptomes, these technologies yield short, fragmented transcripts that remain a challenge for accurate, complete reconstruction of splice variants. The Iso-Seq method developed at PacBio offers the only solution for direct sequencing of full-length, single-molecule cDNA sequences needed to discover biomarkers for early detection and cancer stratification,…
In this presentation, Sonja Vernes of the Max Plank Institute shares her work with the Bat1K project which aims to catalog the genetic diversity of all living bat species. She highlights the unique biology of bats, from their widely varying sizes to their capacity for healthy aging and disease resistance and provides recent findings from ongoing efforts to sequence and annotate the genomes of 21 phylogenetic families of bats.
In this PacBio User Group Meeting presentation, Erin Bernberg from the University of Delaware reports on using the Agilent Femto Pulse System for high-resolution, highly sensitive fragment analysis and on the low DNA input protocol, which her team used for a recent study of ice worms.
In this presentation at PAG 2020, Bart Nijland of Genetwister Technologies explains how his team set out to make a haplotype-aware assembly of the highly complex tetraploid Rosa x hybrida L. genome in order to capture its full range of genetic variation. HiFi reads generated from PacBio’s Sequel II System have made it possible to parse out critical information from many of the plant’s parental genes.
In this SMRT Leiden 2020 Online Virtual Event presentation Pedro Oliveira of Mount Sinai shares his research on Clostridioides – a leading cause of nosocomial-acquired diarrhea and colitis across the developed world. In this study, Oliveira and coworkers performed the first comprehensive DNA methylome analysis of 36 human C. difficile isolates from a hospital setting using SMRT Sequencing and comparative epigenomics.
In this SMRT Leiden 2020 Online Virtual Event presentation, Erwin Datema of KeyGene shares his work on using high-throughput, accurate long-read sequencing technologies, such as PacBio HiFi sequencing, to drastically reduced the investment required to generate high-quality genome sequences. As a result, they have shifted away from the reference-centric view of the genome, and entered the pan-genome era. Here, Datema highlights some of the breakthrough algorithmic innovations KeyGene has developed to generate and analyze population-scale pan-genomes for plant genomes of all complexities and sizes.
Past large scale cancer genome sequencing efforts, including The Cancer Genome Atlas and the International Cancer Genome Consortium, have utilized short-read sequencing, which is well-suited for detecting single nucleotide variants (SNVs) but far less reliable for detecting variants larger than 20 base pairs, including insertions, deletions, duplications, inversions and translocations. Recent same-sample comparisons of short- and long-read human reference genome data have revealed that short-read resequencing typically uncovers only ~4,000 structural variants (SVs, =50 bp) per genome and is biased towards deletions, whereas sequencing with PacBio long-reads consistently finds ~20,000 SVs, evenly balanced between insertions and deletions. This discovery has…
Transcriptome sequencing has proven to be an important tool for understanding the biological changes in cancer genomes including the consequences of structural rearrangements. Short read sequencing has been the method of choice, as the high throughput at low cost allows for transcript quantitation and the detection of even rare transcripts. However, the reads are generally too short to reconstruct complete isoforms. Conversely, long-read approaches can provide unambiguous full-length isoforms, but lower throughput has complicated quantitation and high RNA input requirements has made working with cancer samples challenging. Recently, the COLO 829 cell line was sequenced to 50-fold coverage with PacBio…
Background: The sequencing and haplotype phasing of entire gene sequences improves the understanding of the genetic basis of disease and drug response. One example is cystic fibrosis (CF). Cystic fibrosis transmembrane conductance regulator (CFTR) modulator therapies have revolutionized CF treatment, but only in a minority of CF subjects. Observed heterogeneity in CFTR modulator efficacy is related to the range of CFTR mutations; revertant mutations can modify the response to CFTR modulators, and other intronic variations in the ~200 kb CFTR gene have been linked to disease severity. Heterogeneity in the CFTR gene may also be linked to differential responses to…
NGS is commonly used for amplicon sequencing in clinical applications to study genetic disorders and detect disease-causing mutations. This approach can be plagued by limited ability to phase sequence variants and makes interpretation of sequence data difficult when pseudogenes are present. Long-read highly accurate amplicon sequencing can provide very accurate, efficient, high throughput (through multiplexing) sequences from single molecules, with read lengths largely limited by PCR. Data is easy to interpret; phased variants and breakpoints are present within high fidelity individual reads. Here we show SMRT Sequencing of the PMS2 and OPN1 (MW and LW) genes using the Sequel System.…
Human MHC class I genes HLA-A, -B, -C, and class II genes HLA-DR, -DP and -DQ, play a critical role in the immune system as major factors responsible for organ transplant rejection. The have a direct or linkage-based association with several diseases, including cancer and autoimmune diseases, and are important targets for clinical and drug sensitivity research. HLA genes are also highly polymorphic and their diversity originates from exonic combinations as well as recombination events. A large number of new alleles are expected to be encountered if these genes are sequenced through the UTRs. Thus allele-level resolution is strongly preferred…
Single Molecule, Real-Time (SMRT) Sequencing holds promise for addressing new frontiers to understand molecular mechanisms in evolution and gain insight into adaptive strategies. With read lengths exceeding 10 kb, we are able to sequence high-quality, closed microbial genomes with associated plasmids, and investigate large genome complexities, such as long, highly repetitive, low-complexity regions and multiple tandem-duplication events. Improved genome quality, observed at 99.9999% (QV60) consensus accuracy, and significant reduction of gap regions in reference genomes (up to and beyond 50%) allow researchers to better understand coding sequences with high confidence, investigate potential regulatory mechanisms in noncoding regions, and make inferences…
Fully phased allele-level sequencing of highly polymorphic HLA genes is greatly facilitated by SMRT Sequencing technology. In the present work, we have evaluated multiple DNA barcoding strategies for multiplexing several loci from multiple individuals, using three different tagging methods. Specifically MHC class I genes HLA-A, -B, and –C were indexed via DNA Barcodes by either tailed primers or barcoded SMRTbell adapters. Eight different 16-bp barcode sequences were used in symmetric & asymmetric pairing. Eight DNA barcoded adapters in symmetric pairing were independently ligated to a pool of HLA-A, -B and –C for eight different individuals, one at a time and…