This video provides an overview of the techniques and steps of generating a de novo genome assembly with long-read sequencing data generated using PacBio Single Molecule, Real-Time (SMRT) Sequencing. In this video, a PacBio scientist covers the benefits of long reads when generating high-quality genome assemblies, the latest tools for creating assemblies, including HGAP, FALCON and FALCON-Unzip, how to polish and assess the quality of a genome assembly, and how to submit an assembly to NCBI.
In this webinar, Ben Auch (Research Scientist, Innovation Lab, University of Minnesota Genomics Center), Cody Sheik (Assistant Professor of Biology, University of Minnesota Duluth), and Harm van Bakel (Assistant Professor of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai) provide details of the newly updated microbial whole genome sequencing pipeline, which leverages the multiplexing capabilities of the Sequel System, share new insights into the ecophysiology of Minnesota microbes using long-read sequencing, and show how whole genome sequencing is used in pathogen surveillance programs at hospitals.
Highly accurate long reads, known as HiFi reads, are a new tool in scientists’ sequencing toolbox. Hear PacBio users share how they are using HiFi reads to explore genomes, transcriptomes, and metagenomes, and the benefits HiFi reads provide in addressing critical life science questions.
Studying microbial genomics and infectious disease? Learn how the PacBio Sequel II System can help advance your research, with first-hand perspectives from scientists who are investigating SARS-CoV-2 and COVID-19. In this webinar, Melissa Laird-Smith (Mt. Sinai School of Medicine) discusses her work evaluating the impact of host immune restriction in health and disease with high resolution HLA typing. She is joined by Corey Watson (University of Louisville School of Medicine) who talks about overcoming complexity to elucidate the role of IGH haplotype diversity in antibody-mediated immunity. Hosted by Meredith Ashby, Director of Microbial Genomics at PacBio. Access additional PacBio resources…
The utility of new highly accurate long reads, or HiFi reads, was first demonstrated for calling all variant types in human genomes. It has since been shown that HiFi reads can be used to generate contiguous, complete, and accurate human genomes, even in repeat structures such as centromeres and telomeres. In this virtual workshop, scientists from PacBio, as well as Tina Graves-Lindsay from the McDonnell Genome Institute at Washington University, share the many improvements we’ve made to HiFi sequencing in the past year, tools that take advantage of HiFi data for variant detection and assembly, and examples in numerous genomics…
In this Labroots webinar, Meredith Ashby, Director of Microbial Genomics at PacBio, describes the utility of highly accurate long-read sequencing, known as HiFi sequencing, to understand the SARS-CoV-2 viral genome. HiFi sequencing enables mutation phasing and rare variant detection to understand viral stability and mutation rates, and provides insights into viral population structure for monitoring viral evolution. Ashby also shares how HiFi sequencing can be used to explore the host immune response to COVID-19, specifically by providing full-length sequencing of the B cell repertoire, IGH locus and HLA genes. Access additional COVID-19 Sequencing Tools and Resources.
Human genomic variations range in size from single nucleotide substitutions to large chromosomal rearrangements. Sequencing technologies tend to be optimized for detecting particular variant types and sizes. Short reads excel at detecting SNVs and small indels, while long or linked reads are typically used to detect larger structural variants or phase distant loci. Long reads are more easily mapped to repetitive regions, but tend to have lower per-base accuracy, making it difficult to call short variants. The PacBio Sequel System produces two main data types: long continuous reads (up to 100 kbp), generated by single passes over a long template,…
The three classes of genes that comprise the MHC gene family are actively involved in determining donor-recipient compatibility for organ transplant, as well as susceptibility to autoimmune diseases via cross-reacting immunization. Specifically, Class I genes HLA-A, -B, -C, and class II genes HLA-DR, -DQ and -DP are considered medically important for genetic analysis to determine histocompatibility. They are highly polymorphic and have thousands of alleles implicated in disease resistance and susceptibility. The importance of full-length HLA gene sequencing for genotyping, detection of null alleles, and phasing is now widely acknowledged. While DNA-sequencing-based HLA genotyping has become routine, only 7% of…
Allelic-level resolution HLA typing is known to improve survival prognoses post Unrelated Donor (UD) Haematopoietic Stem Cell Transplantation (HSCT). Currently, many commonly used HLA typing methodologies are limited either because ambiguity cannot be resolved or because they are not amenable to high-throughput laboratories. Pacific Biosciences’ Single Molecule Real-Time (SMRT) DNA sequencing technology enables sequencing of single molecules in isolation and has read-length capabilities to enable whole gene sequencing for HLA. DNA barcode technology labels samples with unique identifiers that can be traced throughout the sequencing process. The use of DNA barcodes means that multiple samples can…
Sequence based typing (SBT) is considered the gold standard method for HLA typing. Current SBT methods are rather laborious and are prone to phase ambiguity problems and genotyping uncertainties. As a result, the NGS community is rapidly seeking to remedy these challenges, to produce high resolution and high throughput HLA sequencing conducive to a clinical setting. Today, second generation NGS technologies are limited in their ability to yield full length HLA sequences required for adequate phasing and identification of novel alleles. Here we present the use of single molecule real time (SMRT) sequencing as a means of determining full length/long…
The correct phasing of genetic variations is a key challenge for many applications of DNA sequencing. Allele-level resolution is strongly preferred for histocompatibility sequencing where recombined genes can exhibit different compatibilities than their parents. In other contexts, gene complementation can provide protection if deleterious mutations are found on only one allele of a gene. These problems are especially pronounced in immunological domains given the high levels of genetic diversity and recombination seen in regions like the Major Histocompatibility Complex. A new tool for analyzing Single Molecule, Real-Time (SMRT) Sequencing data – Long Amplicon Analysis (LAA) – can generate highly accurate,…
As a cost-effective alternative to whole genome human sequencing, targeted sequencing of specific regions, such as exomes or panels of relevant genes, has become increasingly common. These methods typically include direct PCR amplification of the genomic DNA of interest, or the capture of these targets via probe-based hybridization. Commonly, these approaches are designed to amplify or capture exonic regions and thereby result in amplicons or fragments that are a few hundred base pairs in length, a length that is well-addressed with short-read sequencing technologies. These approaches typically provide very good coverage and can identify SNPs in the targeted region, but…
Human MHC class I genes HLA-A, -B, -C, and class II genes HLA-DR, -DP and -DQ, play a critical role in the immune system as major factors responsible for organ transplant rejection. They have a direct or linkage-based association with several diseases, including cancer and autoimmune diseases, and are important targets for clinical and drug sensitivity research. HLA genes are also highly polymorphic and their diversity originates from exonic combinations as well as recombination events. A large number of new alleles are expected to be encountered if these genes are sequenced through the UTRs. Thus allele-level resolution is strongly preferred…
While the identification of individual SNPs has been readily available for some time, the ability to accurately phase SNPs and structural variation across a haplotype has been a challenge. With individual reads of an average length of 9 kb (P5-C3), and individual reads beyond 30 kb in length, SMRT Sequencing technology allows the identification of mutation combinations such as microdeletions, insertions, and substitutions without any predetermined reference sequence. Long-amplicon analysis is a novel protocol that identifies and reports the abundance of differing clusters of sequencing reads within a single library. Graphs generated via hierarchical clustering of individual sequencing reads…
Fully phased allele-level sequencing of highly polymorphic HLA genes is greatly facilitated by SMRT Sequencing technology. In the present work, we have evaluated multiple DNA barcoding strategies for multiplexing several loci from multiple individuals, using three different tagging methods. Specifically, MHC class I genes HLA-A, -B, and -C were indexed via DNA barcodes by either tailed primers or barcoded SMRTbell adapters. Eight different 16-bp barcode sequences were used in symmetric and asymmetric pairing. Eight DNA barcoded adapters in symmetric pairing were independently ligated to a pool of HLA-A, -B and -C for eight different individuals, one at a time and…