Interested in learning about pangenomes? Explore this guide to see how they provide a more complete picture of the core genes of a given species and how that can provide better biological understanding.
Michael Lutz, from the Duke University Medical Center, discussed a recently published software tool that can now be used in a pipeline with SMRT Sequencing data to find structural variant biomarkers for neurodegenerative diseases with a focus on Alzheimer’s disease, ALS, and Lewy body dementia. His team is particularly interested in short sequence repeats and short tandem repeats, which have already been implicated in neurodegenerative disease.
Human MHC class I genes HLA-A, -B, -C, and class II genes HLA-DR, -DQ, and -DP play a critical role in the immune system as primary factors responsible for organ transplant rejection. Additionally, the HLA genes are important targets for clinical and drug sensitivity research because of their direct or linkage-based association with several diseases, including cancer and autoimmune diseases. HLA genes are highly polymorphic, and their diversity originates from exonic combinations as well as recombination events. With full-length gene sequencing, a significant increase of new alleles in the HLA database is expected, stressing the need for high-resolution sequencing.…
In this PacBio User Group Meeting presentation, PacBio scientist Kristin Mars speaks about recent updates, such as the single-day library prep that’s now possible with the Iso-Seq Express workflow. She also notes that one SMRT Cell 8M is sufficient for most Iso-Seq experiments for whole transcriptome sequencing at an affordable price.
In this Labroots webinar, Meredith Ashby, Director of Microbial Genomics at PacBio, describes the utility of highly accurate long-read sequencing, known as HiFi sequencing, to understand the SARS-CoV-2 viral genome. HiFi sequencing enables mutation phasing and rare variant detection to understand viral stability and mutation rates, as well as providing insights into viral population structure for monitoring viral evolution. Ashby also shares how HiFi sequencing can be used to explore the host immune response to COVID-19, specifically by providing full-length sequencing of the B cell repertoire, IGH locus and HLA genes. Access additional COVID-19 Sequencing Tools and Resources.
In this ASHG 2020 PacBio Workshop, Jonas Korlach, CSO, shares how the new PacBio Sequel IIe System makes highly accurate long-read sequencing easy and affordable so all scientists can gain comprehensive views of human genomes and transcriptomes. He goes on to provide updates on the applications including human WGS for variant detection, de novo genome assembly, single-cell full-length RNA sequencing, and targeted sequencing using PCR and No-Amp methods.
In this ASHG 2020 CoLab presentation, hear Principal Scientists Aaron Wenger and Elizabeth Tseng share how highly accurate long reads (HiFi reads) provide comprehensive variant detection for both genomes and transcriptomes. Aaron Wenger describes how new improvements in protocols and analysis methods have increased scalability and accuracy of variant calling. As demonstrated in the precisionFDA Truth Challenge V2, HiFi reads (>99% accurate, 15 kb – 20 kb) now outperform short reads for single nucleotide and structural variant calling and match for small indels. This includes calling >30,000 small variants and >10,000 structural variants missed by short reads, many in medically…
Dr. Wenger gives attendees an update on PacBio’s long-read sequencing and variant detection capabilities on the Sequel II System and shares recommendations on how to design your own study using HiFi reads. Then, Dr. Sund from Cincinnati Children’s Hospital Medical Center describes how she has used long-read sequencing to solve rare neurological diseases involving complex structural rearrangements that were previously unsolved with standard methods.
Although the accuracy of the human reference genome is critical for basic and clinical research, structural variants (SVs) have been difficult to assess because data capable of resolving them have been limited. To address potential bias, we sequenced a diversity panel of nine human genomes to high depth using long-read, single-molecule, real-time sequencing data. Systematically identifying and merging SVs ≥50 bp in length for these nine and one public genome yielded 83,909 sequence-resolved insertions, deletions, and inversions. Among these, 2,839 (2.0 Mbp) are shared among all discovery genomes with an additional 13,349 (6.9 Mbp) present in the majority of humans,…
The correct phasing of genetic variations is a key challenge for many applications of DNA sequencing. Allele-level resolution is strongly preferred for histocompatibility sequencing where recombined genes can exhibit different compatibilities than their parents. In other contexts, gene complementation can provide protection if deleterious mutations are found on only one allele of a gene. These problems are especially pronounced in immunological domains given the high levels of genetic diversity and recombination seen in regions like the Major Histocompatibility Complex. A new tool for analyzing Single Molecule, Real-Time (SMRT) Sequencing data – Long Amplicon Analysis (LAA) – can generate highly accurate,…
Human MHC class I genes HLA-A, -B, -C, and class II genes HLA-DR, -DP and -DQ play a critical role in the immune system as major factors responsible for organ transplant rejection. They have a direct or linkage-based association with several diseases, including cancer and autoimmune diseases, and are important targets for clinical and drug sensitivity research. HLA genes are also highly polymorphic and their diversity originates from exonic combinations as well as recombination events. A large number of new alleles are expected to be encountered if these genes are sequenced through the UTRs. Thus allele-level resolution is strongly preferred…
Fully phased allele-level sequencing of highly polymorphic HLA genes is greatly facilitated by SMRT Sequencing technology. In the present work, we have evaluated multiple DNA barcoding strategies for multiplexing several loci from multiple individuals, using three different tagging methods. Specifically, MHC class I genes HLA-A, -B, and -C were indexed via DNA barcodes by either tailed primers or barcoded SMRTbell adapters. Eight different 16-bp barcode sequences were used in symmetric and asymmetric pairing. Eight DNA barcoded adapters in symmetric pairing were independently ligated to a pool of HLA-A, -B and -C for eight different individuals, one at a time and…
PacBio 2013 User Group Meeting Presentation Slides: Lisbeth Guethlein from Stanford University School of Medicine looked at highly repetitive and variable immune regions of the orangutan genome. Guethlein reported that “PacBio managed to accomplish in a week what I have been working on for a couple years” (with Sanger sequencing), and the results were concordant. “Long story short, I was a happy customer.”
Sequence-based estimation of genetic diversity of Plasmodium falciparum, the most lethal malarial parasite, has proved challenging due to a lack of a complete genomic assembly. The skewed AT-richness (~80.6% (A+T)) of its genome and the lack of technology to assemble highly polymorphic sub-telomeric regions that contain clonally variant, multigene virulence families (i.e. var and rifin) have confounded attempts using short-read NGS technologies. Using single molecule, real-time (SMRT) sequencing, we successfully compiled all 14 nuclear chromosomes of the P. falciparum genome from telomere-to-telomere in single contigs. Specifically, amplification-free sequencing generated reads of average length 12 kb, with ≥50% of the reads…
The human immunoglobulin heavy chain locus (IGH) remains among the most understudied regions of the human genome. Recent efforts have shown that haplotype diversity within IGH is elevated and exhibits population specific patterns; for example, our re-sequencing of the locus from only a single chromosome uncovered >100 Kb of novel sequence, including descriptions of six novel alleles, and four previously unmapped genes. Historically, this complex locus architecture has hindered the characterization of IGH germline single nucleotide, copy number, and structural variants (SNVs; CNVs; SVs), and as a result, there remains little known about the role of IGH polymorphisms in inter-individual…