Dan Geraghty explains that despite decades of studies on the genetics of the major histocompatibility complex (MHC) and the highly polymorphic HLA class I and II genes, the key mutations behind a variety of autoimmune diseases, such as type 1 diabetes, rheumatoid arthritis, and multiple sclerosis, have still not been found. Extensive linkage disequilibrium in these regions is one obstacle, and obtaining phased information is another, so longer stretches of sequence are needed. Geraghty has recently begun using SMRT Technology in hopes of drilling down to the causal genetics.
Rick Wilson, Director of the McDonnell Genome Institute at Washington University in St. Louis, titled his talk “Of reference genomes and precious metals” and walked the audience through definitions and standards for the various quality levels of de novo assembled human genomes, e.g., platinum, gold, and silver. He noted that this was a good topic for this session because of the important role PacBio has played in the community’s work to create reference-grade genomes. For example, PacBio technology has enabled them to sequence additional genomes (CHM1, CHM13) to a very high quality level. Although these sequences were essential for further…
Michael Schatz of Cold Spring Harbor Laboratory and Johns Hopkins University discusses the challenges in detecting structural variations (SVs) in high-throughput sequencing data, especially more complex SVs such as a duplication nested within an inversion. To overcome these challenges, Dr. Schatz and his team have been applying long-read sequencing to analyze SVs in a range of samples from small microbial genomes, through mid-sized plant and animal genomes, to large mammalian genomes. The increased read lengths, which currently average over 10 kbp with some approaching 100 kbp, make it possible to span more complex SVs and accurately assess SVs in repetitive regions,…
Doreen Ware introduces her team’s new assembly of maize, built with PacBio long-read sequencing and genome maps from BioNano Genomics. With a contig N50 of nearly 10 Mb and more complete information than any previous assembly, Ware says, “This is just an amazing time to be a plant scientist.” Her presentation includes a number of highlights from the new assembly, which may help crop improvement efforts for maize.
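For readers unfamiliar with the contig N50 figure quoted above: N50 is the length of the contig at which, going from longest to shortest, the running total first reaches half of the total assembly length. The following is a minimal sketch of that calculation in Python. The function name and the toy contig lengths are hypothetical and are not taken from the maize assembly.

```python
def n50(lengths):
    """Return the N50 of a collection of contig lengths.

    Contigs are visited from longest to shortest; the N50 is the length
    of the contig at which the running total first reaches half of the
    summed length of all contigs.
    """
    if not lengths:
        raise ValueError("need at least one contig length")
    ordered = sorted(lengths, reverse=True)
    half_total = sum(ordered) / 2
    running = 0
    for length in ordered:
        running += length
        if running >= half_total:
            return length


# Hypothetical toy assembly (arbitrary units), not real data.
toy_contigs = [80, 70, 50, 40, 30, 20, 10]
print(n50(toy_contigs))  # total is 300; 80 + 70 reaches 150, so this prints 70
```

A higher N50 means more of the assembly sits in long contigs, which is why it is commonly reported as a contiguity measure alongside completeness.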
Sergey Koren of the National Biodefense Analysis and Countermeasures Center (NBACC) discusses integrating the MinHash Alignment Process (MHAP) with Celera Assembler to enable reference-grade assemblies of model organisms, revealing novel heterochromatic sequences and filling low-complexity gap sequences in the GRCh38 human reference genome. Dr. Koren and his team have applied this method to assemble the San Clemente goat genome. Combining SMRT Sequencing and next-generation optical mapping from BioNano Genomics generates an assembly that is over 150-fold more contiguous than the latest Capra hircus goat reference. In combination with Hi-C sequencing, the assembly surpasses reference assemblies de novo, with minimal manual intervention.…
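The MinHash idea that gives MHAP its name can be illustrated generically: each sequence is reduced to a small sketch of minimum hash values over its k-mers, and the fraction of matching sketch positions estimates the Jaccard similarity of the two k-mer sets. The sketch below is an illustration of that general technique only and is not MHAP's code. The function names, the k-mer size, the number of hashes, and the use of BLAKE2b as the hash are all assumptions chosen for the example.

```python
import hashlib


def kmers(seq, k):
    """Return the set of all k-mers in seq."""
    return {seq[i:i + k] for i in range(len(seq) - k + 1)}


def _hash(kmer, seed):
    """Deterministic 64-bit hash of a k-mer; the seed selects the hash function."""
    data = f"{seed}:{kmer}".encode()
    return int.from_bytes(hashlib.blake2b(data, digest_size=8).digest(), "big")


def sketch(seq, k=4, num_hashes=64):
    """MinHash sketch: for each seeded hash function, keep the minimum value."""
    ks = kmers(seq, k)
    if not ks:
        raise ValueError("sequence is shorter than k")
    return [min(_hash(km, seed) for km in ks) for seed in range(num_hashes)]


def estimate_jaccard(sketch_a, sketch_b):
    """Fraction of matching sketch positions, an estimate of Jaccard similarity."""
    matches = sum(a == b for a, b in zip(sketch_a, sketch_b))
    return matches / len(sketch_a)


# Hypothetical toy reads, not real data.
read_a = "ACGTACGTTGCAACGTAGCTAGCTAGGATCCA"
read_b = "ACGTACGTTGCAACGTAGCTAGCTAGGTTCCA"
print(estimate_jaccard(sketch(read_a), sketch(read_b)))
```

Comparing fixed-size sketches instead of full reads is what makes this kind of overlap screening cheap. The estimate becomes tighter as the number of hashes grows.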
Karyn Meltz Steinberg presents the first high-quality African reference genome assembly, that of the Yoruban individual NA19240, produced from SMRT Sequencing data. She says PacBio sequencing offers a significant improvement over short-read sequence data for high-quality assemblies.
Melissa Laird Smith discussed how the Icahn School of Medicine at Mount Sinai uses long-read sequencing for translational research. She gave several examples of targeted sequencing projects run on the Sequel System, including CYP2D6, phased mutations of GLA in Fabry’s disease, structural variation breakpoint validation in glioblastoma, and full-length immune profiling of TCR sequences.
In this AGBT 2017 poster, Ulf Gyllensten from Uppsala University presents two local reference genomes generated with PacBio and Bionano Genomics data. These assemblies include structural variation and repetitive regions that have been missed with previous short-read efforts, including some new genes not annotated in the human reference genome.
PacBio SMRT Sequencing is rapidly changing the genomics space with its long reads and high consensus sequence accuracy, providing the most comprehensive view of the genome and transcriptome. In this webinar, I will talk about the various data analysis tools available in PacBio’s data analysis suite, SMRT Link, as well as the third-party tools available. Key applications addressed in this talk are: Genome Assemblies, Structural Variant Analysis, Long Amplicon and Targeted Sequencing, Barcoding Strategies, and Iso-Seq Analysis for Full-length Transcript Sequencing.
SMRT Sequencing is a DNA sequencing technology characterized by long read lengths and high consensus accuracy, regardless of the sequence complexity or GC content of the DNA sample. These characteristics can be harnessed to address medically relevant genes, mRNA transcripts, and other genomic features that are otherwise difficult or impossible to resolve. I will describe examples of such new clinical research in diverse areas, including full-length gene sequencing with allelic haplotype phasing, gene/pseudogene discrimination, sequencing extreme DNA contexts, high-resolution pharmacogenomics, biomarker discovery, structural variant resolution, full-length mRNA isoform cataloging, and direct methylation detection.
At AGBT 2017, the Broad Institute’s Daniel Neafsey reported a large collaborative effort to sequence the mosquito that carries Zika virus. The team is using long-read PacBio sequencing to produce a high-quality genome assembly, which Neafsey expects will replace the 10-year-old Sanger assembly for Aedes aegypti. The new assembly reduces the number of contigs by at least 10-fold, boosts the contig N50 to nearly 2 Mb, and features more complete gene content.
In this ASHG workshop presentation, Jonas Korlach, CSO of PacBio, walked attendees through recent product updates and the coming technology roadmap. The Sequel System 6.0 release offered major improvements to accuracy, throughput, structural variant calling, and large-insert libraries, he said, showing examples of 35 kb libraries. Looking ahead, Korlach said that the V2 express library preparation product should be available early in 2019, with the new 8M SMRT Cell being introduced sometime later.
To improve crops like corn, soybeans, and canola, scientists at Corteva are building a compendium of crop genomics resources to provide actionable sequence information for genetic discovery, gene editing, and seed product development. Hear how Kevin Fengler, Comparative Genomics Lead of Data Science and Bioinformatics at Corteva, is using PacBio sequences to build visualization tools and genome assembly pipelines as a contribution to this effort.
In this presentation, Sonja Vernes of the Max Planck Institute shares her work with the Bat1K project, which aims to catalog the genetic diversity of all living bat species. She highlights the unique biology of bats, from their widely varying sizes to their capacity for healthy aging and disease resistance, and provides recent findings from ongoing efforts to sequence and annotate the genomes of 21 phylogenetic families of bats.