PacBio’s Jenny Ekholm presents this ASHG 2016 poster on a new method being developed that enriches for unamplified DNA and uses SMRT Sequencing to characterize repeat expansion disorders. Incorporating the CRISPR/Cas9 system to target specific genes allows for amplification-free enrichment to preserve epigenetic information and avoid PCR bias. Internal studies have shown that the approach can successfully be used to target and sequence the CAG repeat responsible for Huntington’s disease, the repeat associated with ALS, and more. The approach allows for pooling many samples and sequencing with a single SMRT Cell.
Targeted sequencing has proven to be an economical means of obtaining sequence information for one or more defined regions of a larger genome. However, most target enrichment methods are reliant upon some form of amplification. Amplification removes the epigenetic marks present in native DNA, and some genomic regions, such as those with extreme GC content and repetitive sequences, are recalcitrant to faithful amplification. Yet, a large number of genetic disorders are caused by expansions of repeat sequences. Furthermore, for some disorders, methylation status has been shown to be a key factor in the mechanism of disease. We have developed a…
In this PAG 2018 presentation, Tanya Renner of Pennsylvania State University shares research using PacBio SMRT Sequencing to understand the genomes and transcriptomes of carnivorous plants. She describes the humped bladderwort, Utricularia gibba, as having an extreme genome due to its small size (100 Mbp) despite containing numerous tandem gene duplications and having undergone two whole genome duplications. Renner shares ongoing research into two Drosera species, commonly known as sundews, which through whole genome sequencing are illuminating carnivorous plant genome structural evolution including the transition from monocentric to holocentric chromosomes.
This tutorial provides an overview of the PacBio Demultiplex Barcodes analysis application in SMRT Link, followed by de novo assembly of the demultiplexed samples using HGAP4 for the Multiplexed Microbial Assembly analysis application. This tutorial covers features of SMRT Link v5.1.0.
In this PacBio User Group Meeting presentation, Jonas Korlach and Roberto Lleras share the latest updates to the structural variation application and analysis tools.
In this PacBio User Group Meeting presentation, Tim Smith of the USDA’s Agricultural Research Service describes efforts to generate reference-grade genome assemblies for various bovine species and analyze them to understand factors such as how selective breeding has affected certain breeds. Genome assemblies he presents span cattle, water buffalo, and gaur. Smith shows data for each assembly, noting that as data production shifted to the Sequel System, long-read PacBio data became even better at producing highly contiguous assemblies.
In this presentation Fritz Sedlazeck describes his latest work to obtain comprehensive genomes leveraging long-read sequencing and linked reads.
In this webinar, Jenny Ekholm and Paul Kotturi provide an overview of the PacBio No-Amp targeted sequencing application and its uses for targeting hard-to-amplify genes. This approach couples CRISPR-Cas9 with Single Molecule, Real Time (SMRT) Sequencing to enrich targets, without the need for PCR amplification, and generate complete sequence information with base-level resolution.
In this Labroots webinar, Meredith Ashby, Director of Microbial Genomics at PacBio, describes the utility of highly accurate long-read sequencing, known as HiFi sequencing, to understand the SARs-CoV-2 viral genome. HiFi sequencing enables mutation phasing and rare variant detection to understand viral stability and mutation rates, as well as providing insights into viral population structure for monitoring viral evolution. Ashby also shares how HiFi sequencing can be used to explore the host immune response to COVID-19, specifically by providing full-length sequencing of the B cell repertoire, IGH locus and HLA genes. Access additional COVID-19 Sequencing Tools and Resources.
We show that linearizing and directly sequencing full-length fosmids simplifies the assembly problem such that it is possible to unambiguously assemble individual haplotypes for the highly repetitive 100-200 kb killer Ig-like receptor (KIR) gene loci of chromosome 19. A tiling of targeted fosmids can be used to clone extended lengths of genomic DNA, 100s of kb in length, but repeat complexity in regions of particular interest, such as the KIR locus, means that sequence assembly of pooled samples into complete haplotypes is difficult and in many cases impossible. The current maximum read length generated by SMRT Sequencing exceeds the length…
Single Molecule, Real-Time (SMRT) Sequencing holds promise for addressing new frontiers in large genome complexities, such as long, highly repetitive, low-complexity regions and duplication events, and differentiating between transcript isoforms that are difficult to resolve with short-read technologies. We present solutions available for both reference genome improvement (>100 MB) and transcriptome research to best leverage long reads that have exceeded 20 Kb in length. Benefits for these applications are further realized with consistent use of size-selection of input sample using the BluePippin™ device from Sage Science. Highlights from our genome assembly projects using the latest P5-C3 chemistry on model organisms…
AGBT 2015 Workshop Presentation Slides: Dick McCombie from Cold Spring Harbor Laboratory (CSHL) described the use of SMRT Sequencing to analyze a breast cancer cell line with complex genomic events. Still ongoing, the project has already uncovered structural variants missed by other sequencers.
Numerous whole genome sequencing projects already achieved or ongoing have highlighted the fact that obtaining a high quality genome sequence is necessary to address comparative genomics questions such as structural variations among genotypes and gain or loss of specific function. Despite the spectacular progress that has been done regarding sequencing technologies, accurate and reliable data are still challenging, at the whole genome scale but also when targeting specific genomic regions. These issues are even more noticeable for complex plant genomes. Most plant genomes are known to be particularly challenging due to their size, high density of repetitive elements and various…
For many of the repeat expansion disorders, the disease gene has been discovered, however the underlying biological mechanisms have not yet been fully understood. This is mainly due to technological limitations that do not allow for the needed base-pair resolution of the long, repetitive genomic regions. We have developed a novel, amplification-free enrichment technique that uses the CRISPR/Cas9 system to target large repeat expansions. This method, in conjunction with PacBio’s long reads and uniform coverage, enables sequencing of these complex genomic regions. By using a PCR-free amplification method, we are able to access not only the repetitive elements and interruption…
Genes associated with several neurological disorders have been shown to be highly polymorphic. Targeted sequencing of these genes using NGS technologies is a powerful way to increase the cost-effectiveness of variant discovery and detection. However, for a comprehensive view of these target genes, it is necessary to have complete and uniform coverage across regions of interest. Unfortunately, short-read sequencing technologies are not ideal for these types of studies as they are prone to mis-mapping and often fail to span repetitive regions. Targeted sequencing with PacBio long reads provides the unique advantage of single-molecule observations of complex genomic regions. PacBio long…