Pacbio long reads Archives - Page 3 of 7

June 1, 2021 |

Full-length cDNA sequencing of prokaryotic transcriptome and metatranscriptome samples

Next-generation sequencing has become a useful tool for studying transcriptomes. However, these methods typically rely on sequencing short fragments of cDNA, then attempting to assemble the pieces into full-length transcripts. Here, we describe a method that uses PacBio long reads to sequence full-length cDNAs from individual transcriptomes and metatranscriptome samples. We have adapted the PacBio Iso-Seq protocol for use with prokaryotic samples by incorporating RNA polyadenylation and rRNA-depletion steps. In conjunction with SMRT Sequencing, which has average readlengths of 10-15 kb, we are able to sequence entire transcripts, including polycistronic RNAs, in a single read. Here, we show full-length bacterial transcriptomes with the ability to visualize transcription of operons. In the area of metatranscriptomics, long reads reveal unambiguous gene sequences without the need for post-sequencing transcript assembly. We also show full-length bacterial transcripts sequenced after being treated with NEB’s Cappable-Seq, which is an alternative method for depleting rRNA and enriching for full-length transcripts with intact 5’ ends. Combining Cappable-Seq with PacBio long reads allows for the detection of transcription start sites, with the additional benefit of sequencing entire transcripts.

June 1, 2021 |

Population-scale discovery of structural variants with PacBio SMRT Sequencing

Structural variants (SVs) – genomic differences =50 base pairs – are few by count compared to single nucleotide variants (SNVs) and indels but include most of the base pairs that differ between two humans.

June 1, 2021 |

Single molecule high-fidelity (HiFi) Sequencing with >10 kb libraries

Recent improvements in sequencing chemistry and instrument performance combine to create a new PacBio data type, Single Molecule High-Fidelity reads (HiFi reads). Increased read length and improvement in library construction enables average read lengths of 10-20 kb with average sequence identity greater than 99% from raw single molecule reads. The resulting reads have the accuracy comparable to short read NGS but with 50-100 times longer read length. Here we benchmark the performance of this data type by sequencing and genotyping the Genome in a Bottle (GIAB) HG0002 human reference sample from the National Institute of Standards and Technology (NIST). We further demonstrate the general utility of HiFi reads by analyzing multiple clones of Cabernet Sauvignon. Three different clones were sequenced and de novo assembled with the CANU assembly algorithm, generating draft assemblies of very high contiguity equal to or better than earlier assembly efforts using PacBio long reads. Using the Cabernet Sauvignon Clone 8 assembly as a reference, we mapped the HiFi reads generated from Clone 6 and Clone 47 to identify single nucleotide polymorphisms (SNPs) and structural variants (SVs) that are specific to each of the three samples.

June 1, 2021 |

Structural variant detection with long read sequencing reveals driver and passenger mutations in a melanoma cell line

Past large scale cancer genome sequencing efforts, including The Cancer Genome Atlas and the International Cancer Genome Consortium, have utilized short-read sequencing, which is well-suited for detecting single nucleotide variants (SNVs) but far less reliable for detecting variants larger than 20 base pairs, including insertions, deletions, duplications, inversions and translocations. Recent same-sample comparisons of short- and long-read human reference genome data have revealed that short-read resequencing typically uncovers only ~4,000 structural variants (SVs, =50 bp) per genome and is biased towards deletions, whereas sequencing with PacBio long-reads consistently finds ~20,000 SVs, evenly balanced between insertions and deletions. This discovery has important implications for cancer research, as it is clear that SVs are both common and biologically important in many cancer subtypes, including colorectal, breast and ovarian cancer. Without confident and comprehensive detection of structural variants, it is unlikely we have a sufficiently complete picture of all the genomic changes that impact cancer development, disease progression, treatment response, drug resistance, and relapse. To begin to address this unmet need, we have sequenced the COLO829 tumor and matched normal lymphoblastoid cell lines to 49- and 51-fold coverage, respectively, with PacBio SMRT Sequencing, with the goal of developing a high-confidence structural variant call set that can be used to empirically evaluate cost-effective experimental designs for larger scale studies and develop structural variation calling software suitable for cancer genomics. Structural variant calling revealed over 21,000 deletions and 19,500 insertions larger than 20 bp, nearly four times the number of events detected with short-read sequencing. The vast majority of events are shared between the tumor and normal, with about 100 putative somatic deletions and 400 insertions, primarily in microsatellites. A further 40 rearrangements were detected, nearly exclusively in the tumor. One rearrangement is shared between the tumor and normal, t(5;X) which disrupts the mismatch repeat gene MSH3, and is likely a driver mutation. Generating high-confidence call sets that cover the entire size-spectrum of somatic variants from a range of cancer model systems is the first step in determining what will be the best approach for addressing an ongoing blind spot in our current understanding of cancer genomes. Here the application of PacBio sequencing to a melanoma cancer cell line revealed thousands of previously overlooked variants, including a mutation likely involved in tumorogenesis.

June 1, 2021 |

Structural variant detection in crops using low-fold coverage long-read sequencing

Genomics studies have shown that the insertions, deletions, duplications, translocations, inversions, and tandem repeat expansions in the structural variant (SV) size range (>50 bp) contribute to the evolution of traits and often have significant associations with agronomically important phenotypes. However, most SVs are too small to detect with array comparative genomic hybridization and too large to reliably discover with short-read DNA sequencing. While de novo assembly is the most comprehensive way to identify variants in a genome, recent studies in human genomes show that PacBio SMRT Sequencing sensitively detects structural variants at low coverage. Here we present SV characterization in the major crop species Oryza sativa subsp. indica (rice) with low-fold coverage of long reads. In addition, we provide recommendations for sequencing and analysis for the application of this workflow to other important agricultural species.

June 1, 2021 |

Copy-number variant detection with PacBio long reads

Long-read sequencing of diverse humans has revealed more than 20,000 insertion, deletion, and inversion structural variants spanning more than 12 Mb in a healthy human genome. Most of these variants are too large to detect with short reads and too small for array comparative genome hybridization (aCGH). While the standard approaches to calling structural variants with long reads thrive in the 50 bp to 10 kb size range, they tend to miss exactly the large (>50 kb) copy-number variants that are called more readily with aCGH. Standard algorithms rely on reference-based mapping of reads that fully span a variant or on de novo assembly; and copy-number variants are often too large to be spanned by a single read and frequently involve segmentally duplicated sequence that is not yet included in most de novo assemblies. To comprehensively detect large variants in human genomes, we extended pbsv – a structural variant caller for long reads – to call copy-number variants (CNVs) from read-clipping and read-depth signatures. In human germline benchmark samples, we detect more than 300 CNVs spanning around 10 Mb, and we call hundreds of additional events in re-arranged cancer samples. Together with insertion, deletion, inversion, duplication, and translocation calling from spanning reads, this allows pbsv to comprehensively detect large variants from a single data type.

February 5, 2021 |

Customer Experience: For complete view of microbial function, 100k genome project team uses PacBio’s SMRT Sequencing

Bart Weimer, a professor at the University of California, Davis, who is leading the 100K Foodborne Pathogen Genome Project, talks about using PacBio sequencing to produce long reads for microbial…

February 5, 2021 |

Customer Experience (in Spanish): At TGAC, PacBio’s long reads are key for complex genomics studies

Mario Caccamo, head of bioinformatics at The Genome Analysis Centre (TGAC) in the UK, integrates many different sequencing technologies to get the best of each for optimal genome assemblies, analysis,…

February 5, 2021 |

Customer Experience: At TGAC, PacBio’s long reads are key for complex genomics studies

February 5, 2021 |

Podcast: Long-read sequencing dramatically improves blood matching – Steven Marsh

One of the popular questions on the Mendelspod program is how those doing sequencing decide between the quality of PacBio’s long reads and the cheaper short read technology, such as…

February 5, 2021 |

Podcast: The 9 billion people problem – Rod Wing on plant genomics

By 2050, there will be 9 billion people on the planet. What will they eat? This is the question that led Rod Wing, Director of the Arizona Genomics Institute, into…

February 5, 2021 |

ASHG PacBio Workshop: Identification and characterization of informative genetic structural variants for neurodegenerative diseases

Michael Lutz, from the Duke University Medical Center, discussed a recently published software tool that can now be used in a pipeline with SMRT Sequencing data to find structural variant…

February 5, 2021 |

ASHG Virtual Poster: Effect of coverage depth and haplotype phasing on structural variant detection with PacBio long reads

PacBio bioinformatician Aaron Wenger presents this ASHG 2016 poster demonstrating human structural variation detection at varying coverage levels with SMRT Sequencing on the Sequel System. Results were compared to truth…

February 5, 2021 |

Webinar: Addressing “NGS Dead Zones” with third generation PacBio sequencing

SMRT Sequencing is a DNA sequencing technology characterized by long read lengths and high consensus accuracy, regardless of the sequence complexity or GC content of the DNA sample. These characteristics…

February 5, 2021 |

Webinar: Beginner’s guide to PacBio SMRT Sequencing data analysis

PacBio SMRT Sequencing is fast changing the genomics space with its long reads and high consensus sequence accuracy, providing the most comprehensive view of the genome and transcriptome. In this…

Auto Tag: Pacbio long reads

Full-length cDNA sequencing of prokaryotic transcriptome and metatranscriptome samples

Population-scale discovery of structural variants with PacBio SMRT Sequencing

Single molecule high-fidelity (HiFi) Sequencing with >10 kb libraries

Structural variant detection with long read sequencing reveals driver and passenger mutations in a melanoma cell line

Structural variant detection in crops using low-fold coverage long-read sequencing

Copy-number variant detection with PacBio long reads

Customer Experience: For complete view of microbial function, 100k genome project team uses PacBio’s SMRT Sequencing

Customer Experience (in Spanish): At TGAC, PacBio’s long reads are key for complex genomics studies

Customer Experience: At TGAC, PacBio’s long reads are key for complex genomics studies

Podcast: Long-read sequencing dramatically improves blood matching – Steven Marsh

Podcast: The 9 billion people problem – Rod Wing on plant genomics

ASHG PacBio Workshop: Identification and characterization of informative genetic structural variants for neurodegenerative diseases

ASHG Virtual Poster: Effect of coverage depth and haplotype phasing on structural variant detection with PacBio long reads

Webinar: Addressing “NGS Dead Zones” with third generation PacBio sequencing

Webinar: Beginner’s guide to PacBio SMRT Sequencing data analysis

Subscribe for blog updates:

Filter by topic

Talk with an expert

ALS case study

Subscribe for blog updates:

Filter by topic

Talk with an expert