Highly accurate long reads – HiFi reads – with single-molecule resolution make Single Molecule, Real-Time (SMRT) Sequencing ideal for full-length 16S rRNA sequencing, shotgun metagenomic profiling, and metagenome assembly.
Interested to learn about pangenomes? Explore this guide to learn how they provide a more complete picture of the core genes of a given species and how that can provide better biological understanding.
Fritz Sedlazeck, a postdoc at Johns Hopkins University, describes his structural variant detection tool Sniffles in this poster from AGBT 2016. Included: examples of structural variants that could not be detected with other algorithms.
To make improvements to crops like corn, soybeans, and canola, scientists at Corteva are building a compendium of crop genomics resources to provide actionable sequence info for genetic discovery, gene-editing, and seed product development. Hear how Kevin Fengler, Comparative Genomics Lead of Data Science and Bioinformatics at Corteva, is using PacBio sequences to build visualization tools and genome assembly pipelines as a contribution to this effort.
Understanding interactions among plants and the complex communities of organisms living on, in and around them requires more than one experimental approach. A new method for de novo metagenome assembly, PacBio HiFi sequencing, has unique strengths for determining the functional capacity of metagenomes. With HiFi sequencing, the accuracy and median read length of unassembled data outperforms the quality metrics for many existing assemblies generated with other technologies, enabling cost-competitive recovery of full-length genes and operons even from rare species. When paired with the ability to close the genomes of even challenging isolates like Xanthomonas, the PacBio Sequel II System is…
In this SMRT Leiden 2020 Online Virtual Event presentation Pedro Oliveira of Mount Sinai shares his research on Clostridioides – a leading cause of nosocomial-acquired diarrhea and colitis across the developed world. In this study, Oliveira and coworkers performed the first comprehensive DNA methylome analysis of 36 human C. difficile isolates from a hospital setting using SMRT Sequencing and comparative epigenomics.
In this SMRT Leiden 2020 Online Virtual Event presentation, Erwin Datema of KeyGene shares his work on using high-throughput, accurate long-read sequencing technologies, such as PacBio HiFi sequencing, to drastically reduced the investment required to generate high-quality genome sequences. As a result, they have shifted away from the reference-centric view of the genome, and entered the pan-genome era. Here, Datema highlights some of the breakthrough algorithmic innovations KeyGene has developed to generate and analyze population-scale pan-genomes for plant genomes of all complexities and sizes.
In this LabRoots webinar, Jonas Korlach the CSO of PacBio provides an introduction to PacBio HiFi sequence reads, which are both long (up to 25 kb currently) and accurate (>99%) at the individual single-molecule sequence read level andhave allowed for advances in de novo genome assemblies. Korlach reviews the characteristics of HiFi read data obtained with the Sequel II System, followed by examples of high-quality genome assemblies for human, plant and animal genomes including the different aspects of evaluating genome assemblies (contiguity, accuracy, completeness and allelic phasing) and illustrates their high quality by examples of resolving centromeres, telomeres, segmental duplications…
Dr. Wenger gives attendees an update on PacBio’s long-read sequencing and variant detection capabilities on the Sequel II System and shares recommendations on how to design your own study using HiFi reads. Then, Dr. Sund from Cincinnati Children’s Hospital Medical Center describes how she has used long-read sequencing to solve rare neurological diseases involving complex structural rearrangements that were previously unsolved with standard methods.
Introduction: Long-read sequencing has revealed more than 20,000 structural variants spanning over 12 Mb in a healthy human genome. Short-read sequencing fails to detect most structural variants but has remained the more effective approach for small variants, due to 10-15% error rates in long reads, and copy-number variants (CNVs), due to lack of effective long-read variant callers. The development of PacBio highly accurate long reads (HiFi reads) with read lengths of 10-25 kb and quality >99% presents the opportunity to capture all classes of variation with one approach.Methods: We sequence the Genome in a Bottle benchmark sample HG002 and an…
Shiga toxin-producing Escherichia coli (STEC) is an emerging pathogen. Recently there has been a global in the number of outbreaks caused by non-O157 STECs, typically involving six serogroups O26, O45, 0103, 0111, and 0145. STEC O145:H28 has been associated with severe human disease including hemolytic-uremic syndrome (HUS), and is demonstrated by the 2007 Belgian ice-cream-associated outbreak and 2010 US lettuce-associated outbreak, with over 10% of patients developing HUS in each. The goal of this work was to do comparative genomics of strains, clinical and environmental, to investigate genome diversity and virulence evolution of this important foodborne pathogen.
Whole genome sequencing can provide comprehensive information important for determining the biochemical and genetic nature of all elements inside a genome. The high-quality genome references produced from past genome projects and advances in short-read sequencing technologies have enabled quick and cheap analysis for simple variants. However even with the focus on genome-wide resequencing for SNPs, the heritability of more than 50% of human diseases remains elusive. For non-human organisms, high-contiguity references are deficient, limiting the analysis of genomic features. The long and unbiased reads from single molecule, real-time (SMRT) Sequencing and new de novo assembly approaches have demonstrated the ability…
As the costs for genome sequencing have decreased the number of “genome” sequences have increased at a rapid pace. Unfortunately, the quality and completeness of these so–called “genome” sequences have suffered enormously. We prefer to call such genome assemblies as “gene assembly space” (GAS). We believe it is important to distinguish GAS assemblies from reference genome assemblies (RGAs) as all subsequent research that depends on accurate genome assemblies can be highly compromised if the only assembly available is a GAS assembly.
Lameness is a significant problem resulting in millions of dollars in lost revenue annually. In commercial broilers, the most common cause of lameness is bacterial chondronecrosis with osteomyelitis (BCO). We are using a wire flooring model to induce lameness attributable to BCO. We used 16S ribosomal DNA sequencing to determine that Staphylococcus spp. were the main species associated with BCO. Staphylococcus agnetis, which previously had not been isolated from poultry, was the principal species isolated from the majority of the bone lesion samples. Administering S. agnetis in the drinking water to broilers reared on wire flooring increased the incidence of…
Understanding the genetic basis of infectious diseases is critical to enacting effective treatments, and several large-scale sequencing initiatives are underway to collect this information. Sequencing bacterial samples is typically performed by mapping sequence reads against genomes of known reference strains. While such resequencing informs on the spectrum of single nucleotide differences relative to the chosen reference, it can miss numerous other forms of variation known to influence pathogenicity: structural variations (duplications, inversions), acquisition of mobile elements (phages, plasmids), homonucleotide length variation causing phase variation, and epigenetic marks (methylation, phosphorothioation) that influence gene expression to switch bacteria from non-pathogenic to pathogenic…