Jason Chin, senior director of bioinformatics at PacBio, talks about using long-read sequence data and string graph assembly for assembling diploid genomes. A major challenge for diploid genome assembly is in distinguishing homologous regions from repeats, so he discusses how long reads are essential for resolving repeat regions. In the presentation, Chin displays data from two inbred Arabidopsis strains used to create a synthetic diploid assembly.
Jason Chin, senior director of bioinformatics at PacBio, talks about using long-read sequence data to generate diploid genome assemblies to produce comprehensive haplotype sequence reconstructions. In the presentation, Chin describes the FALCON Unzip process that combines SNP phasing with the assembly process and allows for determination of the haplotype sequences and identification of structural variants. He presents an example of diploid assembly from inbred Arabidopsis strains.
PacBio Sequencing is characterized by very long sequence reads (averaging > 10,000 bases), lack of GC-bias, and high consensus accuracy. These features have allowed the method to provide a new gold standard in de novo genome assemblies, producing highly contiguous (contig N50 > 1 Mb) and accurate (> QV 50) genome assemblies. We will briefly describe the technology and then highlight the full workflow, from sample preparation through sequencing to data analysis, on examples of insect genome assemblies, and illustrate the difference these high-quality genomes represent with regard to biological insights, compared to fragmented draft assemblies generated by short-read sequencing.
The goal of this session is to help users complete their PacBio genome assembly and generate the best resource for their research. Kingan begins with a brief review of the diploid assembly process used by FALCON and FALCON-Unzip, highlighting the enhanced phasing of the Unzip module, and concluding with recommendations for genome polishing. Next, she explores how heterozygosity can influence the assembly process and how read coverage depth along the assembly can reveal important characteristics of assembly structure. Kingan then recommends approaches, including specific tools, that can be used to quality filter and curate the assembly, including annotation-, coverage-, and…
In this presentation Fritz Sedlazeck describes his latest work to obtain comprehensive genomes leveraging long-read sequencing and linked reads.
In this webinar, Emily Hatas of PacBio shares information about the applications and benefits of SMRT Sequencing in plant and animal biology, agriculture, and industrial research fields. This session contains an overview of several applications: whole-genome sequencing for de novo assembly; transcript isoform sequencing (Iso-Seq) method for genome annotation; targeted sequencing solutions; and metagenomics and microbial interactions. High-level workflows and best practices are discussed for key applications.
In this PacBio User Group Meeting presentation, Garth Ehrlich of Drexel University College of Medicine shares his work on developing a microbiome assay that uses SMRT Sequencing to provide high-quality coverage of the 16S bacterial rRNA for species identification. The microbiome analysis pipeline, MCSMRT, takes advantage of PacBio circular consensus sequencing (CCS) technology and second-generation pathway analysis system for generating extremely high-fidelity sequences that provide the user with ultra-high-confidence species-level microbiome data.
In this PacBio User Group Meeting presentation, Jonas Korlach and Roberto Lleras share the latest updates to the structural variation application and analysis tools.
This video provides an overview of the techniques and steps of preparing samples, DNA, and libraries for PacBio Single Molecule, Real-Time (SMRT) Sequencing to be used in de novo assembly projects. In this video, a PacBio scientist covers how to assess DNA quantity and purity, size-selection of DNA libraries, and provides and introduction to SMRT Sequencing, including the benefits of long-reads when generating high-quality genome assemblies.
In this PacBio User Group Meeting presentation, Zev Kronenberg of PacBio presents on using the combination of PacBio and Phase Genomics data and analysis tools to create highly contiguous genome assemblies.
This video provides an overview of the techniques and steps of generating a de novo genome assembly with long-read sequencing data generated using PacBio Single Molecule, Real-Time (SMRT) Sequencing. In this video, a PacBio scientist covers the benefits of long reads when generating high-quality genome assemblies, the latest tools for creating assemblies, including HGAP, FALCON and FALCON-Unzip, how to polish and assess the quality of a genome assembly, and how to submit an assembly to NCBI.
In this PacBio User Group Meeting presentation, Chris Boles of Sage Science presents updates on the Sage System for getting the largest DNA fragments using the SageHLS.
In this PacBio User Group Meeting presentation, Tim Smith of the USDA’s Agricultural Research Service describes efforts to generate reference-grade genome assemblies for various bovine species and analyze them to understand factors such as how selective breeding has affected certain breeds. Genome assemblies he presents span cattle, water buffalo, and gaur. Smith shows data for each assembly, noting that as data production shifted to the Sequel System, long-read PacBio data became even better at producing highly contiguous assemblies.
In this PacBio User Group Meeting presentation, Bruce Kingham of the DNA Sequencing & Genotyping Center at the University of Delaware describes tips on using the FEMTO Pulse for large-insert libraries.
In this PacBio User Group Meeting presentation, Tina Graves-Lindsay of the McDonnell Genome Institute and the Genome Reference Consortium speaks about the importance of phasing human reference genomes. Her team is now working on its fifteenth human genome assembly — part of a major effort to improve genomic representation of ethnic diversity — with a pipeline that generates 60-fold PacBio coverage for a de novo assembly, followed by scaffolding with other technologies. They are also using FALCON-Unzip to separate haplotypes, leading to reference-grade diploid assemblies. This approach has already helped resolve errors seen in other genomes and even the gold-standard…