X

Quality Statement

Pacific Biosciences is committed to providing high-quality products that meet customer expectations and comply with regulations. We will achieve these goals by adhering to and maintaining an effective quality-management system designed to ensure product quality, performance, and safety.

X

Image Use Agreement

By downloading, copying, or making any use of the images located on this website (“Site”) you acknowledge that you have read and understand, and agree to, the terms of this Image Usage Agreement, as well as the terms provided on the Legal Notices webpage, which together govern your use of the images as provided below. If you do not agree to such terms, do not download, copy or use the images in any way, unless you have written permission signed by an authorized Pacific Biosciences representative.

Subject to the terms of this Agreement and the terms provided on the Legal Notices webpage (to the extent they do not conflict with the terms of this Agreement), you may use the images on the Site solely for (a) editorial use by press and/or industry analysts, (b) in connection with a normal, peer-reviewed, scientific publication, book or presentation, or the like. You may not alter or modify any image, in whole or in part, for any reason. You may not use any image in a manner that misrepresents the associated Pacific Biosciences product, service or technology or any associated characteristics, data, or properties thereof. You also may not use any image in a manner that denotes some representation or warranty (express, implied or statutory) from Pacific Biosciences of the product, service or technology. The rights granted by this Agreement are personal to you and are not transferable by you to another party.

You, and not Pacific Biosciences, are responsible for your use of the images. You acknowledge and agree that any misuse of the images or breach of this Agreement will cause Pacific Biosciences irreparable harm. Pacific Biosciences is either an owner or licensee of the image, and not an agent for the owner. You agree to give Pacific Biosciences a credit line as follows: "Courtesy of Pacific Biosciences of California, Inc., Menlo Park, CA, USA" and also include any other credits or acknowledgments noted by Pacific Biosciences. You must include any copyright notice originally included with the images on all copies.

IMAGES ARE PROVIDED BY Pacific Biosciences ON AN "AS-IS" BASIS. Pacific Biosciences DISCLAIMS ALL REPRESENTATIONS AND WARRANTIES, EXPRESS, IMPLIED OR STATUTORY, INCLUDING, BUT NOT LIMITED TO, NON-INFRINGEMENT, OWNERSHIP, MERCHANTABILITY AND FITNESS FOR A PARTICULAR PURPOSE. IN NO EVENT SHALL Pacific Biosciences BE LIABLE FOR ANY DIRECT, INDIRECT, INCIDENTAL, PUNITIVE, OR CONSEQUENTIAL DAMAGES OF ANY KIND WHATSOEVER WITH RESPECT TO THE IMAGES.

You agree that Pacific Biosciences may terminate your access to and use of the images located on the PacificBiosciences.com website at any time and without prior notice, if it considers you to have violated any of the terms of this Image Use Agreement. You agree to indemnify, defend and hold harmless Pacific Biosciences, its officers, directors, employees, agents, licensors, suppliers and any third party information providers to the Site from and against all losses, expenses, damages and costs, including reasonable attorneys' fees, resulting from any violation by you of the terms of this Image Use Agreement or Pacific Biosciences' termination of your access to or use of the Site. Termination will not affect Pacific Biosciences' rights or your obligations which accrued before the termination.

I have read and understand, and agree to, the Image Usage Agreement.

I disagree and would like to return to the Pacific Biosciences home page.

Pacific Biosciences
Contact:

Data Release: Highest-Quality, Most Contiguous Individual Human Genome Assembly to Date

Monday, October 8, 2018

We’re proud to announce the release of the most contiguous diploid human genome assembly of a single individual to date, representing the nearly complete DNA sequence from all 46 chromosomes inherited from both parents. The sample used was derived from a Puerto Rican female who has been included in population genetics studies such as the 1000 Genomes Project. The phased diploid assembly will give unprecedented views of population-specific variation through the long-range resolution of maternal and paternal haplotypes.

This work is part of a larger effort in the field of personalized medicine and human genomics to add ethnic diversity to the available human reference genomes. More than 40 global initiatives are currently underway to apply de novo assembly methods to individuals representing multiple ethnic populations. Notable among these initiatives is the McDonnell Genome Institute at Washington Universitywhich has contributed 11 high-quality PacBio genomes for individuals representing populations from Africa, Asia, Europe, and the Americas.

Our approach to the Puerto Rican genome relied upon the current best practices for de novo assembly while also pushing read lengths ever longer and adding new methods and data types to better tackle the problem of diploid genome assembly.

Summary of sequencing technologies used in recent human genome assemblies. SMRT Sequencing has been used in 94% of the most contiguous assemblies.

The Puerto Rican sample was sequenced on the Sequel System with 2.1 chemistry and v5.1 software using a large insert library aggressively size selected to 35 kb. The resulting contig assembly totaling 2.89 Gb has the highest contiguity to date, with half of the genome contained in gapless contigs longer than 27 Mb. These results are even better than the consistently stellar assemblies MGI has been producing, which typically have contig N50s of 20-25 Mb.

Like the MGI genomes, the new Puerto Rican genome was assembled using FALCON, but with a newer version of FALCON-Unzip that includes algorithmic improvements to phasing and accuracy. Nearly 85% of the genome was resolved as maternal and paternal haplotypes, with more than 600 Mb of sequence in haplotype blocks longer than 1 Mb. An analysis of variants within phase blocks indicates high accuracy with 95% of SNPs showing concordant inheritance from a single parent.

In addition to the improvements in PacBio’s FALCON-Unzip assembler, the Puerto Rican assembly includes the novel use of Hi-C data to extend phasing between haplotype blocks. In collaboration with Phase Genomics, PacBio developed a new method for enhanced phasing that does not rely on family trio data. The new method, called FALCON-Phase, maps ultra-long range Hi-C reads to the FALCON-Unzip contigs to extend phasing to the contig scale. The Hi-C data was also used to scaffold the phased contigs before performing another round of phasing on the scaffolds.

The resulting assembly consists of 46 chromosome-scale scaffolds, representing the maternal and paternal chromosome set for the Puerto Rican individual. Each set of 23 scaffolds contain only 511 gaps and are a total of 2.83 Gb long. The remainder of each haploid genome is contained in 260 scaffolds of 63 Mb in length.

 

Data Links

BioProject: https://www.ncbi.nlm.nih.gov/bioproject/?term=PRJNA483067

SRA: https://www.ncbi.nlm.nih.gov/sra?linkname=bioproject_sra_all&from_uid=483067

Genome: https://www.ncbi.nlm.nih.gov/genome/?term=RBJD00000000 (currently not live)

https://downloads.pacbcloud.com/public/dataset/HG00733_assembly/

 

Method Links

https://github.com/PacificBiosciences/pb-assembly

https://github.com/phasegenomics/FALCON-Phase

https://www.biorxiv.org/content/early/2018/05/21/327064

https://www.biorxiv.org/content/early/2018/02/26/271486

 

 

Subscribe for blog updates:

Archives