Doreen Ware introduces her team’s new assembly of maize, built with PacBio long-read sequencing and genome maps from BioNano Genomics. With a contig N50 of nearly 10 Mb and more complete information than any previous assembly, Ware says, “This is just an amazing time to be a plant scientist.” Her presentation includes a number of highlights from the new assembly, which may help crop improvement efforts for maize.
Euan Ashley speaks about precision medicine and said clinical-grade analysis has been limited by complex regions in the human genome. His key theme,”Precision medicine needs to be accurate medicine,” was illustrated with several examples where short-read sequencing or traditional clinical sequencing methods failed to be accurate. Also included: targeted RNA sequencing and gene phasing with long-read sequencing.
Jim Lupski is a professor at Baylor College of Medicine where he’s on the frontline of incorporating genomic research into everyday clinical practice. The story begins with Jim’s own genome, which is perhaps the most sequenced genome ever. Jim’s life as a leading genomic researcher has been driven in part for a strong personal reason. He has a rare genetic disease named after three researchers who first defined it, Charcot Marie Tooth Neuropathy. What began as a personal journey to uncover the source of his own disease led Jim to seminal work that launched the field of structural variation. Working…
Grant Cramer from the University of Nevada, Reno, and Dario Cantu from the Univeristy of Callifornia, Davis, discuss past challenges with sequencing Clone 8 of Cabernet Sauvignon (Vitis vinifera). An assembly of the genome was attempted with approximately 110x Illumina reads and 5x PacBio reads. The PacBio SMRT Sequencing read made major improvements in the assembly compared with the results of Illumina reads only. However, the assembly results were still unsatisfactory, so an additional 100-fold SMRT Sequencing coverage had been generated. An update on the current sequencing results and status of the assembly are presented.
Adam Ameur talks about a range of applications for which SMRT Sequencing had been useful in the SciLifeLab. Examples include analyzing a DNA translocation in chronic myeloid leukemia samples; studying the HPV genome; and sequencing the FADS region to understand fatty acid production.
Sergey Koren of the National Biodefense Analysis and Countermeasures Center (NBACC) discusses integrating the MinHash Alignment Process (MHAP) with Celera Assembler to enable reference-grade assemblies of model organisms, revealing novel heterochromatic sequences and filling low-complexity gap sequences in the GRCh38 human reference genome. Dr. Koren and his team have applied this method to assemble the San Clemente goat genome. Combining SMRT Sequencing and next-generation optical mapping from BioNano Genomics generates an assembly that is over 150-fold more contiguous than the latest Capra hircusgoat reference. In combination with Hi-C sequencing, the assembly surpasses reference assemblies de novo, with minimal manual intervention.…
Alan Archibald compares two new de novo PacBio pig genome assemblies to a previously released draft genome, finding significant improvement that could be important for breeding programs. In one example, he shows chromosome 1, which was split into more than 9,000 contigs in the draft genome, is now represented in just 10 contigs.
Tim Smith, molecular geneticist at the USDA Agricutural Research Service (ARS) in Clay Center, Nebraska, and director of the U.S. Meat Animal Reseach Center Core Facilities, discusses the USDA’s efforts to improve the goat, swine, cattle, and sheep genomes through long read-based de novoassemblies scaffolded with a variety of approaches. Recent advances in long-read sequencing, combined with new technologies for scaffolding the resulting contigs, have made it possible to make a significant change in the quality of genome assemblies for a very small fraction of the price required to create the originals. Although a change of reference genomes incurs cost,…
Christopher Hill presents data from efforts to produce reference-grade assemblies for the great ape species. Using SMRT Sequencing, Hill and his colleagues are generating assemblies with much higher contiguity to resolve repetitive and other particularly complex regions. In this talk, he focuses on data from their new high-quality gorilla assembly.
Karyn Meltz Steinberg presents the first high quality African reference genome assembly of the Yoruban individual, NA19240, produced from SMRT Sequencing data. She said PacBio sequencing offers significant improvement over short-read sequence data for high-quality assemblies.
PacBio scientist Cheryl Heiner describes new low-input protocols for SMRT Sequencing library construction. With these revised methods, 2 kb libraries can be generated from as little as 10 ng of DNA, while 10 kb libraries require only 100 ng of sample.
In this AGBT virtual poster video, Jason Chin, a bioinformatician at PacBio, describes a polyploidy-aware de novo assembly approach called FALCON and a new algorithm, dubbed FALCON-unzip, that involves “unzipping” diploid genomes for de novo haplotype reconstructions from SMRT Sequencing data. These methods are illustrated in a studies of fungal, Arabidopsis and human datasets for the resolution of structural variation and characterization of haplotypes.
In this AGBT poster, PacBio bioinformatician Matthew Seetin presents a new assembly for Aedes aegypti cell line, the mosquito responsible for spreading viruses like Dengue and Zika. SMRT Sequencing generated a gapless assembly with a contig N50 of 1.4 Mb, compared to 82 kb in the previous assembly. The genome features a number of transposable elements and long tandem repeats.
Swati Ranade from PacBio presents her AGBT poster demonstrating the use of SMRT Sequencing to characterize complex immune regions from human, macaque, and hummingbird. Included: a de novo assembly of complete KIR haplotypes, the MHC region, and MHC alleles.