Menu
September 22, 2019  |  

CompStor Novos: a low cost yet fast assembly-based variant calling for personal genomes

Authors: Oenning, Travis and Bae, Taejeong and Iyengar, Aravind and Brickner, Barrett and Soysa, Madushanka and Wright, Nicholas and Kumar, Prasanth and Indupuru, Suneel and Abyzov, Alexej and Coker, Jonathan

Application of assembly methods for personal genome analysis from next generation sequencing data has been limited by the requirement for an expensive supercomputer hardware or long computation times when using ordinary resources. We describe CompStor Novos, achieving supercomputer-class performance in de novo assembly computation time on standard server hardware, based on a tiered-memory algorithm. Run on commercial off-the-shelf servers, Novos assembly is more precise and 10-20 times faster than that of existing assembly algorithms. Furthermore, we integrated Novos into a variant calling pipeline and demonstrate that both compute times and precision of calling point variants and indels compare well with standard alignment-based pipelines. Additionally, assembly eliminates bias in the estimation of allele frequency for indels and naturally enables discovery of breakpoints for structural variants with base pair resolution. Thus, Novos bridges the gap between alignment-based and assembly-based genome analyses. Extension and adaption of its underlying algorithm will help quickly and fully harvest information in sequencing reads for personal genome reconstruction.

Journal: BioRxiv
DOI: 10.1101/486092
Year: 2018

Read publication

Talk with an expert

If you have a question, need to check the status of an order, or are interested in purchasing an instrument, we're here to help.