One of the major challenges of de novo mammalian genome assembly arises from the presence of large, interspersed segmental duplications with high levels of sequence identity. These regions are particularly difficult to assemble using current short-read high-throughput sequencing methods. Combining long-read single-molecule, real-time (SMRT) sequencing with a hierarchical genome-assembly process (HGAP), as well as the consensus and variant caller Quiver, enabled these complex genomic regions to be resolved in a more cost-and time-effective manner than previously possible.
Journal: Nature reviews. Genetics