Menu
February 25, 2021  |  

Haplotype-resolved diverse human genomes and integrated analysis of structural variation

Long-read and strand-specific sequencing technologies together facilitate the de novo assembly of high-quality haplotype-resolved human genomes without parent{\textendash}child trio data. We present 64 assembled haplotypes from 32 diverse human genomes. These highly contiguous haplotype assemblies (average contig N50: 26 Mbp) integrate all forms of genetic variation even across complex loci. We identify 107,590 structural variants (SVs), of which 68\% are not discovered by short-read sequencing, and 278 SV hotspots (spanning megabases of gene-rich sequence). We characterize 130 of the most active mobile element source elements and find that 63\% of all SVs arise by homology-mediated mechanisms. This resource enables reliable graph-based genotyping from short reads of up to 50,340 SVs, resulting in the identification of 1,526 expression quantitative trait loci as well as SV candidates for adaptive selection within the human population.


February 25, 2021  |  

Pathogenic 12-kb copy-neutral inversion in syndromic intellectual disability identified by high-fidelity long-read sequencing

We report monozygotic twin girls with syndromic intellectual disability who underwent exome sequencing but with negative pathogenic variants. To search for variants that are unrecognized by exome sequencing, high-fidelity long-read genome sequencing (HiFi LR-GS) was applied. A 12-kb copy-neutral inversion was precisely identified by HiFi LR-GS after trio-based variant filtering. This inversion directly disrupted two genes, CPNE9 and BRPF1, the latter of which attracted our attention because pathogenic BRPF1 variants have been identified in autosomal dominant intellectual developmental disorder with dysmorphic facies and ptosis (IDDDFP), which later turned out to be clinically found in the twins. Trio-based HiFi LR-GS together with haplotype phasing revealed that the 12-kb inversion occurred de novo on the maternally transmitted chromosome. This study clearly indicates that submicroscopic copy-neutral inversions are important but often uncharacterized culprits in monogenic disorders and that long-read sequencing is highly advantageous for detecting such inversions involved in genetic diseases.


Talk with an expert

If you have a question, need to check the status of an order, or are interested in purchasing an instrument, we're here to help.