Menu
July 19, 2019  |  

Rapid detection of expanded short tandem repeats in personal genomics using hybrid sequencing.

Authors: Doi, Koichiro and Monjo, Taku and Hoang, Pham H and Yoshimura, Jun and Yurino, Hideaki and Mitsui, Jun and Ishiura, Hiroyuki and Takahashi, Yuji and Ichikawa, Yaeko and Goto, Jun and Tsuji, Shoji and Morishita, Shinichi

Long expansions of short tandem repeats (STRs), i.e. DNA repeats of 2-6 nt, are associated with some genetic diseases. Cost-efficient high-throughput sequencing can quickly produce billions of short reads that would be useful for uncovering disease-associated STRs. However, enumerating STRs in short reads remains largely unexplored because of the difficulty in elucidating STRs much longer than 100 bp, the typical length of short reads.We propose ab initio procedures for sensing and locating long STRs promptly by using the frequency distribution of all STRs and paired-end read information. We validated the reproducibility of this method using biological replicates and used it to locate an STR associated with a brain disease (SCA31). Subsequently, we sequenced this STR site in 11 SCA31 samples using SMRT(TM) sequencing (Pacific Biosciences), determined 2.3-3.1 kb sequences at nucleotide resolution and revealed that (TGGAA)- and (TAAAATAGAA)-repeat expansions determined the instability of the repeat expansions associated with SCA31. Our method could also identify common STRs, (AAAG)- and (AAAAG)-repeat expansions, which are remarkably expanded at four positions in an SCA31 sample. This is the first proposed method for rapidly finding disease-associated long STRs in personal genomes using hybrid sequencing of short and long reads.Our TRhist software is available at http://trhist.gi.k.u-tokyo.ac.jp/[email protected] data are available at Bioinformatics online.

Journal: Bioinformatics
DOI: 10.1093/bioinformatics/btt647
Year: 2014

Read publication

Talk with an expert

If you have a question, need to check the status of an order, or are interested in purchasing an instrument, we're here to help.