April 21, 2020

High-coverage, long-read sequencing of Han Chinese trio reference samples.

Authors: Wang, Ying-Chih and Olson, Nathan D and Deikus, Gintaras and Shah, Hardik and Wenger, Aaron M and Trow, Jonathan and Xiao, Chunlin and Sherry, Stephen and Salit, Marc L and Zook, Justin M and Smith, Melissa and Sebra, Robert

Single-molecule long-read sequencing datasets were generated for a son-father-mother trio of Han Chinese descent that is part of the Genome in a Bottle (GIAB) consortium portfolio. The dataset was generated using the Pacific Biosciences Sequel System. The son and each parent were sequenced to an average coverage of 60x and 30x, respectively, with N50 subread lengths between 16 and 18 kb. Raw reads and reads aligned to both GRCh37 and GRCh38 are available at the NCBI GIAB FTP site (ftp://ftp-trace.ncbi.nlm.nih.gov/giab/ftp/data/ChineseTrio/). The GRCh38-aligned read data are archived in NCBI SRA (SRX4739017, SRX4739121, and SRX4739122). This dataset is available for anyone to develop and evaluate long-read bioinformatics methods.
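
For readers unfamiliar with the N50 statistic quoted above, the following is a minimal Python sketch of the standard definition: the length L such that reads of length L or longer contain at least half of all sequenced bases. It is an illustration only, not the authors' pipeline; the function name `n50` and the toy read lengths are assumptions made for this example and do not come from the dataset.

```python
def n50(lengths):
    """Return the N50 of a list of read lengths (in bases).

    N50 is the length L such that reads of length >= L together
    account for at least half of the total number of bases.
    """
    if not lengths:
        raise ValueError("no read lengths given")
    ordered = sorted(lengths, reverse=True)  # longest reads first
    half_total = sum(ordered) / 2
    running = 0
    for length in ordered:
        running += length
        if running >= half_total:
            return length


# Hypothetical toy lengths, chosen only to illustrate the calculation.
toy_lengths = [2000, 4000, 6000, 8000, 10000]
toy_n50 = n50(toy_lengths)
```

With real data, the list of lengths would come from the subreads themselves, for example by reading them from the downloaded read files with a sequence-parsing library of your choice.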

Journal: Scientific data
DOI: 10.1038/s41597-019-0098-2
Year: 2019
