Pacific Biosciences’ SMRT sequencing method was used to extend the sequence of HLA-A*02:13. © 2019 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.
Hematopoietic cells are continuously replenished from progenitor cells that reside in the bone marrow. To evaluate molecular changes during this process, we analyzed the transcriptomes of freshly harvested human bone marrow progenitor (lineage-negative) and differentiated (lineage-positive) cells by single-molecule real-time (SMRT) full-length RNA-sequencing. This analysis revealed a ~5-fold higher number of transcript isoforms than previously detected and showed a distinct composition of individual transcript isoforms characteristic of bone marrow subpopulations. A detailed analysis of messenger RNA (mRNA) isoforms transcribed from the ANXA1 and EEF1A1 loci confirmed their distinct composition. The expression of proteins predicted from the transcriptome analysis was evaluated…
Next generation DNA sequencing is used to determine the HLA-A, -B, -C, -DRB1, -DRB3/4/5, and -DQB1 assignments of 1009 unrelated volunteers for the unrelated donor registry in The Netherlands. The analysis characterizes all HLA exons and introns for class I alleles; at least exons 2 to 3 for HLA-DRB1; and exons 2 to 6 for HLA-DQB1. Of the distinct alleles present, there are 229 class I and 71 class II; 36 of these alleles are novel. The majority (approximately 98%) of the cumulative allele frequency at each locus is contributed by alleles that appear three or more times. Alleles encoding…
Our understanding of sequence variation in the HLA-DPB1 gene is largely restricted to the hypervariable antigen recognition domain (ARD) encoded by exon 2. Here, we employed a redundant sequencing strategy combining long-read and short-read data to accurately phase and characterise in full length the majority of common and well-documented (CWD) DPB1 alleles as well as alleles with an observed frequency of at least 0.0006% in our predominantly European sample set. We generated 664 DPB1 sequences, comprising 279 distinct allelic variants. This allows us to present the most comprehensive analysis to date of the nature and extent of DPB1 sequence variation.…
HLA matching at an allelic-level resolution for volunteer unrelated donor (VUD) hematopoietic cell transplantation (HCT) results in improved survival and fewer post-transplant complications. Limitations in typing technologies used for the hyperpolymorphic HLA genes have meant that variations outside of the antigen recognition domain (ARD) have not been previously characterized in HCT. Our aim was to explore the extent of diversity outside of the ARD and determine the impact of this diversity on transplant outcome. Eight hundred ninety-one VUD-HCT donors and their recipients transplanted for a hematologic malignancy in the United Kingdom were retrospectively HLA typed at…
Chagas disease is a complex tropical pathology caused by the kinetoplastid Trypanosoma cruzi. This parasite displays massive genetic diversity and has been classified by international consensus into at least six Discrete Typing Units (DTUs) that are broadly distributed across the American continent. The main clinical manifestation of the disease is chronic chagasic cardiomyopathy (CCC), which is lethal in infected individuals. However, one intriguing feature is that only 30-40% of infected individuals develop CCC. Some authors have suggested that the immune response, host genetic factors, virulence factors and even the massive genetic heterogeneity of T. cruzi are…
Human leukocyte antigen (HLA) is a gene complex known for its exceptional diversity across populations, importance in organ and blood stem cell transplantation, and associations of specific alleles with various diseases. We constructed a Japanese reference panel of class I HLA genes (ToMMo HLA panel), comprising a distinct set of HLA-A, HLA-B, HLA-C, and HLA-H alleles, by single-molecule, real-time (SMRT) sequencing of 208 individuals included in the 1070 whole-genome Japanese reference panel (1KJPN). For high-quality allele reconstruction, we developed a novel pipeline, Primer-Separation Assembly and Refinement Pipeline (PSARP), in which the SMRT sequencing and additional short-read data were used. The…
The DNA sequencing technologies in use today produce either highly accurate short reads or less-accurate long reads. We report the optimization of circular consensus sequencing (CCS) to improve the accuracy of single-molecule real-time (SMRT) sequencing (PacBio) and generate highly accurate (99.8%) long high-fidelity (HiFi) reads with an average length of 13.5 kilobases (kb). We applied our approach to sequence the well-characterized human HG002/NA24385 genome and obtained precision and recall rates of at least 99.91% for single-nucleotide variants (SNVs), 95.98% for insertions and deletions […] 15 megabases (Mb) and concordance of 99.997%, substantially outperforming assembly with less-accurate long reads.
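
As a reading aid for the precision and recall figures quoted above, the following minimal sketch shows how the two rates are conventionally derived from counts of true-positive, false-positive and false-negative variant calls. The function name and the example counts are hypothetical and are not taken from the study, which does not describe its benchmarking code here.

```python
from typing import Tuple


def precision_recall(true_pos: int, false_pos: int, false_neg: int) -> Tuple[float, float]:
    """Return (precision, recall) for a set of variant calls.

    precision = TP / (TP + FP): fraction of reported calls that are correct.
    recall    = TP / (TP + FN): fraction of true variants that were reported.
    """
    called = true_pos + false_pos   # everything the caller reported
    truth = true_pos + false_neg    # everything present in the benchmark
    precision = true_pos / called if called else 0.0
    recall = true_pos / truth if truth else 0.0
    return precision, recall


# Hypothetical counts, chosen only to illustrate the arithmetic.
p, r = precision_recall(true_pos=9991, false_pos=9, false_neg=9)
print(f"precision={p:.4f} recall={r:.4f}")
```

Reporting both rates matters because a caller can trade one for the other: reporting fewer calls tends to raise precision and lower recall.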