Menu
September 22, 2019

Report from the Killer-cell Immunoglobulin-like Receptors (KIR) component of the 17th International HLA and Immunogenetics Workshop.

The goals of the KIR component of the 17th International HLA and Immunogenetics Workshop (IHIW) were to encourage and educate researchers to begin analyzing KIR at allelic resolution, and to survey the nature and extent of KIR allelic diversity across human populations. To represent worldwide diversity, we analyzed 1269 individuals from ten populations, focusing on the most polymorphic KIR genes, which express receptors having three immunoglobulin (Ig)-like domains (KIR3DL1/S1, KIR3DL2 and KIR3DL3). We identified 13 novel alleles of KIR3DL1/S1, 13 of KIR3DL2 and 18 of KIR3DL3. Previously identified alleles, corresponding to 33 alleles of KIR3DL1/S1, 38 of KIR3DL2, and 43 of KIR3DL3, represented over 90% of the observed allele frequencies for these genes. In total we observed 37 KIR3DL1/S1 allotypes, 40 for KIR3DL2 and 44 for KIR3DL3. As KIR allotype diversity can affect NK cell function, this demonstrates potential for high functional diversity worldwide. Allelic variation further diversifies KIR haplotypes. We determined KIR3DL3?~?KIR3DL1/S1?~?KIR3DL2 haplotypes from five of the studied populations, and observed multiple population-specific haplotypes in each. This included 234 distinct haplotypes in European Americans, 191 in Ugandans, 35 in Papuans, 95 in Egyptians and 86 in Spanish populations. For another 35 populations, encompassing 642,105 individuals we focused on KIR3DL2 and identified another 375 novel alleles, with approximately half of them observed in more than one individual. The KIR allelic level data gathered from this project represents the most comprehensive summary of global KIR allelic diversity to date, and continued analysis will improve understanding of KIR allelic polymorphism in global populations. Further, the wealth of new data gathered in the course of this workshop component highlights the value of collaborative, community-based efforts in immunogenetics research, exemplified by the IHIW.Copyright © 2018. Published by Elsevier Inc.


September 22, 2019

Biparental Inheritance of Mitochondrial DNA in Humans.

Although there has been considerable debate about whether paternal mitochondrial DNA (mtDNA) transmission may coexist with maternal transmission of mtDNA, it is generally believed that mitochondria and mtDNA are exclusively maternally inherited in humans. Here, we identified three unrelated multigeneration families with a high level of mtDNA heteroplasmy (ranging from 24 to 76%) in a total of 17 individuals. Heteroplasmy of mtDNA was independently examined by high-depth whole mtDNA sequencing analysis in our research laboratory and in two Clinical Laboratory Improvement Amendments and College of American Pathologists-accredited laboratories using multiple approaches. A comprehensive exploration of mtDNA segregation in these families shows biparental mtDNA transmission with an autosomal dominantlike inheritance mode. Our results suggest that, although the central dogma of maternal inheritance of mtDNA remains valid, there are some exceptional cases where paternal mtDNA could be passed to the offspring. Elucidating the molecular mechanism for this unusual mode of inheritance will provide new insights into how mtDNA is passed on from parent to offspring and may even lead to the development of new avenues for the therapeutic treatment for pathogenic mtDNA transmission.


September 22, 2019

Long-read sequencing technology indicates genome-wide effects of non-B DNA on polymerization speed and error rate.

DNA conformation may deviate from the classical B-form in ~13% of the human genome. Non-B DNA regulates many cellular processes; however, its effects on DNA polymerization speed and accuracy have not been investigated genome-wide. Such an inquiry is critical for understanding neurological diseases and cancer genome instability. Here, we present the first simultaneous examination of DNA polymerization kinetics and errors in the human genome sequenced with Single-Molecule Real-Time (SMRT) technology. We show that polymerization speed differs between non-B and B-DNA: It decelerates at G-quadruplexes and fluctuates periodically at disease-causing tandem repeats. Analyzing polymerization kinetics profiles, we predict and validate experimentally non-B DNA formation for a novel motif. We demonstrate that several non-B motifs affect sequencing errors (e.g., G-quadruplexes increase error rates), and that sequencing errors are positively associated with polymerase slowdown. Finally, we show that highly divergent G4 motifs have pronounced polymerization slowdown and high sequencing error rates, suggesting similar mechanisms for sequencing errors and germline mutations.© 2018 Guiblet et al.; Published by Cold Spring Harbor Laboratory Press.


September 22, 2019

Detection and visualization of complex structural variants from long reads.

With applications in cancer, drug metabolism, and disease etiology, understanding structural variation in the human genome is critical in advancing the thrusts of individualized medicine. However, structural variants (SVs) remain challenging to detect with high sensitivity using short read sequencing technologies. This problem is exacerbated when considering complex SVs comprised of multiple overlapping or nested rearrangements. Longer reads, such as those from Pacific Biosciences platforms, often span multiple breakpoints of such events, and thus provide a way to unravel small-scale complexities in SVs with higher confidence.We present CORGi (COmplex Rearrangement detection with Graph-search), a method for the detection and visualization of complex local genomic rearrangements. This method leverages the ability of long reads to span multiple breakpoints to untangle SVs that appear very complicated with respect to a reference genome. We validated our approach against both simulated long reads, and real data from two long read sequencing technologies. We demonstrate the ability of our method to identify breakpoints inserted in synthetic data with high accuracy, and the ability to detect and plot SVs from NA12878 germline, achieving 88.4% concordance between the two sets of sequence data. The patterns of complexity we find in many NA12878 SVs match known mechanisms associated with DNA replication and structural variant formation, and highlight the ability of our method to automatically label complex SVs with an intuitive combination of adjacent or overlapping reference transformations.CORGi is a method for interrogating genomic regions suspected to contain local rearrangements using long reads. Using pairwise alignments and graph search CORGi produces labels and visualizations for local SVs of arbitrary complexity.


September 22, 2019

MadID, a versatile approach to map protein-DNA interactions, highlights telomere-nuclear envelope contact sites in human cells.

Mapping the binding sites of DNA- or chromatin-interacting proteins is essential to understanding biological processes. DNA adenine methyltransferase identification (DamID) has emerged as a comprehensive method to map genome-wide occupancy of proteins of interest. A caveat of DamID is the specificity of Dam methyltransferase for GATC motifs that are not homogenously distributed in the genome. Here, we developed an optimized method named MadID, using proximity labeling of DNA by the methyltransferase M.EcoGII. M.EcoGII mediates N6-adenosine methylation in any DNA sequence context, resulting in deeper and unbiased coverage of the genome. We demonstrate, using m6A-specific immunoprecipitation and deep sequencing, that MadID is a robust method to identify protein-DNA interactions at the whole-genome level. Using MadID, we revealed contact sites between human telomeres, repetitive sequences devoid of GATC sites, and the nuclear envelope. Overall, MadID opens the way to identification of binding sites in genomic regions that were largely inaccessible. Copyright © 2018 The Authors. Published by Elsevier Inc. All rights reserved.


September 22, 2019

Trophoblast organoids as a model for maternal-fetal interactions during human placentation.

The placenta is the extraembryonic organ that supports the fetus during intrauterine life. Although placental dysfunction results in major disorders of pregnancy with immediate and lifelong consequences for the mother and child, our knowledge of the human placenta is limited owing to a lack of functional experimental models1. After implantation, the trophectoderm of the blastocyst rapidly proliferates and generates the trophoblast, the unique cell type of the placenta. In vivo, proliferative villous cytotrophoblast cells differentiate into two main sub-populations: syncytiotrophoblast, the multinucleated epithelium of the villi responsible for nutrient exchange and hormone production, and extravillous trophoblast cells, which anchor the placenta to the maternal decidua and transform the maternal spiral arteries2. Here we describe the generation of long-term, genetically stable organoid cultures of trophoblast that can differentiate into both syncytiotrophoblast and extravillous trophoblast. We used human leukocyte antigen (HLA) typing to confirm that the organoids were derived from the fetus, and verified their identities against four trophoblast-specific criteria3. The cultures organize into villous-like structures, and we detected the secretion of placental-specific peptides and hormones, including human chorionic gonadotropin (hCG), growth differentiation factor 15 (GDF15) and pregnancy-specific glycoprotein (PSG) by mass spectrometry. The organoids also differentiate into HLA-G+ extravillous trophoblast cells, which vigorously invade in three-dimensional cultures. Analysis of the methylome reveals that the organoids closely resemble normal first trimester placentas. This organoid model will be transformative for studying human placental development and for investigating trophoblast interactions with the local and systemic maternal environment.


September 22, 2019

Relationship between Alzheimer’s disease-associated SNPs within the CLU gene, local DNA methylation and episodic verbal memory in healthy and schizophrenia subjects.

Genetic variation may impact on local DNA methylation patterns. Therefore, information about allele-specific DNA methylation (ASM) within disease-related loci has been proposed to be useful for the interpretation of GWAS results. To explore mechanisms that may underlie associations between Alzheimer’s disease (AD) and schizophrenia risk CLU gene and verbal memory, one of the most affected cognitive domains in both conditions, we studied DNA methylation in a region between AD-associated SNPs rs9331888 and rs9331896 in 72 healthy individuals and 73 schizophrenia patients. Using single-molecule real-time bisulfite sequencing we assessed the haplotype-dependent ASM in this region. We then investigated whether its methylation could influence episodic verbal memory measured with the Rey Auditory Verbal Learning Test in these two cohorts. The region showed a complex methylation pattern, which was similar in healthy and schizophrenia individuals and unrelated to haplotypes. The pattern predicted memory scores in controls. The results suggest that epigenetic modifications within the CLU locus may play a role in memory variation, independent of ASM. Copyright © 2018 Elsevier B.V. All rights reserved.


September 22, 2019

T118M Variant of PMP22 Gene Presents with Painful Peripheral Neuropathy and Varying Charcot-Marie-Tooth Features: A Case Series and Review of the Literature.

The clinical effect of T118M variant of the PMP22 gene has been controversial. Several studies have suggested that it may be autosomal recessive, partial loss of function, or a benign variant. Here we report three cases in further support that the T118M variant of the PMP22 gene is a partial loss of function variant. These three unrelated cases were heterozygotes with the T118M variant of the PMP22 gene. All three cases presented with painful peripheral neuropathy and varying degrees of Charcot-Marie-Tooth exam features. Electrophysiological studies revealed polyneuropathy with axonal and demyelinating features in one case, but there were minimal electrophysiological changes in the other two cases. We propose that the T118M variant can cause painful peripheral neuropathy, which may be an underrecognized feature of this variant.


September 22, 2019

Integrative haplotype estimation with sub-linear complexity

The number of human genomes being genotyped or sequenced increases exponentially and efficient haplotype estimation methods able to handle this amount of data are now required. Here, we present a new method, SHAPEIT4, which substantially improves upon other methods to process large genotype and high coverage sequencing datasets. It notably exhibits sub-linear scaling with sample size, provides highly accurate haplotypes and allows integrating external phasing information such as large reference panels of haplotypes, collections of pre-phased variants and long sequencing reads. We provide SHAPET4 in an open source format on https://odelaneau.github.io/shapeit4/ and demonstrate its performance in terms of accuracy and running times on two gold standard datasets: the UK Biobank data and the Genome In A Bottle.


September 22, 2019

CompStor Novos: a low cost yet fast assembly-based variant calling for personal genomes

Application of assembly methods for personal genome analysis from next generation sequencing data has been limited by the requirement for an expensive supercomputer hardware or long computation times when using ordinary resources. We describe CompStor Novos, achieving supercomputer-class performance in de novo assembly computation time on standard server hardware, based on a tiered-memory algorithm. Run on commercial off-the-shelf servers, Novos assembly is more precise and 10-20 times faster than that of existing assembly algorithms. Furthermore, we integrated Novos into a variant calling pipeline and demonstrate that both compute times and precision of calling point variants and indels compare well with standard alignment-based pipelines. Additionally, assembly eliminates bias in the estimation of allele frequency for indels and naturally enables discovery of breakpoints for structural variants with base pair resolution. Thus, Novos bridges the gap between alignment-based and assembly-based genome analyses. Extension and adaption of its underlying algorithm will help quickly and fully harvest information in sequencing reads for personal genome reconstruction.


September 21, 2019

Decreased fitness and virulence in ST10 Escherichia coli harboring blaNDM-5 and mcr-1 against a ST4981 strain with blaNDM-5.

Although coexistence of blaNDM-5 and mcr-1 in Escherichia coli has been reported, little is known about the fitness and virulence of such strains. Three carbapenem-resistant Escherichia coli (GZ1, GZ2, and GZ3) successively isolated from one patient in 2015 were investigated for microbiological fitness and virulence. GZ1 and GZ2 were also resistant to colistin. To verify the association between plasmids and fitness, growth kinetics of the transconjugants were performed. We also analyzed genomic sequences of GZ2 and GZ3 using PacBio sequencing. GZ1 and GZ2 (ST10) co-harbored blaNDM-5 and mcr-1, while GZ3 (ST4981) carried only blaNDM-5. GZ3 demonstrated significantly more rapid growth (P < 0.001) and overgrew GZ2 with a competitive index of 1.0157 (4 h) and 2.5207 (24 h). Increased resistance to serum killing and mice mortality was also identified in GZ3. While GZ2 had four plasmids (IncI2, IncX3, IncHI2, IncFII), GZ3 possessed one plasmid (IncFII). The genetic contexts of blaNDM-5 in GZ2 and GZ3 were identical but inserted into different backbones, IncX3 (102,512 bp) and IncFII (91,451 bp), respectively. The growth was not statistically different between the transconjugants with mcr-1 or blaNDM-5 plasmid and recipient (P = 0.6238). Whole genome sequence analysis revealed that 28 virulence genes were specific to GZ3, potentially contributing to increased virulence of GZ3. Decreased fitness and virulence in a mcr-1 and blaNDM-5 co-harboring ST10 E. coli was found alongside a ST4981 strain with only blaNDM-5. Acquisition of mcr-1 or blaNDM-5 plasmid did not lead to considerable fitness costs, indicating the potential for dissemination of mcr-1 and blaNDM-5 in Enterobacteriaceae.


September 21, 2019

Assembling large genomes with single-molecule sequencing and locality-sensitive hashing.

Long-read, single-molecule real-time (SMRT) sequencing is routinely used to finish microbial genomes, but available assembly methods have not scaled well to larger genomes. We introduce the MinHash Alignment Process (MHAP) for overlapping noisy, long reads using probabilistic, locality-sensitive hashing. Integrating MHAP with the Celera Assembler enabled reference-grade de novo assemblies of Saccharomyces cerevisiae, Arabidopsis thaliana, Drosophila melanogaster and a human hydatidiform mole cell line (CHM1) from SMRT sequencing. The resulting assemblies are highly continuous, include fully resolved chromosome arms and close persistent gaps in these reference genomes. Our assembly of D. melanogaster revealed previously unknown heterochromatic and telomeric transition sequences, and we assembled low-complexity sequences from CHM1 that fill gaps in the human GRCh38 reference. Using MHAP and the Celera Assembler, single-molecule sequencing can produce de novo near-complete eukaryotic assemblies that are 99.99% accurate when compared with available reference genomes.


September 21, 2019

Discovery and genotyping of structural variation from long-read haploid genome sequence data.

In an effort to more fully understand the full spectrum of human genetic variation, we generated deep single-molecule, real-time (SMRT) sequencing data from two haploid human genomes. By using an assembly-based approach (SMRT-SV), we systematically assessed each genome independently for structural variants (SVs) and indels resolving the sequence structure of 461,553 genetic variants from 2 bp to 28 kbp in length. We find that >89% of these variants have been missed as part of analysis of the 1000 Genomes Project even after adjusting for more common variants (MAF > 1%). We estimate that this theoretical human diploid differs by as much as ~16 Mbp with respect to the human reference, with long-read sequencing data providing a fivefold increase in sensitivity for genetic variants ranging in size from 7 bp to 1 kbp compared with short-read sequence data. Although a large fraction of genetic variants were not detected by short-read approaches, once the alternate allele is sequence-resolved, we show that 61% of SVs can be genotyped in short-read sequence data sets with high accuracy. Uncoupling discovery from genotyping thus allows for the majority of this missed common variation to be genotyped in the human population. Interestingly, when we repeat SV detection on a pseudodiploid genome constructed in silico by merging the two haploids, we find that ~59% of the heterozygous SVs are no longer detected by SMRT-SV. These results indicate that haploid resolution of long-read sequencing data will significantly increase sensitivity of SV detection.© 2017 Huddleston et al.; Published by Cold Spring Harbor Laboratory Press.


September 21, 2019

Identification of a novel RASD1 somatic mutation in a USP8-mutated corticotroph adenoma.

Cushing’s disease (CD) is caused by pituitary corticotroph adenomas that secrete excess adrenocorticotropic hormone (ACTH). In these tumors, somatic mutations in the gene USP8 have been identified as recurrent and pathogenic and are the sole known molecular driver for CD. Although other somatic mutations were reported in these studies, their contribution to the pathogenesis of CD remains unexplored. No molecular drivers have been established for a large proportion of CD cases and tumor heterogeneity has not yet been investigated using genomics methods. Also, even in USP8-mutant tumors, a possibility may exist of additional contributing mutations, following a paradigm from other neoplasm types where multiple somatic alterations contribute to neoplastic transformation. The current study utilizes whole-exome discovery sequencing on the Illumina platform, followed by targeted amplicon-validation sequencing on the Pacific Biosciences platform, to interrogate the somatic mutation landscape in a corticotroph adenoma resected from a CD patient. In this USP8-mutated tumor, we identified an interesting somatic mutation in the gene RASD1, which is a component of the corticotropin-releasing hormone receptor signaling system. This finding may provide insight into a novel mechanism involving loss of feedback control to the corticotropin-releasing hormone receptor and subsequent deregulation of ACTH production in corticotroph tumors.


September 21, 2019

Long-read genome sequencing identifies causal structural variation in a Mendelian disease.

PurposeCurrent clinical genomics assays primarily utilize short-read sequencing (SRS), but SRS has limited ability to evaluate repetitive regions and structural variants. Long-read sequencing (LRS) has complementary strengths, and we aimed to determine whether LRS could offer a means to identify overlooked genetic variation in patients undiagnosed by SRS.MethodsWe performed low-coverage genome LRS to identify structural variants in a patient who presented with multiple neoplasia and cardiac myxomata, in whom the results of targeted clinical testing and genome SRS were negative.ResultsThis LRS approach yielded 6,971 deletions and 6,821 insertions?>?50?bp. Filtering for variants that are absent in an unrelated control and overlap a disease gene coding exon identified three deletions and three insertions. One of these, a heterozygous 2,184?bp deletion, overlaps the first coding exon of PRKAR1A, which is implicated in autosomal dominant Carney complex. RNA sequencing demonstrated decreased PRKAR1A expression. The deletion was classified as pathogenic based on guidelines for interpretation of sequence variants.ConclusionThis first successful application of genome LRS to identify a pathogenic variant in a patient suggests that LRS has significant potential for the identification of disease-causing structural variation. Larger studies will ultimately be required to evaluate the potential clinical utility of LRS.


Talk with an expert

If you have a question, need to check the status of an order, or are interested in purchasing an instrument, we're here to help.