Menu
September 22, 2019  |  

A protein-truncating HSD17B13 variant and protection from chronic liver disease.

Elucidation of the genetic factors underlying chronic liver disease may reveal new therapeutic targets.We used exome sequence data and electronic health records from 46,544 participants in the DiscovEHR human genetics study to identify genetic variants associated with serum levels of alanine aminotransferase (ALT) and aspartate aminotransferase (AST). Variants that were replicated in three additional cohorts (12,527 persons) were evaluated for association with clinical diagnoses of chronic liver disease in DiscovEHR study participants and two independent cohorts (total of 37,173 persons) and with histopathological severity of liver disease in 2391 human liver samples.A splice variant (rs72613567:TA) in HSD17B13, encoding the hepatic lipid droplet protein hydroxysteroid 17-beta dehydrogenase 13, was associated with reduced levels of ALT (P=4.2×10-12) and AST (P=6.2×10-10). Among DiscovEHR study participants, this variant was associated with a reduced risk of alcoholic liver disease (by 42% [95% confidence interval CI, 20 to 58] among heterozygotes and by 53% [95% CI, 3 to 77] among homozygotes), nonalcoholic liver disease (by 17% [95% CI, 8 to 25] among heterozygotes and by 30% [95% CI, 13 to 43] among homozygotes), alcoholic cirrhosis (by 42% [95% CI, 14 to 61] among heterozygotes and by 73% [95% CI, 15 to 91] among homozygotes), and nonalcoholic cirrhosis (by 26% [95% CI, 7 to 40] among heterozygotes and by 49% [95% CI, 15 to 69] among homozygotes). Associations were confirmed in two independent cohorts. The rs72613567:TA variant was associated with a reduced risk of nonalcoholic steatohepatitis, but not steatosis, in human liver samples. The rs72613567:TA variant mitigated liver injury associated with the risk-increasing PNPLA3 p.I148M allele and resulted in an unstable and truncated protein with reduced enzymatic activity.A loss-of-function variant in HSD17B13 was associated with a reduced risk of chronic liver disease and of progression from steatosis to steatohepatitis. (Funded by Regeneron Pharmaceuticals and others.).


September 22, 2019  |  

HIV-1 interacts with human endogenous retrovirus K (HML-2) envelopes derived from human primary lymphocytes.

Human endogenous retroviruses (HERVs) are viruses that have colonized the germ line and spread through vertical passage. Only the more recently acquired HERVs, such as the HERV-K (HML-2) group, maintain coding open reading frames. Expression of HERV-Ks has been linked to different pathological conditions, including HIV infection, but our knowledge on which specific HERV-Ks are expressed in primary lymphocytes currently is very limited. To identify the most expressed HERV-Ks in an unbiased manner, we analyzed their expression patterns in peripheral blood lymphocytes using Pacific Biosciences (PacBio) single-molecule real-time (SMRT) sequencing. We observe that three HERV-Ks (KII, K102, and K18) constitute over 90% of the total HERV-K expression in primary human lymphocytes of five different donors. We also show experimentally that two of these HERV-K env sequences (K18 and K102) retain their ability to produce full-length and posttranslationally processed envelope proteins in cell culture. We show that HERV-K18 Env can be incorporated into HIV-1 but not simian immunodeficiency virus (SIV) particles. Moreover, HERV-K18 Env incorporation into HIV-1 virions is dependent on HIV-1 matrix. Taken together, we generated high-resolution HERV-K expression profiles specific for activated human lymphocytes. We found that one of the most abundantly expressed HERV-K envelopes not only makes a full-length protein but also specifically interacts with HIV-1. Our findings raise the possibility that these endogenous retroviral Env proteins could directly influence HIV-1 replication.Here, we report the HERV-K expression profile of primary lymphocytes from 5 different healthy donors. We used a novel deep-sequencing technology (PacBio SMRT) that produces the long reads necessary to discriminate the complexity of HERV-K expression. We find that primary lymphocytes express up to 32 different HERV-K envelopes, and that at least two of the most expressed Env proteins retain their ability to make a protein. Importantly, one of them, the envelope glycoprotein of HERV-K18, is incorporated into HIV-1 in an HIV matrix-specific fashion. The ramifications of such interactions are discussed, as the possibility of HIV-1 target tissue broadening and immune evasion are considered.


September 22, 2019  |  

Improving eukaryotic genome annotation using single molecule mRNA sequencing.

The advantages of Pacific Biosciences (PacBio) single-molecule real-time (SMRT) technology include long reads, low systematic bias, and high consensus read accuracy. Here we use these attributes to improve on the genome annotation of the parasitic hookworm Ancylostoma ceylanicum using PacBio RNA-Seq.We sequenced 192,888 circular consensus sequences (CCS) derived from cDNAs generated using the CloneTech SMARTer system. These SMARTer-SMRT libraries were normalized and size-selected providing a robust population of expressed structural genes for subsequent genome annotation. We demonstrate PacBio mRNA sequences based genome annotation improvement, compared to genome annotation using conventional sequencing-by-synthesis alone, by identifying 1609 (9.2%) new genes, extended the length of 3965 (26.7%) genes and increased the total genomic exon length by 1.9 Mb (12.4%). Non-coding sequence representation (primarily from UTRs based on dT reverse transcription priming) was particularly improved, increasing in total length by fifteen-fold, by increasing both the length and number of UTR exons. In addition, the UTR data provided by these CCS allowed for the identification of a novel SL2 splice leader sequence for A. ceylanicum and an increase in the number and proportion of functionally annotated genes. RNA-seq data also confirmed some of the newly annotated genes and gene features.Overall, PacBio data has supported a significant improvement in gene annotation in this genome, and is an appealing alternative or complementary technique for genome annotation to the other transcript sequencing technologies.


September 22, 2019  |  

Two novel lncRNAs discovered in human mitochondrial DNA using PacBio full-length transcriptome data.

In this study, we established a general framework to use PacBio full-length transcriptome sequencing for the investigation of mitochondrial RNAs. As a result, we produced the first full-length human mitochondrial transcriptome using public PacBio data and characterized the human mitochondrial genome with more comprehensive and accurate information. Other results included determination of the H-strand primary transcript, identification of the ND5/ND6AS/tRNAGluAS transcript, discovery of palindrome small RNAs (psRNAs) and construction of the “mitochondrial cleavage” model, etc. These results reported for the first time in this study fundamentally changed annotations of human mitochondrial genome and enriched knowledge in the field of animal mitochondrial studies. The most important finding was two novel long non-coding RNAs (lncRNAs) of MDL1 and MDL1AS exist ubiquitously in animal mitochondrial genomes. Copyright © 2017. Published by Elsevier B.V.


September 22, 2019  |  

RNAi-based treatment of chronically infected patients and chimpanzees reveals that integrated hepatitis B virus DNA is a source of HBsAg.

Chronic hepatitis B virus (HBV) infection is a major health concern worldwide, frequently leading to liver cirrhosis, liver failure, and hepatocellular carcinoma. Evidence suggests that high viral antigen load may play a role in chronicity. Production of viral proteins is thought to depend on transcription of viral covalently closed circular DNA (cccDNA). In a human clinical trial with an RNA interference (RNAi)-based therapeutic targeting HBV transcripts, ARC-520, HBV S antigen (HBsAg) was strongly reduced in treatment-naïve patients positive for HBV e antigen (HBeAg) but was reduced significantly less in patients who were HBeAg-negative or had received long-term therapy with nucleos(t)ide viral replication inhibitors (NUCs). HBeAg positivity is associated with greater disease risk that may be moderately reduced upon HBeAg loss. The molecular basis for this unexpected differential response was investigated in chimpanzees chronically infected with HBV. Several lines of evidence demonstrated that HBsAg was expressed not only from the episomal cccDNA minichromosome but also from transcripts arising from HBV DNA integrated into the host genome, which was the dominant source in HBeAg-negative chimpanzees. Many of the integrants detected in chimpanzees lacked target sites for the small interfering RNAs in ARC-520, explaining the reduced response in HBeAg-negative chimpanzees and, by extension, in HBeAg-negative patients. Our results uncover a heretofore underrecognized source of HBsAg that may represent a strategy adopted by HBV to maintain chronicity in the presence of host immunosurveillance. These results could alter trial design and endpoint expectations of new therapies for chronic HBV. Copyright © 2017 The Authors, some rights reserved; exclusive licensee American Association for the Advancement of Science. No claim to original U.S. Government Works.


September 22, 2019  |  

Improved full-length killer cell immunoglobulin-like receptor transcript discovery in Mauritian cynomolgus macaques.

Killer cell immunoglobulin-like receptors (KIRs) modulate disease progression of pathogens including HIV, malaria, and hepatitis C. Cynomolgus and rhesus macaques are widely used as nonhuman primate models to study human pathogens, and so, considerable effort has been put into characterizing their KIR genetics. However, previous studies have relied on cDNA cloning and Sanger sequencing that lack the throughput of current sequencing platforms. In this study, we present a high throughput, full-length allele discovery method utilizing Pacific Biosciences circular consensus sequencing (CCS). We also describe a new approach to Macaque Exome Sequencing (MES) and the development of the Rhexome1.0, an adapted target capture reagent that includes macaque-specific capture probe sets. By using sequence reads generated by whole genome sequencing (WGS) and MES to inform primer design, we were able to increase the sensitivity of KIR allele discovery. We demonstrate this increased sensitivity by defining nine novel alleles within a cohort of Mauritian cynomolgus macaques (MCM), a geographically isolated population with restricted KIR genetics that was thought to be completely characterized. Finally, we describe an approach to genotyping KIRs directly from sequence reads generated using WGS/MES reads. The findings presented here expand our understanding of KIR genetics in MCM by associating new genes with all eight KIR haplotypes and demonstrating the existence of at least one KIR3DS gene associated with every haplotype.


September 22, 2019  |  

Long reads: their purpose and place.

In recent years long-read technologies have moved from being a niche and specialist field to a point of relative maturity likely to feature frequently in the genomic landscape. Analogous to next generation sequencing, the cost of sequencing using long-read technologies has materially dropped whilst the instrument throughput continues to increase. Together these changes present the prospect of sequencing large numbers of individuals with the aim of fully characterizing genomes at high resolution. In this article, we will endeavour to present an introduction to long-read technologies showing: what long reads are; how they are distinct from short reads; why long reads are useful and how they are being used. We will highlight the recent developments in this field, and the applications and potential of these technologies in medical research, and clinical diagnostics and therapeutics.


September 22, 2019  |  

Fluorescently-tagged human eIF3 for single-molecule spectroscopy.

Human translation initiation relies on the combined activities of numerous ribosome-associated eukaryotic initiation factors (eIFs). The largest factor, eIF3, is an ~800 kDa multiprotein complex that orchestrates a network of interactions with the small 40S ribosomal subunit, other eIFs, and mRNA, while participating in nearly every step of initiation. How these interactions take place during the time course of translation initiation remains unclear. Here, we describe a method for the expression and affinity purification of a fluorescently-tagged eIF3 from human cells. The tagged eIF3 dodecamer is structurally intact, functions in cell-based assays, and interacts with the HCV IRES mRNA and the 40S-IRES complex in vitro. By tracking the binding of single eIF3 molecules to the HCV IRES RNA with a zero-mode waveguides-based instrument, we show that eIF3 samples both wild-type IRES and an IRES that lacks the eIF3-binding region, and that the high-affinity eIF3-IRES interaction is largely determined by slow dissociation kinetics. The application of single-molecule methods to more complex systems involving eIF3 may unveil dynamics underlying mRNA selection and ribosome loading during human translation initiation.© The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.


September 22, 2019  |  

CliqueSNV: Scalable reconstruction of intra-host viral populations from NGS reads

Highly mutable RNA viruses such as influenza A virus, human immunodeficiency virus and hepatitis C virus exist in infected hosts as highly heterogeneous populations of closely related genomic variants. The presence of low-frequency variants with few mutations with respect to major strains may result in an immune escape, emergence of drug resistance, and an increase of virulence and infectivity. Next-generation sequencing technologies permit detection of sample intra-host viral population at extremely great depth, thus providing an opportunity to access low-frequency variants. Long read lengths offered by single-molecule sequencing technologies allow all viral variants to be sequenced in a single pass. However, high sequencing error rates limit the ability to study heterogeneous viral populations composed of rare, closely related variants. In this article, we present CliqueSNV, a novel reference-based method for reconstruction of viral variants from NGS data. It efficiently constructs an allele graph based on linkage between single nucleotide variations and identifies true viral variants by merging cliques of that graph using combinatorial optimization techniques. The new method outperforms existing methods in both accuracy and running time on experimental and simulated NGS data for titrated levels of known viral variants. For PacBio reads, it accurately reconstructs variants with frequency as low as 0.1%. For Illumina reads, it fully reconstructs main variants. The open source implementation of CliqueSNV is freely available for download at https://github.com/vyacheslav-tsivina/CliqueSNV


September 22, 2019  |  

Epigenetic landscape influences the liver cancer genome architecture.

The accumulations of different types of genetic alterations such as nucleotide substitutions, structural rearrangements and viral genome integrations and epigenetic alterations contribute to carcinogenesis. Here, we report correlation between the occurrence of epigenetic features and genetic aberrations by whole-genome bisulfite, whole-genome shotgun, long-read, and virus capture sequencing of 373 liver cancers. Somatic substitutions and rearrangement breakpoints are enriched in tumor-specific hypo-methylated regions with inactive chromatin marks and actively transcribed highly methylated regions in the cancer genome. Individual mutation signatures depend on chromatin status, especially, signatures with a higher transcriptional strand bias occur within active chromatic areas. Hepatitis B virus (HBV) integration sites are frequently detected within inactive chromatin regions in cancer cells, as a consequence of negative selection for integrations in active chromatin regions. Ultra-high structural instability and preserved unmethylation of integrated HBV genomes are observed. We conclude that both precancerous and somatic epigenetic features contribute to the cancer genome architecture.


September 22, 2019  |  

The energy-coupling factor transporter module EcfAA’T, a novel candidate for the genetic basis of fatty acid-auxotrophic small-colony variants of Staphylococcus aureus.

Staphylococcal small-colony variants (SCVs) are invasive and persistent due to their ability to thrive intracellularly and to evade the host immune response. Thus, the course of infections due to this phenotype is often chronic, relapsing, and therapy-refractory. In order to improve treatment of patients suffering from SCV-associated infections, it is of major interest to understand triggers for the development of this phenotype, in particular for strains naturally occurring in clinical settings. Within this study, we comprehensively characterized two different Staphylococcus aureus triplets each consisting of isogenic strains comprising (i) clinically derived SCV phenotypes with auxotrophy for unsaturated fatty acids, (ii) the corresponding wild-types (WTs), and (iii) spontaneous in vitro revertants displaying the normal phenotype (REVs). Comparison of whole genomes revealed that clinical SCV isolates were closely related to their corresponding WTs and REVs showing only seven to eight alterations per genome triplet. However, both SCVs carried a mutation within the energy-coupling factor (ECF) transporter-encoding ecf module (EcfAA’T) resulting in truncated genes. In both cases, these mutations were shown to be naturally restored in the respective REVs. Since ECF transporters are supposed to be essential for optimal bacterial growth, their dysfunction might constitute another mechanism for the formation of naturally occurring SCVs. Another three triplets analyzed revealed neither mutations in the EcfAA’T nor in other FASII-related genes underlining the high diversity of mechanisms leading to the fatty acid-dependent phenotype. This is the first report on the ECF transporter as genetic basis of fatty acid-auxotrophic staphylococcal SCVs.


September 22, 2019  |  

Hepacivirus A infection in horses defines distinct envelope hypervariable regions and elucidates potential roles of viral strain and adaptive immune status in determining envelope diversity and infection outcome.

Hepacivirus A (also known as nonprimate hepacivirus and equine hepacivirus) is a hepatotropic virus that can cause both transient and persistent infections in horses. The evolution of intrahost viral populations (quasispecies) has not been studied in detail for hepacivirus A, and its roles in immune evasion and persistence are unknown. To address these knowledge gaps, we first evaluated the envelope gene (E1 and E2) diversity of two different hepacivirus A strains (WSU and CU) in longitudinal blood samples from experimentally infected adult horses, juvenile horses (foals), and foals with severe combined immunodeficiency (SCID). Persistent infection with the WSU strain was associated with significantly greater quasispecies diversity than that observed in horses who spontaneously cleared infection (P = 0.0002) or in SCID foals (P < 0.0001). In contrast, the CU strain was able to persist despite significantly lower (P < 0.0001) and relatively static envelope diversity. These findings indicate that envelope diversity is a poor predictor of hepacivirus A infection outcomes and could be dependent on strain-specific factors. Next, entropy analysis was performed on all E1/E2 genes entered into GenBank. This analysis defined three novel hypervariable regions (HVRs) in E2, at residues 391 to 402 (HVR1), 450 to 461 (HVR2), and 550 to 562 (HVR3). For the experimentally infected horses, entropy analysis focusing on the HVRs demonstrated that these regions were under increased selective pressure during persistent infection. Increased diversity in the HVRs was also temporally associated with seroconversion in some horses, suggesting that these regions may be targets of neutralizing antibody and may play a role in immune evasion.IMPORTANCE Hepacivirus C (hepatitis C virus) is estimated to infect 150 million people worldwide and is a leading cause of cirrhosis and hepatocellular carcinoma. In contrast, its closest relative, hepacivirus A, causes relatively mild disease in horses and is frequently cleared. The relationship between quasispecies evolution and infection outcome has not been explored for hepacivirus A. To address this knowledge gap, we examined envelope gene diversity in horses with resolving and persistent infections. Interestingly, two strain-specific patterns of quasispecies diversity emerged. Persistence of the WSU strain was associated with increased quasispecies diversity and the accumulation of amino acid changes within three novel hypervariable regions following seroconversion. These findings provided evidence that envelope gene mutation is influenced by adaptive immune pressure and may contribute to hepacivirus persistence. However, the CU strain persisted despite relative evolutionary stasis, suggesting that some hepacivirus strains may use alternative mechanisms to persist in the host. Copyright © 2018 American Society for Microbiology.


September 22, 2019  |  

Report from the Killer-cell Immunoglobulin-like Receptors (KIR) component of the 17th International HLA and Immunogenetics Workshop.

The goals of the KIR component of the 17th International HLA and Immunogenetics Workshop (IHIW) were to encourage and educate researchers to begin analyzing KIR at allelic resolution, and to survey the nature and extent of KIR allelic diversity across human populations. To represent worldwide diversity, we analyzed 1269 individuals from ten populations, focusing on the most polymorphic KIR genes, which express receptors having three immunoglobulin (Ig)-like domains (KIR3DL1/S1, KIR3DL2 and KIR3DL3). We identified 13 novel alleles of KIR3DL1/S1, 13 of KIR3DL2 and 18 of KIR3DL3. Previously identified alleles, corresponding to 33 alleles of KIR3DL1/S1, 38 of KIR3DL2, and 43 of KIR3DL3, represented over 90% of the observed allele frequencies for these genes. In total we observed 37 KIR3DL1/S1 allotypes, 40 for KIR3DL2 and 44 for KIR3DL3. As KIR allotype diversity can affect NK cell function, this demonstrates potential for high functional diversity worldwide. Allelic variation further diversifies KIR haplotypes. We determined KIR3DL3?~?KIR3DL1/S1?~?KIR3DL2 haplotypes from five of the studied populations, and observed multiple population-specific haplotypes in each. This included 234 distinct haplotypes in European Americans, 191 in Ugandans, 35 in Papuans, 95 in Egyptians and 86 in Spanish populations. For another 35 populations, encompassing 642,105 individuals we focused on KIR3DL2 and identified another 375 novel alleles, with approximately half of them observed in more than one individual. The KIR allelic level data gathered from this project represents the most comprehensive summary of global KIR allelic diversity to date, and continued analysis will improve understanding of KIR allelic polymorphism in global populations. Further, the wealth of new data gathered in the course of this workshop component highlights the value of collaborative, community-based efforts in immunogenetics research, exemplified by the IHIW.Copyright © 2018. Published by Elsevier Inc.


Talk with an expert

If you have a question, need to check the status of an order, or are interested in purchasing an instrument, we're here to help.