September 22, 2019

Predicting an HLA-DPB1 expression marker based on standard DPB1 genotyping: Linkage analysis of over 32,000 samples.

The risk of acute graft-versus-host disease (GvHD) after hematopoietic stem cell transplantation is increased with donor-recipient HLA-DPB1 allele mismatching. The single-nucleotide polymorphism (SNP) rs9277534 within the 3′ untranslated region (UTR) correlates with HLA-DPB1 allotype expression and serves as a marker for permissive HLA-DPB1 mismatches. Since rs9277534 is not routinely typed, we analyzed 32,681 samples of mostly European ancestry to investigate if the rs9277534 allele can be reliably imputed from standard DPB1 genotyping. We confirmed the previously-defined linkages between rs9277534 and 18 DPB1 alleles and established additional linkages for 46 DPB1 alleles. Based on these linkages, the rs9277534 allele could be predicted for 99.6% of the samples based on DPB1 genotypes (99.99% concordance). We demonstrate that 100% prediction accuracy could be achieved if the prediction utilized exon 3 sequence information. DPB1 genotyping based on exon 2 data alone allows no unambiguous rs9277534 allele prediction but was estimated to maintain 99% accuracy for samples of European descent. We conclude that DPB1 genotyping is sufficient to infer the DPB1 expression marker rs9277534 with high accuracy. This information could be used to select donors with permissive HLA-DPB1 mismatches without directly screening for rs9277534. Copyright © 2017 The Author(s). Published by Elsevier Inc. All rights reserved.
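The imputation described above amounts to a table lookup: each DPB1 allele with an established linkage maps to one rs9277534 allele (A or G), and a sample is predictable only if both of its DPB1 alleles are in the table. A minimal sketch follows, assuming a linkage table supplied by the caller. The function name and the allele names in `EXAMPLE_LINKAGE` are placeholders invented for illustration; they are not linkages reported by the study.

```python
def predict_rs9277534(genotype, linkage):
    """Predict the rs9277534 genotype from a pair of DPB1 alleles.

    genotype: iterable of two DPB1 allele names.
    linkage:  dict mapping a DPB1 allele name to its linked SNP allele.
    Returns a sorted tuple of SNP alleles, or None when any DPB1 allele
    has no established linkage (the sample cannot be predicted).
    """
    predicted = []
    for allele in genotype:
        snp_allele = linkage.get(allele)
        if snp_allele is None:
            return None
        predicted.append(snp_allele)
    return tuple(sorted(predicted))


# Placeholder table: allele names are hypothetical, NOT taken from the study.
EXAMPLE_LINKAGE = {
    "DPB1*AA:01": "A",
    "DPB1*BB:01": "G",
}

print(predict_rs9277534(("DPB1*AA:01", "DPB1*BB:01"), EXAMPLE_LINKAGE))
print(predict_rs9277534(("DPB1*AA:01", "DPB1*ZZ:99"), EXAMPLE_LINKAGE))
```

Returning `None` rather than guessing mirrors the abstract's distinction between samples that could be predicted and those that could not.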


September 22, 2019

Sooty mangabey genome sequence provides insight into AIDS resistance in a natural SIV host.

In contrast to infections with human immunodeficiency virus (HIV) in humans and simian immunodeficiency virus (SIV) in macaques, SIV infection of a natural host, sooty mangabeys (Cercocebus atys), is non-pathogenic despite high viraemia. Here we sequenced and assembled the genome of a captive sooty mangabey. We conducted genome-wide comparative analyses of transcript assemblies from C. atys and AIDS-susceptible species, such as humans and macaques, to identify candidates for host genetic factors that influence susceptibility. We identified several immune-related genes in the genome of C. atys that show substantial sequence divergence from macaques or humans. One of these sequence divergences, a C-terminal frameshift in the toll-like receptor-4 (TLR4) gene of C. atys, is associated with a blunted in vitro response to TLR-4 ligands. In addition, we found a major structural change in exons 3-4 of the immune-regulatory protein intercellular adhesion molecule 2 (ICAM-2); expression of this variant leads to reduced cell surface expression of ICAM-2. These data provide a resource for comparative genomic studies of HIV and/or SIV pathogenesis and may help to elucidate the mechanisms by which SIV-infected sooty mangabeys avoid AIDS.


September 22, 2019

Characterization of the SN35N strain-specific exopolysaccharide encoded in the whole circular genome of a plant-derived Lactobacillus plantarum.

Lactobacillus plantarum SN35N, which was previously isolated from pear, secretes exopolysaccharide (EPS). The aim of the present study is to characterize the EPS chemically and to find the EPS-biosynthesizing gene cluster. The present study demonstrates that the strain produces an acidic EPS carrying a phosphate residue, which is composed of glucose, galactose, and mannose at a molecular ratio of 15.0 : 5.7 : 1.0. We also show that the acidic EPS strongly inhibits the catalytic activity of hyaluronidase (EC 3.2.1.35), an enzyme that promotes inflammatory reactions. In the present study, we also determined the complete genome sequence of the SN35N strain, demonstrating that the genome is a circular DNA of 3,267,626 bp, and the number of predicted coding genes is 3,146, with a GC content of 44.51%. In addition, the strain harbors four plasmids, designated pSN35N-1, -2, -3, and -4. Although four EPS-biosynthesizing genes, designated lpe1, lpe2, lpe3, and lpe4, are present in the SN35N chromosomal DNA, another EPS gene cluster, lpe5, is located in the pSN35N-3 plasmid, composed of 35,425 bp. EPS low-producing mutants, which were obtained by treating SN35N cells with novobiocin, lost the lpe5 gene cluster in the plasmid-curing experiment, suggesting that the gene cluster for the biosynthesis of acidic EPS is present in the plasmid. The present study shows the chemical characterization of the acidic EPS and its inhibitory effect on hyaluronidase.


September 22, 2019

Multi-omics Reveals the Lifestyle of the Acidophilic, Mineral-Oxidizing Model Species Leptospirillum ferriphilumT.

Leptospirillum ferriphilum plays a major role in acidic, metal-rich environments, where it represents one of the most prevalent iron oxidizers. These milieus include acid rock and mine drainage as well as biomining operations. Despite its perceived importance, no complete genome sequence of the type strain of this model species is available, limiting the possibilities to investigate the strategies and adaptations that Leptospirillum ferriphilum DSM 14647T (here referred to as Leptospirillum ferriphilumT) applies to survive and compete in its niche. This study presents a complete, circular genome of Leptospirillum ferriphilumT obtained by PacBio single-molecule real-time (SMRT) long-read sequencing for use as a high-quality reference. Analysis of the functionally annotated genome, mRNA transcripts, and protein concentrations revealed a previously undiscovered nitrogenase cluster for atmospheric nitrogen fixation and elucidated metabolic systems taking part in energy conservation, carbon fixation, pH homeostasis, heavy metal tolerance, the oxidative stress response, chemotaxis and motility, quorum sensing, and biofilm formation. Additionally, mRNA transcript counts and protein concentrations were compared between cells grown in continuous culture using ferrous iron as the substrate and those grown in bioleaching cultures containing chalcopyrite (CuFeS2). Adaptations of Leptospirillum ferriphilumT to growth on chalcopyrite included the possibly enhanced production of reducing power, reduced carbon dioxide fixation, as well as elevated levels of RNA transcripts and proteins involved in heavy metal resistance, with special emphasis on copper efflux systems. Finally, the expression and translation of genes responsible for chemotaxis and motility were enhanced. IMPORTANCE Leptospirillum ferriphilum is one of the most important iron oxidizers in the context of acidic and metal-rich environments during moderately thermophilic biomining.
A high-quality circular genome of Leptospirillum ferriphilumT coupled with functional omics data provides new insights into its metabolic properties, such as the novel identification of genes for atmospheric nitrogen fixation, and represents an essential step for further accurate proteomic and transcriptomic investigation of this acidophile model species in the future. Additionally, light is shed on adaptation strategies of Leptospirillum ferriphilumT for growth on the copper mineral chalcopyrite. These data can be applied to deepen our understanding and optimization of bioleaching and biooxidation, techniques that present sustainable and environmentally friendly alternatives to many traditional methods for metal extraction. Copyright © 2018 Christel et al.


September 22, 2019

Genomics: Next regeneration sequencing for reference genomes.

Various species have remarkable abilities to regenerate body parts or entire organisms after injury, but a comprehensive understanding of the molecular basis of regeneration mechanisms will require detailed genomic resources. Two new studies report high-quality reference genomes for two classic regeneration model organisms with contrasting genome sizes: the axolotl salamander Ambystoma mexicanum and the planarian flatworm Schmidtea mediterranea.


September 22, 2019

A hybrid-hierarchical genome assembly strategy to sequence the invasive golden mussel Limnoperna fortunei.

For more than 25 years, the golden mussel Limnoperna fortunei has aggressively invaded South American freshwaters, having travelled more than 5,000 km upstream across five countries. Along the way, the golden mussel has outcompeted native species and economically harmed aquaculture, hydroelectric power plants, and ship transit. We have sequenced the complete genome of the golden mussel to understand the molecular basis of its invasiveness and search for ways to control it. We assembled the 1.6 Gb genome into 20,548 scaffolds with an N50 length of 312 Kb using a hybrid and hierarchical assembly strategy from short and long DNA reads and transcriptomes. A total of 60,717 coding genes were inferred from a customized transcriptome-trained AUGUSTUS run. We also compared predicted protein sets with those of complete molluscan genomes, revealing an exacerbation of protein-binding domains in L. fortunei. Conclusions: We built one of the best bivalve genome assemblies available with a cost-effective approach that uses Illumina paired-end, mate-pair, and PacBio long reads. We expect that the continuous and careful annotation of L. fortunei’s genome will contribute to the investigation of bivalve genetics, evolution, and invasiveness, as well as to the development of biotechnological tools for aquatic pest control. © The Authors 2017. Published by Oxford University Press.
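The N50 length quoted above is a standard contiguity statistic: the length of the scaffold at which the running total, taken from longest to shortest, first reaches half of the assembly size. A minimal sketch of that definition is shown below; the function name and the toy lengths are illustrative and are not the mussel assembly's data.

```python
def n50(lengths):
    """Return the N50 of a collection of scaffold (or contig) lengths.

    Scaffolds are sorted from longest to shortest and accumulated until
    the running sum reaches at least half of the total assembly length;
    the length of the scaffold that crosses that point is the N50.
    """
    if not lengths:
        raise ValueError("need at least one scaffold length")
    ordered = sorted(lengths, reverse=True)
    half_total = sum(ordered) / 2
    running = 0
    for length in ordered:
        running += length
        if running >= half_total:
            return length


# Toy example: total is 30, half is 15; 10 + 8 = 18 crosses it at length 8.
print(n50([2, 4, 6, 8, 10]))  # prints 8
```

A larger N50 for the same total length means more of the assembly sits in long scaffolds, which is why the value is commonly reported next to the scaffold count.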


September 22, 2019

Jointly aligning a group of DNA reads improves accuracy of identifying large deletions.

Performing sequence alignment to identify structural variants, such as large deletions, from genome sequencing data is a fundamental task, but current methods are far from perfect. The current practice is to independently align each DNA read to a reference genome. We show that the propensity of genomic rearrangements to accumulate in repeat-rich regions imposes severe ambiguities in these alignments, and consequently on the variant calls; with current read lengths, this affects more than one third of known large deletions in the C. Venter genome. We present a method to jointly align reads to a genome, whereby alignment ambiguity of one read can be disambiguated by other reads. We show this leads to a significant improvement in the accuracy of identifying large deletions (≥20 bases), while imposing minimal computational overhead and maintaining an overall running time that is on par with current tools. A software implementation is available as an open-source Python program called JRA at https://bitbucket.org/jointreadalignment/jra-src.


September 22, 2019

A survey of localized sequence rearrangements in human DNA.

Genomes mutate and evolve in ways simple (substitution or deletion of bases) and complex (e.g. chromosome shattering). We do not fully understand what types of complex mutation occur, and we cannot routinely characterize arbitrarily-complex mutations in a high-throughput, genome-wide manner. Long-read DNA sequencing methods (e.g. PacBio, nanopore) are promising for this task, because one read may encompass a whole complex mutation. We describe an analysis pipeline to characterize arbitrarily-complex ‘local’ mutations, i.e. intrachromosomal mutations encompassed by one DNA read. We apply it to nanopore and PacBio reads from one human cell line (NA12878), and survey sequence rearrangements, both real and artifactual. Almost all the real rearrangements belong to recurring patterns or motifs: the most common is tandem multiplication (e.g. heptuplication), but there are also complex patterns such as localized shattering, which resembles DNA damage by radiation. Gene conversions are identified, including one between hemoglobin gamma genes. This study demonstrates a way to find intricate rearrangements with any number of duplications, deletions, and repositionings. It demonstrates a probability-based method to resolve ambiguous rearrangements involving highly similar sequences, as occurs in gene conversion. We present a catalog of local rearrangements in one human cell line, and show which rearrangement patterns occur.


September 22, 2019

De novo assembly and phasing of dikaryotic genomes from two isolates of Puccinia coronata f. sp. avenae, the causal agent of oat crown rust.

Oat crown rust, caused by the fungus Puccinia coronata f. sp. avenae, is a devastating disease that impacts worldwide oat production. For much of its life cycle, P. coronata f. sp. avenae is dikaryotic, with two separate haploid nuclei that may vary in virulence genotype, highlighting the importance of understanding haplotype diversity in this species. We generated highly contiguous de novo genome assemblies of two P. coronata f. sp. avenae isolates, 12SD80 and 12NC29, from long-read sequences. In total, we assembled 603 primary contigs for 12SD80, for a total assembly length of 99.16 Mbp, and 777 primary contigs for 12NC29, for a total length of 105.25 Mbp; approximately 52% of each genome was assembled into alternate haplotypes. This revealed structural variation between haplotypes in each isolate equivalent to more than 2% of the genome size, in addition to about 260,000 and 380,000 heterozygous single-nucleotide polymorphisms in 12SD80 and 12NC29, respectively. Transcript-based annotation identified 26,796 and 28,801 coding sequences for isolates 12SD80 and 12NC29, respectively, including about 7,000 allele pairs in haplotype-phased regions. Furthermore, expression profiling revealed clusters of coexpressed secreted effector candidates, and the majority of orthologous effectors between isolates showed conservation of expression patterns. However, a small subset of orthologs showed divergence in expression, which may contribute to differences in virulence between 12SD80 and 12NC29. This study provides the first haplotype-phased reference genome for a dikaryotic rust fungus as a foundation for future studies into virulence mechanisms in P. coronata f. sp. avenae. IMPORTANCE Disease management strategies for oat crown rust are challenged by the rapid evolution of Puccinia coronata f. sp. avenae, which renders resistance genes in oat varieties ineffective. Despite the economic importance of understanding P. coronata f. sp. avenae, resources to study the molecular mechanisms underpinning pathogenicity and the emergence of new virulence traits are lacking. Such limitations are partly due to the obligate biotrophic lifestyle of P. coronata f. sp. avenae as well as the dikaryotic nature of the genome, features that are also shared with other important rust pathogens. This study reports the first release of a haplotype-phased genome assembly for a dikaryotic fungal species and demonstrates the amenability of using emerging technologies to investigate genetic diversity in populations of P. coronata f. sp. avenae. Copyright © 2018 Miller et al.


September 22, 2019

Blood CXCR3+CD4 T cells are enriched in inducible replication competent HIV in aviremic antiretroviral therapy-treated individuals.

We recently demonstrated that lymph nodes (LNs) PD-1+/T follicular helper (Tfh) cells from antiretroviral therapy (ART)-treated HIV-infected individuals were enriched in cells containing replication competent virus. However, the distribution of cells containing inducible replication competent virus has been only partially elucidated in blood memory CD4 T-cell populations including the Tfh cell counterpart circulating in blood (cTfh). In this context, we have investigated the distribution of (1) total HIV-infected cells and (2) cells containing replication competent and infectious virus within various blood and LN memory CD4 T-cell populations of conventional antiretroviral therapy (cART)-treated HIV-infected individuals. In the present study, we show that blood CXCR3-expressing memory CD4 T cells are enriched in cells containing inducible replication competent virus and contributed the most to the total pool of cells containing replication competent and infectious virus in blood. Interestingly, subsequent proviral sequence analysis did not indicate virus compartmentalization between blood and LN CD4 T-cell populations, suggesting dynamic interchanges between the two compartments. We then investigated whether the composition of blood HIV reservoir may reflect the polarization of LN CD4 T cells at the time of reservoir seeding and showed that LN PD-1+CD4 T cells of viremic untreated HIV-infected individuals expressed significantly higher levels of CXCR3 as compared to CCR4 and/or CCR6, suggesting that blood CXCR3-expressing CD4 T cells may originate from LN PD-1+CD4 T cells. Taken together, these results indicate that blood CXCR3-expressing CD4 T cells represent the major blood compartment containing inducible replication competent virus in treated aviremic HIV-infected individuals.


September 22, 2019

Genetic separation of Listeria monocytogenes causing central nervous system infections in animals.

Listeria monocytogenes is a foodborne pathogen that causes abortion, septicemia, gastroenteritis and central nervous system (CNS) infections in ruminants and humans. L. monocytogenes strains mainly belong to two distinct phylogenetic groups, named lineages I and II. In general, clinical cases in humans and animals, in particular CNS infections, are caused by lineage I strains, while most of the environmental and food strains belong to lineage II. Little is known about why lineage I is more virulent than lineage II, even though various molecular factors and mechanisms associated with pathogenesis are known. In this study, we have used a variety of whole genome sequence analyses and comparative genomic tools in order to find characteristics that distinguish lineage I from lineage II strains and CNS infection strains from non-CNS strains. We analyzed 225 strains and identified single nucleotide variants between lineages I and II, as well as differences in the gene content. Using a novel approach based on Reads Per Kilobase per Million Mapped (RPKM), we identified 167 genes predominantly absent in lineage II but present in lineage I. These genes mostly encode membrane-associated proteins. Additionally, we found 77 genes that are largely absent in the non-CNS associated strains, while 39 genes are especially lacking in our defined “non-clinical” group. Based on the RPKM analysis and the metadata linked to the L. monocytogenes strains, we identified 6 genes potentially associated with CNS cases, which include a transcriptional regulator, an ABC transporter and a non-coding RNA. Although there is not a clear separation between pathogenic and non-pathogenic strains based on phylogenetic lineages, the presence of the genes identified in our study reveals potential pathogenesis traits in ruminant L. monocytogenes strains. Ultimately, the differences that we have found in our study will help steer future studies in understanding the virulence mechanisms of the most pathogenic L. monocytogenes strains.
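For readers unfamiliar with the RPKM measure named above, it normalizes the number of reads mapped to a gene by the gene's length in kilobases and by the total number of mapped reads in millions. The sketch below shows only that standard formula. The abstract does not describe how presence or absence was called from RPKM values, so the `is_gene_absent` helper and its threshold are hypothetical assumptions added for illustration.

```python
def rpkm(reads_on_gene, gene_length_bp, total_mapped_reads):
    """Reads Per Kilobase per Million mapped reads for one gene.

    RPKM = reads_on_gene * 1e9 / (gene_length_bp * total_mapped_reads),
    which equals reads / (length in kb) / (mapped reads in millions).
    """
    if gene_length_bp <= 0 or total_mapped_reads <= 0:
        raise ValueError("gene length and total mapped reads must be positive")
    return reads_on_gene * 1e9 / (gene_length_bp * total_mapped_reads)


def is_gene_absent(rpkm_value, threshold=1.0):
    """Hypothetical presence/absence call; the threshold is an assumption,
    not a value taken from the study."""
    return rpkm_value < threshold


# 500 reads on a 2,000 bp gene out of 10 million mapped reads.
value = rpkm(500, 2000, 10_000_000)
print(value)                  # prints 25.0
print(is_gene_absent(value))  # prints False
```

Because the value is normalized for both gene length and sequencing depth, it can be compared across genes and across strains sequenced to different depths.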


September 22, 2019

2′-O-methylation in mRNA disrupts tRNA decoding during translation elongation.

Chemical modifications of mRNA may regulate many aspects of mRNA processing and protein synthesis. Recently, 2′-O-methylation of nucleotides was identified as a frequent modification in translated regions of human mRNA, showing enrichment in codons for certain amino acids. Here, using single-molecule, bulk kinetics and structural methods, we show that 2′-O-methylation within coding regions of mRNA disrupts key steps in codon reading during cognate tRNA selection. Our results suggest that 2′-O-methylation sterically perturbs interactions of ribosomal-monitoring bases (G530, A1492 and A1493) with cognate codon-anticodon helices, thereby inhibiting downstream GTP hydrolysis by elongation factor Tu (EF-Tu) and A-site tRNA accommodation, leading to excessive rejection of cognate aminoacylated tRNAs in initial selection and proofreading. Our current and prior findings highlight how chemical modifications of mRNA tune the dynamics of protein synthesis at different steps of translation elongation.


September 22, 2019

Effect of plasmid design and type of integration event on recombinant protein expression in Pichia pastoris.

Pichia pastoris (syn. Komagataella phaffii) is one of the most common eukaryotic expression systems for heterologous protein production. Expression cassettes are typically integrated in the genome to obtain stable expression strains. In contrast to Saccharomyces cerevisiae, where short overhangs are sufficient to target highly specific integration, long overhangs are more efficient in P. pastoris and ectopic integration of foreign DNA can occur. Here, we aimed to elucidate the influence of ectopic integration by high-throughput screening of >700 transformants and whole-genome sequencing of 27 transformants. Different vector designs and linearization approaches were used to mimic the most common integration events targeted in P. pastoris. Fluorescence of an enhanced green fluorescent protein (eGFP) reporter protein was highly uniform among transformants when the expression cassettes were correctly integrated in the targeted locus. Surprisingly, most nonspecifically integrated transformants showed highly uniform expression that was comparable to specific integration, suggesting that nonspecific integration does not necessarily influence expression. However, a few clones (<10%) harboring ectopically integrated cassettes showed a greater variation spanning a 25-fold range, surpassing specifically integrated reference strains up to 6-fold. High-expression strains showed a correlation between increased gene copy numbers and high reporter protein fluorescence levels. Our results suggest that for comparing expression levels between strains, the integration locus can be neglected as long as a sufficient number of transformed strains are compared. For expression optimization of highly expressible proteins, increasing copy number appears to be the dominant positive influence rather than the integration locus, genomic rearrangements, deletions, or single-nucleotide polymorphisms (SNPs). IMPORTANCE Yeasts are commonly used as biotechnological production hosts for proteins and metabolites. In the yeast Saccharomyces cerevisiae, expression cassettes carrying foreign genes integrate highly specifically at the targeted sites in the genome. In contrast, cassettes often integrate at random genomic positions in nonconventional yeasts, such as Pichia pastoris (syn. Komagataella phaffii). Hence, cells from the same transformation event often behave differently, with significant clonal variation necessitating the screening of large numbers of strains. The importance of this study is that we systematically investigated the influence of integration events in more than 700 strains. Our findings provide novel insight into clonal variation in P. pastoris and, thus, how to avoid pitfalls and obtain reliable results. The underlying mechanisms may also play a role in other yeasts and hence could be generally relevant for recombinant yeast protein production strains. Copyright © 2018 American Society for Microbiology.


September 22, 2019

Conventional and single-molecule targeted sequencing method for specific variant detection in IKBKG while bypassing the IKBKGP1 pseudogene.

In addition to Sanger sequencing, next-generation sequencing of gene panels and exomes has emerged as a standard diagnostic tool in many laboratories. However, these captures can miss regions, have poor efficiency, or capture pseudogenes, which hamper proper diagnoses. One such example is the primary immunodeficiency-associated gene IKBKG. Its pseudogene IKBKGP1 makes traditional capture methods nonspecific. We therefore developed a long-range PCR method to efficiently target IKBKG, as well as two associated genes (IRAK4 and MYD88), while bypassing the IKBKGP1 pseudogene. Sequencing accuracy was evaluated using both conventional short-read technology and a newer long-read, single-molecule sequencer. Different mapping and variant calling options were evaluated for their ability to bypass the pseudogene using both sequencing platforms. Based on these evaluations, we determined a robust diagnostic application for unambiguous sequencing and variant calling in IKBKG, IRAK4, and MYD88. This method allows rapid identification of selected primary immunodeficiency diseases in patients suffering from life-threatening invasive pyogenic bacterial infections. Copyright © 2018 American Society for Investigative Pathology and the Association for Molecular Pathology. Published by Elsevier Inc. All rights reserved.

