Menu
April 21, 2020

The complete genome and methylome of Helicobacter pylori hpNEAfrica strain HP14039

Background Helicobacter pylori is a Gram-negative bacterium which mainly causes peptic ulcer disease in human, but is also the predominant cause of stomach cancer. It has been coevolving with human since 120,000 years and, according to Multi-locus sequence typing (MLST), H. pylori can be classified into seven major population types, namely, hpAfrica1, hpAfrica2, hpNEAfrica, hpEastAsia, hpAsia2, hpEurope and hpSahul. Helicobacter pylori harbours a large number of restriction-modification (R-M) systems. The methyltransferase (MTase) unit plays a significant role in gene regulation and also possibly modulates pathogenicity. The diversity in MTase can act as geomarkers to correlate strains with the phylogeographic origins. This paper describes the complete genome sequence and methylome of gastric pathogen H. pylori belonging to the population hpNEAfrica. Results In this paper, we present the complete genome sequence and the methylome profile of H. pylori hpNEAfrica strain HP14039, isolated from a patient who was born in Somalia and likely to be infected locally during early childhood prior to migration. The genome of HP14039 consists of 1,678,260 bp with 1574 coding genes and 38.7% GC content. The sequence analysis showed that this strain lacks the cag pathogenicity island. The vacA gene is of S2M2 type. We have also identified 15 methylation motifs, including WCANHNNNNTG and CTANNNNNNNTAYG that were not previously described. Conclusions We have described the complete genome of H. pylori strain HP14039. The information regarding phylo-geography, methylome and associated metadata would help scientific community to study more about hpNEAfrica population type.


October 23, 2019

Optimized CRISPR-Cas9 genome editing for Leishmania and its use to target a multigene family, induce chromosomal translocation, and study DNA break repair mechanisms.

CRISPR-Cas9-mediated genome editing has recently been adapted for Leishmania spp. parasites, the causative agents of human leishmaniasis. We have optimized this genome-editing tool by selecting for cells with CRISPR-Cas9 activity through cotargeting the miltefosine transporter gene; mutation of this gene leads to miltefosine resistance. This cotargeting strategy integrated into a triple guide RNA (gRNA) expression vector was used to delete all 11 copies of the A2 multigene family; this was not previously possible with the traditional gene-targeting method. We found that the Leishmania donovani rRNA promoter is more efficient than the U6 promoter in driving gRNA expression, and sequential transfections of the oligonucleotide donor significantly eased the isolation of edited mutants. A gRNA and Cas9 coexpression vector was developed that was functional in all tested Leishmania species, including L. donovani, L. major, and L. mexicana. By simultaneously targeting sites from two different chromosomes, all four types of targeted chromosomal translocations were generated, regardless of the polycistronic transcription direction from the parent chromosomes. It was possible to use this CRISPR system to create a single conserved amino acid substitution (A189G) mutation for both alleles of RAD51, a DNA recombinase involved in homology-directed repair. We found that RAD51 is essential for L. donovani survival based on direct observation of the death of mutants with both RAD51 alleles disrupted, further confirming that this CRISPR system can reveal gene essentiality. Evidence is also provided that microhomology-mediated end joining (MMEJ) plays a major role in double-strand DNA break repair in L. donovani. IMPORTANCELeishmania parasites cause human leishmaniasis. To accelerate characterization of Leishmania genes for new drug and vaccine development, we optimized and simplified the CRISPR-Cas9 genome-editing tool for Leishmania. We show that co-CRISPR targeting of the miltefosine transporter gene and serial transfections of an oligonucleotide donor significantly eased isolation of edited mutants. This cotargeting strategy was efficiently used to delete all 11 members of the A2 virulence gene family. This technical advancement is valuable, since there are many gene clusters and supernumerary chromosomes in the various Leishmania species and isolates. We simplified this CRISPR system by developing a gRNA and Cas9 coexpression vector which could be used to delete genes in various Leishmania species. This CRISPR system could also be used to generate specific chromosomal translocations, which will help in the study of Leishmania gene expression and transcription control. This study also provides new information about double-strand DNA break repair mechanisms in Leishmania.


October 23, 2019

AAV-mediated delivery of zinc finger nucleases targeting hepatitis B virus inhibits active replication.

Despite an existing effective vaccine, hepatitis B virus (HBV) remains a major public health concern. There are effective suppressive therapies for HBV, but they remain expensive and inaccessible to many, and not all patients respond well. Furthermore, HBV can persist as genomic covalently closed circular DNA (cccDNA) that remains in hepatocytes even during otherwise effective therapy and facilitates rebound in patients after treatment has stopped. Therefore, the need for an effective treatment that targets active and persistent HBV infections remains. As a novel approach to treat HBV, we have targeted the HBV genome for disruption to prevent viral reactivation and replication. We generated 3 zinc finger nucleases (ZFNs) that target sequences within the HBV polymerase, core and X genes. Upon the formation of ZFN-induced DNA double strand breaks (DSB), imprecise repair by non-homologous end joining leads to mutations that inactivate HBV genes. We delivered HBV-specific ZFNs using self-complementary adeno-associated virus (scAAV) vectors and tested their anti-HBV activity in HepAD38 cells. HBV-ZFNs efficiently disrupted HBV target sites by inducing site-specific mutations. Cytotoxicity was seen with one of the ZFNs. scAAV-mediated delivery of a ZFN targeting HBV polymerase resulted in complete inhibition of HBV DNA replication and production of infectious HBV virions in HepAD38 cells. This effect was sustained for at least 2 weeks following only a single treatment. Furthermore, high specificity was observed for all ZFNs, as negligible off-target cleavage was seen via high-throughput sequencing of 7 closely matched potential off-target sites. These results show that HBV-targeted ZFNs can efficiently inhibit active HBV replication and suppress the cellular template for HBV persistence, making them promising candidates for eradication therapy.


October 23, 2019

Galactofuranose in Mycoplasma mycoides is important for membrane integrity and conceals adhesins but does not contribute to serum resistance.

Mycoplasma mycoides subsp. capri (Mmc) and subsp. mycoides (Mmm) are important ruminant pathogens worldwide causing diseases such as pleuropneumonia, mastitis and septicaemia. They express galactofuranose residues on their surface, but their role in pathogenesis has not yet been determined. The M.?mycoides genomes contain up to several copies of the glf gene, which encodes an enzyme catalysing the last step in the synthesis of galactofuranose. We generated a deletion of the glf gene in a strain of Mmc using genome transplantation and tandem repeat endonuclease coupled cleavage (TREC) with yeast as an intermediary host for the genome editing. As expected, the resulting YCp1.1-?glf strain did not produce the galactofuranose-containing glycans as shown by immunoblots and immuno-electronmicroscopy employing a galactofuranose specific monoclonal antibody. The mutant lacking galactofuranose exhibited a decreased growth rate and a significantly enhanced adhesion to small ruminant cells. The mutant was also ‘leaking’ as revealed by a ß-galactosidase-based assay employing a membrane impermeable substrate. These findings indicate that galactofuranose-containing polysaccharides conceal adhesins and are important for membrane integrity. Unexpectedly, the mutant strain showed increased serum resistance. © 2015 The Authors. Molecular Microbiology published by John Wiley & Sons Ltd.


October 23, 2019

Sites of retroviral DNA integration: From basic research to clinical applications.

One of the most crucial steps in the life cycle of a retrovirus is the integration of the viral DNA (vDNA) copy of the RNA genome into the genome of an infected host cell. Integration provides for efficient viral gene expression as well as for the segregation of viral genomes to daughter cells upon cell division. Some integrated viruses are not well expressed, and cells latently infected with human immunodeficiency virus type 1 (HIV-1) can resist the action of potent antiretroviral drugs and remain dormant for decades. Intensive research has been dedicated to understanding the catalytic mechanism of integration, as well as the viral and cellular determinants that influence integration site distribution throughout the host genome. In this review, we summarize the evolution of techniques that have been used to recover and map retroviral integration sites, from the early days that first indicated that integration could occur in multiple cellular DNA locations, to current technologies that map upwards of millions of unique integration sites from single in vitro integration reactions or cell culture infections. We further review important insights gained from the use of such mapping techniques, including the monitoring of cell clonal expansion in patients treated with retrovirus-based gene therapy vectors, or patients with acquired immune deficiency syndrome (AIDS) on suppressive antiretroviral therapy (ART). These insights span from integrase (IN) enzyme sequence preferences within target DNA (tDNA) at the sites of integration, to the roles of host cellular proteins in mediating global integration distribution, to the potential relationship between genomic location of vDNA integration site and retroviral latency.


October 23, 2019

CRISPR/Cas9-generated p47(phox)-deficient cell line for Chronic Granulomatous Disease gene therapy vector development.

Development of gene therapy vectors requires cellular models reflecting the genetic background of a disease thus allowing for robust preclinical vector testing. For human p47(phox)-deficient chronic granulomatous disease (CGD) vector testing we generated a cellular model using clustered regularly interspaced short palindromic repeats (CRISPR)/Cas9 to introduce a GT-dinucleotide deletion (?GT) mutation in p47(phox) encoding NCF1 gene in the human acute myeloid leukemia PLB-985 cell line. CGD is a group of hereditary immunodeficiencies characterized by impaired respiratory burst activity in phagocytes due to a defective phagocytic nicotinamide adenine dinucleotide phosphate (NADPH) oxidase. In Western countries autosomal-recessive p47(phox)-subunit deficiency represents the second largest CGD patient cohort with unique genetics, as the vast majority of p47(phox) CGD patients carries ?GT deletion in exon two of the NCF1 gene. The established PLB-985 NCF1 ?GT cell line reflects the most frequent form of p47(phox)-deficient CGD genetically and functionally. It can be differentiated to granulocytes efficiently, what creates an attractive alternative to currently used iPSC models for rapid testing of novel gene therapy approaches.


October 23, 2019

Structural determination of the broadly reactive anti-IGHV1-69 anti-idiotypic antibody G6 and its idiotope.

The heavy chain IGHV1-69 germline gene exhibits a high level of polymorphism and shows biased use in protective antibody (Ab) responses to infections and vaccines. It is also highly expressed in several B cell malignancies and autoimmune diseases. G6 is an anti-idiotypic monoclonal Ab that selectively binds to IGHV1-69 heavy chain germline gene 51p1 alleles that have been implicated in these Ab responses and disease processes. Here, we determine the co-crystal structure of humanized G6 (hG6.3) in complex with anti-influenza hemagglutinin stem-directed broadly neutralizing Ab D80. The core of the hG6.3 idiotope is a continuous string of CDR-H2 residues starting with M53 and ending with N58. G6 binding studies demonstrate the remarkable breadth of binding to 51p1 IGHV1-69 Abs with diverse CDR-H3, light chain, and antigen binding specificities. These studies detail the broad expression of the G6 cross-reactive idiotype (CRI) that further define its potential role in precision medicine. Copyright © 2017 The Authors. Published by Elsevier Inc. All rights reserved.


October 23, 2019

Rapid CRISPR/Cas9-mediated cloning of full-length Epstein-Barr virus genomes from latently infected cells.

Herpesviruses have relatively large DNA genomes of more than 150 kb that are difficult to clone and sequence. Bacterial artificial chromosome (BAC) cloning of herpesvirus genomes is a powerful technique that greatly facilitates whole viral genome sequencing as well as functional characterization of reconstituted viruses. We describe recently invented technologies for rapid BAC cloning of herpesvirus genomes using CRISPR/Cas9-mediated homology-directed repair. We focus on recent BAC cloning techniques of Epstein-Barr virus (EBV) genomes and discuss the possible advantages of a CRISPR/Cas9-mediated strategy comparatively with precedent EBV-BAC cloning strategies. We also describe the design decisions of this technology as well as possible pitfalls and points to be improved in the future. The obtained EBV-BAC clones are subjected to long-read sequencing analysis to determine complete EBV genome sequence including repetitive regions. Rapid cloning and sequence determination of various EBV strains will greatly contribute to the understanding of their global geographical distribution. This technology can also be used to clone disease-associated EBV strains and test the hypothesis that they have special features that distinguish them from strains that infect asymptomatically.


September 22, 2019

An environmental bacterial taxon with a large and distinct metabolic repertoire.

Cultivated bacteria such as actinomycetes are a highly useful source of biomedically important natural products. However, such ‘talented’ producers represent only a minute fraction of the entire, mostly uncultivated, prokaryotic diversity. The uncultured majority is generally perceived as a large, untapped resource of new drug candidates, but so far it is unknown whether taxa containing talented bacteria indeed exist. Here we report the single-cell- and metagenomics-based discovery of such producers. Two phylotypes of the candidate genus ‘Entotheonella’ with genomes of greater than 9 megabases and multiple, distinct biosynthetic gene clusters co-inhabit the chemically and microbially rich marine sponge Theonella swinhoei. Almost all bioactive polyketides and peptides known from this animal were attributed to a single phylotype. ‘Entotheonella’ spp. are widely distributed in sponges and belong to an environmental taxon proposed here as candidate phylum ‘Tectomicrobia’. The pronounced bioactivities and chemical uniqueness of ‘Entotheonella’ compounds provide significant opportunities for ecological studies and drug discovery.


September 22, 2019

Transcriptional adaptations during long-term persistence of Staphylococcus aureus in the airways of a cystic fibrosis patient.

The lungs of Cystic fibrosis (CF) patients are often colonized and/or infected by Staphylococcus aureus for years, mostly by one predominant clone. For long-term survival in this environment, S. aureus needs to adapt during its interactions with host factors, antibiotics, and other pathogens. Here, we study long-term transcriptional as well as genomic adaptations of an isogenic pair of S. aureus isolates from a single patient using RNA sequencing (RNA-Seq) and whole genome sequencing (WGS). Mimicking in vivo conditions, we cultivated the S. aureus isolates using artificial sputum medium before harvesting RNA for subsequent analysis. We confirmed our RNA-Seq data using quantitative real-time (qRT)-PCR and additionally investigated intermediate isolates from the same patient representing in total 13.2 years of persistence in the CF airways. Comparative RNA-Seq analysis of the first and the last (“late”) isolate revealed significant differences in the late isolate after 13.2 years of persistence. Of the 2545 genes expressed in both isolates that were cultivated aerobically, 256 genes were up- and 161 were down-regulated with a minimum 2-fold change (2f). Focusing on 25 highly (=8f) up- (n=9) or down- (n=16) regulated genes, we identified several genes encoding for virulence factors involved in immune evasion, bacterial spread or secretion (e.g. spa, sak, and esxA). Moreover, these genes displayed similar expression trends under aerobic, microaerophilic and anaerobic conditions. Further qRT-PCR-experiments of highly up- or down-regulated genes within intermediate S. aureus isolates resulted in different gene expression patterns over the years. Using sequencing analysis of the differently expressed genes and their upstream regions in the late S. aureus isolate resulted in only few genomic alterations. Comparative transcriptomic analysis revealed adaptive changes affecting mainly genes involved in host-pathogen interaction. Although the underlying mechanisms were not known, our results suggest adaptive processes beyond genomic mutations triggered by local factors rather than by activation of global regulators. Copyright © 2014 The Authors. Published by Elsevier GmbH.. All rights reserved.


September 22, 2019

Comparison of the mitochondrial genomes and steady state transcriptomes of two strains of the trypanosomatid parasite, Leishmania tarentolae.

U-insertion/deletion RNA editing is a post-transcriptional mitochondrial RNA modification phenomenon required for viability of trypanosomatid parasites. Small guide RNAs encoded mainly by the thousands of catenated minicircles contain the information for this editing. We analyzed by NGS technology the mitochondrial genomes and transcriptomes of two strains, the old lab UC strain and the recently isolated LEM125 strain. PacBio sequencing provided complete minicircle sequences which avoided the assembly problem of short reads caused by the conserved regions. Minicircles were identified by a characteristic size, the presence of three short conserved sequences, a region of inherently bent DNA and the presence of single gRNA genes at a fairly defined location. The LEM125 strain contained over 114 minicircles encoding different gRNAs and the UC strain only ~24 minicircles. Some LEM125 minicircles contained no identifiable gRNAs. Approximate copy numbers of the different minicircle classes in the network were determined by the number of PacBio CCS reads that assembled to each class. Mitochondrial RNA libraries from both strains were mapped against the minicircle and maxicircle sequences. Small RNA reads mapped to the putative gRNA genes but also to multiple regions outside the genes on both strands and large RNA reads mapped in many cases over almost the entire minicircle on both strands. These data suggest that minicircle transcription is complete and bidirectional, with 3′ processing yielding the mature gRNAs. Steady state RNAs in varying abundances are derived from all maxicircle genes, including portions of the repetitive divergent region. The relative extents of editing in both strains correlated with the presence of a cascade of cognate gRNAs. These data should provide the foundation for a deeper understanding of this dynamic genetic system as well as the evolutionary variation of editing in different strains.


September 22, 2019

HIV-1 infection of primary CD4(+) T cells regulates the expression of specific HERV-K (HML-2) elements.

Endogenous retroviruses (ERVs) occupy extensive regions of the human genome. Although many of these retroviral elements have lost their ability to replicate, those whose insertion took place more recently, such as the HML-2 group of HERV-K elements, still retain intact open reading frames and the capacity to produce certain viral RNA and/or proteins. Transcription of these ERVs is, however, tightly regulated by dedicated epigenetic control mechanisms. Nonetheless, it has been reported that some pathologic states, such as viral infections and certain cancers, coincide with ERV expression suggesting transcriptional reawakening is possible. HML-2 elements are reportedly induced during HIV-1 infection, but the conserved nature of these elements has, until recently, rendered their expression profiling problematic.Here, we provide comprehensive HERV-K HML-2 expression profiles specific for productively HIV-1 infected primary human CD4(+) T cells. We combined enrichment of HIV-1 infected cells using a reporter virus expressing a surface reporter for gentle and efficient purification with long-read Single Molecule Real-Time sequencing. We show that three HML-2 proviruses, 6q25.1, 8q24.3, and 19q13.42 are up-regulated on average between 3- and 5-fold in HIV-1 infected CD4(+) T cells. One provirus, HML-2 12q24.33, in contrast, was repressed in the presence of active HIV replication.In conclusion, this report identifies the HERV-K HML-2 loci whose expression profiles differ upon HIV-1 infection in primary human CD4(+) T cells. These data will help pave the way for further studies on the influence of endogenous retroviruses on HIV-1 replication.Importance Endogenous retroviruses inhabit big portions of our genome. And although they are mainly inert some of the evolutionarily younger members maintain the ability to express both RNA as well as proteins. We have developed an approach using long-read SMRT sequencing that produces long reads, that provides us with ability to obtain detailed and accurate HERV-K HML-2 expression profiles. We have now applied this approach to study HERV-K expression in the presence and absence of productive HIV-1 infection of primary human CD4(+) T cells. In addition to using SMRT sequencing, our strategy also includes the magnetic selection of the infected cells so that levels of background expression due to uninfected cells are kept at a minimum. The results in this manuscript provide the blueprint for in-depth studies of the interactions of the authentic upregulated HERV-K HML-2 elements and HIV-1. Copyright © 2017 American Society for Microbiology.


September 22, 2019

Global transcript structure resolution of high gene density genomes through multi-platform data integration.

Annotation of herpesvirus genomes has traditionally been undertaken through the detection of open reading frames and other genomic motifs, supplemented with sequencing of individual cDNAs. Second generation sequencing and high-density microarray studies have revealed vastly greater herpesvirus transcriptome complexity than is captured by existing annotation. The pervasive nature of overlapping transcription throughout herpesvirus genomes, however, poses substantial problems in resolving transcript structures using these methods alone. We present an approach that combines the unique attributes of Pacific Biosciences Iso-Seq long-read, Illumina short-read and deepCAGE (Cap Analysis of Gene Expression) sequencing to globally resolve polyadenylated isoform structures in replicating Epstein-Barr virus (EBV). Our method, Transcriptome Resolution through Integration of Multi-platform Data (TRIMD), identifies nearly 300 novel EBV transcripts, quadrupling the size of the annotated viral transcriptome. These findings illustrate an array of mechanisms through which EBV achieves functional diversity in its relatively small, compact genome including programmed alternative splicing (e.g. across the IR1 repeats), alternative promoter usage by LMP2 and other latency-associated transcripts, intergenic splicing at the BZLF2 locus, and antisense transcription and pervasive readthrough transcription throughout the genome.© The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.


September 22, 2019

Rapid infectious disease identification by next-generation DNA sequencing.

Currently, there is a critical need to rapidly identify infectious organisms in clinical samples. Next-Generation Sequencing (NGS) could surmount the deficiencies of culture-based methods; however, there are no standardized, automated programs to process NGS data. To address this deficiency, we developed the Rapid Infectious Disease Identification (RIDI™) system. The system requires minimal guidance, which reduces operator errors. The system is compatible with the three major NGS platforms. It automatically interfaces with the sequencing system, detects their data format, configures the analysis type, applies appropriate quality control, and analyzes the results. Sequence information is characterized using both the NCBI database and RIDI™ specific databases. RIDI™ was designed to identify high probability sequence matches and more divergent matches that could represent different or novel species. We challenged the system using defined American Type Culture Collection (ATCC) reference standards of 27 species, both individually and in varying combinations. The system was able to rapidly detect known organisms in <12h with multi-sample throughput. The system accurately identifies 99.5% of the DNA sequence reads at the genus-level and 75.3% at the species-level in reference standards. It has a limit of detection of 146cells/ml in simulated clinical samples, and is also able to identify the components of polymicrobial samples with 16.9% discrepancy at the genus-level and 31.2% at the species-level. Thus, the system's effectiveness may exceed current methods, especially in situations where culture methods could produce false negatives or where rapid results would influence patient outcomes. Copyright © 2016 Elsevier B.V. All rights reserved.


September 22, 2019

Advantages of genome sequencing by long-read sequencer using SMRT technology in medical area.

PacBio RS II is the first commercialized third-generation DNA sequencer able to sequence a single molecule DNA in real-time without amplification. PacBio RS II’s sequencing technology is novel and unique, enabling the direct observation of DNA synthesis by DNA polymerase. PacBio RS II confers four major advantages compared to other sequencing technologies: long read lengths, high consensus accuracy, a low degree of bias, and simultaneous capability of epigenetic characterization. These advantages surmount the obstacle of sequencing genomic regions such as high/low G+C, tandem repeat, and interspersed repeat regions. Moreover, PacBio RS II is ideal for whole genome sequencing, targeted sequencing, complex population analysis, RNA sequencing, and epigenetics characterization. With PacBio RS II, we have sequenced and analyzed the genomes of many species, from viruses to humans. Herein, we summarize and review some of our key genome sequencing projects, including full-length viral sequencing, complete bacterial genome and almost-complete plant genome assemblies, and long amplicon sequencing of a disease-associated gene region. We believe that PacBio RS II is not only an effective tool for use in the basic biological sciences but also in the medical/clinical setting.


Talk with an expert

If you have a question, need to check the status of an order, or are interested in purchasing an instrument, we're here to help.