Menu
July 19, 2019  |  

Mind the gap: upgrading genomes with Pacific Biosciences RS long-read sequencing technology.

Many genomes have been sequenced to high-quality draft status using Sanger capillary electrophoresis and/or newer short-read sequence data and whole genome assembly techniques. However, even the best draft genomes contain gaps and other imperfections due to limitations in the input data and the techniques used to build draft assemblies. Sequencing biases, repetitive genomic features, genomic polymorphism, and other complicating factors all come together to make some regions difficult or impossible to assemble. Traditionally, draft genomes were upgraded to “phase 3 finished” status using time-consuming and expensive Sanger-based manual finishing processes. For more facile assembly and automated finishing of draft genomes, we present here an automated approach to finishing using long-reads from the Pacific Biosciences RS (PacBio) platform. Our algorithm and associated software tool, PBJelly, (publicly available at https://sourceforge.net/projects/pb-jelly/) automates the finishing process using long sequence reads in a reference-guided assembly process. PBJelly also provides “lift-over” co-ordinate tables to easily port existing annotations to the upgraded assembly. Using PBJelly and long PacBio reads, we upgraded the draft genome sequences of a simulated Drosophila melanogaster, the version 2 draft Drosophila pseudoobscura, an assembly of the Assemblathon 2.0 budgerigar dataset, and a preliminary assembly of the Sooty mangabey. With 24× mapped coverage of PacBio long-reads, we addressed 99% of gaps and were able to close 69% and improve 12% of all gaps in D. pseudoobscura. With 4× mapped coverage of PacBio long-reads we saw reads address 63% of gaps in our budgerigar assembly, of which 32% were closed and 63% improved. With 6.8× mapped coverage of mangabey PacBio long-reads we addressed 97% of gaps and closed 66% of addressed gaps and improved 19%. The accuracy of gap closure was validated by comparison to Sanger sequencing on gaps from the original D. pseudoobscura draft assembly and shown to be dependent on initial reference quality.


July 19, 2019  |  

Quantifying genome-editing outcomes at endogenous loci with SMRT sequencing.

Targeted genome editing with engineered nucleases has transformed the ability to introduce precise sequence modifications at almost any site within the genome. A major obstacle to probing the efficiency and consequences of genome editing is that no existing method enables the frequency of different editing events to be simultaneously measured across a cell population at any endogenous genomic locus. We have developed a novel method for quantifying individual genome editing outcomes at any site of interest using single molecule real time (SMRT) DNA sequencing. We show that this approach can be applied at various loci, using multiple engineered nuclease platforms including TALENs, RNA guided endonucleases (CRISPR/Cas9), and ZFNs, and in different cell lines to identify conditions and strategies in which the desired engineering outcome has occurred. This approach facilitates the evaluation of new gene editing technologies and permits sensitive quantification of editing outcomes in almost every experimental system used.


July 19, 2019  |  

Evolution of hypervirulence by a MRSA clone through acquisition of a transposable element.

Staphylococcus aureus has evolved as a pathogen that causes a range of diseases in humans. There are two dominant modes of evolution thought to explain most of the virulence differences between strains. First, virulence genes may be acquired from other organisms. Second, mutations may cause changes in the regulation and expression of genes. Here we describe an evolutionary event in which transposition of an IS element has a direct impact on virulence gene regulation resulting in hypervirulence. Whole-genome analysis of a methicillin-resistant S. aureus (MRSA) strain USA500 revealed acquisition of a transposable element (IS256) that is absent from close relatives of this strain. Of the multiple copies of IS256 found in the USA500 genome, one was inserted in the promoter sequence of repressor of toxins (Rot), a master transcriptional regulator responsible for the expression of virulence factors in S. aureus. We show that insertion into the rot promoter by IS256 results in the derepression of cytotoxin expression and increased virulence. Taken together, this work provides new insight into evolutionary strategies by which S. aureus is able to modify its virulence properties and demonstrates a novel mechanism by which horizontal gene transfer directly impacts virulence through altering toxin regulation. © 2014 John Wiley & Sons Ltd.


July 19, 2019  |  

Multiplexed highly-accurate DNA sequencing of closely-related HIV-1 variants using continuous long reads from single molecule, real-time sequencing.

Single Molecule, Real-Time (SMRT(®)) Sequencing (Pacific Biosciences, Menlo Park, CA, USA) provides the longest continuous DNA sequencing reads currently available. However, the relatively high error rate in the raw read data requires novel analysis methods to deconvolute sequences derived from complex samples. Here, we present a workflow of novel computer algorithms able to reconstruct viral variant genomes present in mixtures with an accuracy of >QV50. This approach relies exclusively on Continuous Long Reads (CLR), which are the raw reads generated during SMRT Sequencing. We successfully implement this workflow for simultaneous sequencing of mixtures containing up to forty different >9 kb HIV-1 full genomes. This was achieved using a single SMRT Cell for each mixture and desktop computing power. This novel approach opens the possibility of solving complex sequencing tasks that currently lack a solution. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.


July 19, 2019  |  

HLA Class-II associated HIV polymorphisms predict escape from CD4+ T Cell responses.

Antiretroviral therapy, antibody and CD8+ T cell-mediated responses targeting human immunodeficiency virus-1 (HIV-1) exert selection pressure on the virus necessitating escape; however, the ability of CD4+ T cells to exert selective pressure remains unclear. Using a computational approach on HIV gag/pol/nef sequences and HLA-II allelic data, we identified 29 HLA-II associated HIV sequence polymorphisms or adaptations (HLA-AP) in an African cohort of chronically HIV-infected individuals. Epitopes encompassing the predicted adaptation (AE) or its non-adapted (NAE) version were evaluated for immunogenicity. Using a CD8-depleted IFN-? ELISpot assay, we determined that the magnitude of CD4+ T cell responses to the predicted epitopes in controllers was higher compared to non-controllers (p<0.0001). However, regardless of the group, the magnitude of responses to AE was lower as compared to NAE (p<0.0001). CD4+ T cell responses in patients with acute HIV infection (AHI) demonstrated poor immunogenicity towards AE as compared to NAE encoded by their transmitted founder virus. Longitudinal data in AHI off antiretroviral therapy demonstrated sequence changes that were biologically confirmed to represent CD4+ escape mutations. These data demonstrate an innovative application of HLA-associated polymorphisms to identify biologically relevant CD4+ epitopes and suggests CD4+ T cells are active participants in driving HIV evolution.


July 19, 2019  |  

Heterosexual transmission of subtype C HIV-1 selects consensus-like variants without increased replicative capacity or interferon-a resistance.

Heterosexual transmission of HIV-1 is characterized by a genetic bottleneck that selects a single viral variant, the transmitted/founder (TF), during most transmission events. To assess viral characteristics influencing HIV-1 transmission, we sequenced 167 near full-length viral genomes and generated 40 infectious molecular clones (IMC) including TF variants and multiple non-transmitted (NT) HIV-1 subtype C variants from six linked heterosexual transmission pairs near the time of transmission. Consensus-like genomes sensitive to donor antibodies were selected for during transmission in these six transmission pairs. However, TF variants did not demonstrate increased viral fitness in terms of particle infectivity or viral replicative capacity in activated peripheral blood mononuclear cells (PBMC) and monocyte-derived dendritic cells (MDDC). In addition, resistance of the TF variant to the antiviral effects of interferon-a (IFN-a) was not significantly different from that of non-transmitted variants from the same transmission pair. Thus neither in vitro viral replicative capacity nor IFN-a resistance discriminated the transmission potential of viruses in the quasispecies of these chronically infected individuals. However, our findings support the hypothesis that within-host evolution of HIV-1 in response to adaptive immune responses reduces viral transmission potential.


July 19, 2019  |  

Comprehensive bioinformatics analysis of Mycoplasma pneumoniae genomes to investigate underlying population structure and type-specific determinants.

Mycoplasma pneumoniae is a significant cause of respiratory illness worldwide. Despite a minimal and highly conserved genome, genetic diversity within the species may impact disease. We performed whole genome sequencing (WGS) analysis of 107 M. pneumoniae isolates, including 67 newly sequenced using the Pacific BioSciences RS II and/or Illumina MiSeq sequencing platforms. Comparative genomic analysis of 107 genomes revealed >3,000 single nucleotide polymorphisms (SNPs) in total, including 520 type-specific SNPs. Population structure analysis supported the existence of six distinct subgroups, three within each type. We developed a predictive model to classify an isolate based on whole genome SNPs called against the reference genome into the identified subtypes, obviating the need for genome assembly. This study is the most comprehensive WGS analysis for M. pneumoniae to date, underscoring the power of combining complementary sequencing technologies to overcome difficult-to-sequence regions and highlighting potential differential genomic signatures in M. pneumoniae.


July 19, 2019  |  

HIV envelope glycoform heterogeneity and localized diversity govern the initiation and maturation of a V2 apex broadly neutralizing antibody lineage.

Understanding how broadly neutralizing antibodies (bnAbs) to HIV envelope (Env) develop during natural infection can help guide the rational design of an HIV vaccine. Here, we described a bnAb lineage targeting the Env V2 apex and the Ab-Env co-evolution that led to development of neutralization breadth. The lineage Abs bore an anionic heavy chain complementarity-determining region 3 (CDRH3) of 25 amino acids, among the shortest known for this class of Abs, and achieved breadth with only 10% nucleotide somatic hypermutation and no insertions or deletions. The data suggested a role for Env glycoform heterogeneity in the activation of the lineage germline B cell. Finally, we showed that localized diversity at key V2 epitope residues drove bnAb maturation toward breadth, mirroring the Env evolution pattern described for another donor who developed V2-apex targeting bnAbs. Overall, these findings suggest potential strategies for vaccine approaches based on germline-targeting and serial immunogen design. Copyright © 2017 The Authors. Published by Elsevier Inc. All rights reserved.


July 19, 2019  |  

Male-killing toxin in a bacterial symbiont of Drosophila.

Several lineages of symbiotic bacteria in insects selfishly manipulate host reproduction to spread in a population 1 , often by distorting host sex ratios. Spiroplasma poulsonii2,3 is a helical and motile, Gram-positive symbiotic bacterium that resides in a wide range of Drosophila species 4 . A notable feature of S. poulsonii is male killing, whereby the sons of infected female hosts are selectively killed during development1,2. Although male killing caused by S. poulsonii has been studied since the 1950s, its underlying mechanism is unknown. Here we identify an S. poulsonii protein, designated Spaid, whose expression induces male killing. Overexpression of Spaid in D. melanogaster kills males but not females, and induces massive apoptosis and neural defects, recapitulating the pathology observed in S. poulsonii-infected male embryos5-11. Our data suggest that Spaid targets the dosage compensation machinery on the male X chromosome to mediate its effects. Spaid contains ankyrin repeats and a deubiquitinase domain, which are required for its subcellular localization and activity. Moreover, we found a laboratory mutant strain of S. poulsonii with reduced male-killing ability and a large deletion in the spaid locus. Our study has uncovered a bacterial protein that affects host cellular machinery in a sex-specific way, which is likely to be the long-searched-for factor responsible for S. poulsonii-induced male killing.


July 7, 2019  |  

Biochemical characterization of a Naegleria TET-like oxygenase and its application in single molecule sequencing of 5-methylcytosine.

Modified DNA bases in mammalian genomes, such as 5-methylcytosine ((5m)C) and its oxidized forms, are implicated in important epigenetic regulation processes. In human or mouse, successive enzymatic conversion of (5m)C to its oxidized forms is carried out by the ten-eleven translocation (TET) proteins. Previously we reported the structure of a TET-like (5m)C oxygenase (NgTET1) from Naegleria gruberi, a single-celled protist evolutionarily distant from vertebrates. Here we show that NgTET1 is a 5-methylpyrimidine oxygenase, with activity on both (5m)C (major activity) and thymidine (T) (minor activity) in all DNA forms tested, and provide unprecedented evidence for the formation of 5-formyluridine ((5f)U) and 5-carboxyuridine ((5ca)U) in vitro. Mutagenesis studies reveal a delicate balance between choice of (5m)C or T as the preferred substrate. Furthermore, our results suggest substrate preference by NgTET1 to (5m)CpG and TpG dinucleotide sites in DNA. Intriguingly, NgTET1 displays higher T-oxidation activity in vitro than mammalian TET1, supporting a closer evolutionary relationship between NgTET1 and the base J-binding proteins from trypanosomes. Finally, we demonstrate that NgTET1 can be readily used as a tool in (5m)C sequencing technologies such as single molecule, real-time sequencing to map (5m)C in bacterial genomes at base resolution.


July 7, 2019  |  

vanG element insertions within a conserved chromosomal site conferring vancomycin resistance to Streptococcus agalactiae and Streptococcus anginosus.

Three vancomycin-resistant streptococcal strains carrying vanG elements (two invasive Streptococcus agalactiae isolates [GBS-NY and GBS-NM, both serotype II and multilocus sequence type 22] and one Streptococcus anginosus [Sa]) were examined. The 45,585-bp elements found within Sa and GBS-NY were nearly identical (together designated vanG-1) and shared near-identity over an ~15-kb overlap with a previously described vanG element from Enterococcus faecalis. Unexpectedly, vanG-1 shared much less homology with the 49,321-bp vanG-2 element from GBS-NM, with widely different levels (50% to 99%) of sequence identity shared among 44 related open reading frames. Immediately adjacent to both vanG-1 and vanG-2 were 44,670-bp and 44,680-bp integrative conjugative element (ICE)-like sequences, designated ICE-r, that were nearly identical in the two group B streptococcal (GBS) strains. The dual vanG and ICE-r elements from both GBS strains were inserted at the same position, between bases 1328 and 1329, within the identical RNA methyltransferase (rumA) genes. A GenBank search revealed that although most GBS strains contained insertions within this specific site, only sequence type 22 (ST22) GBS strains contained highly related ICE-r derivatives. The vanG-1 element in Sa was also inserted within this position corresponding to its rumA homolog adjacent to an ICE-r derivative. vanG-1 insertions were previously reported within the same relative position in the E. faecalis rumA homolog. An ICE-r sequence perfectly conserved with respect to its counterpart in GBS-NY was apparent within the same site of the rumA homolog of a Streptococcus dysgalactiae subsp. equisimilis strain. Additionally, homologous vanG-like elements within the conserved rumA target site were evident in Roseburia intestinalis. Importance: These three streptococcal strains represent the first known vancomycin-resistant strains of their species. The collective observations made from these strains reveal a specific hot spot for insertional elements that is conserved between streptococci and different Gram-positive species. The two GBS strains potentially represent a GBS lineage that is predisposed to insertion of vanG elements. Copyright © 2014 Srinivasan et al.


July 7, 2019  |  

Emergence of a new Neisseria meningitidis clonal complex 11 lineage 11.2 clade as an effective urogenital pathogen.

Neisseria meningitidis (Nm) clonal complex 11 (cc11) lineage is a hypervirulent pathogen responsible for outbreaks of invasive meningococcal disease, including among men who have sex with men, and is increasingly associated with urogenital infections. Recently, clusters of Nm urethritis have emerged primarily among heterosexual males in the United States. We determined that nonencapsulated meningococcal isolates from an ongoing Nm urethritis outbreak among epidemiologically unrelated men in Columbus, Ohio, are linked to increased Nm urethritis cases in multiple US cities, including Atlanta and Indianapolis, and that they form a unique clade (the US Nm urethritis clade, US_NmUC). The isolates belonged to the cc11 lineage 11.2/ET-15 with fine type of PorA P1.5-1, 10-8; FetA F3-6; PorB 2-2 and express a unique FHbp allele. A common molecular fingerprint of US_NmUC isolates was an IS1301 element in the intergenic region separating the capsule ctr-css operons and adjacent deletion of cssA/B/C and a part of csc, encoding the serogroup C capsule polymerase. This resulted in the loss of encapsulation and intrinsic lipooligosaccharide sialylation that may promote adherence to mucosal surfaces. Furthermore, we detected an IS1301-mediated inversion of an ~20-kb sequence near the cps locus. Surprisingly, these isolates had acquired by gene conversion the complete gonococcal denitrification norB-aniA gene cassette, and strains grow well anaerobically. The cc11 US_NmUC isolates causing urethritis clusters in the United States may have adapted to a urogenital environment by loss of capsule and gene conversion of the Neisseria gonorrheae norB-aniA cassette promoting anaerobic growth.


July 7, 2019  |  

Commensal Propionibacterium strain UF1 mitigates intestinal inflammation via Th17 cell regulation.

Consumption of human breast milk (HBM) attenuates the incidence of necrotizing enterocolitis (NEC), which remains a leading and intractable cause of mortality in preterm infants. Here, we report that this diminution correlates with alterations in the gut microbiota, particularly enrichment of Propionibacterium species. Transfaunation of microbiota from HBM-fed preterm infants or a newly identified and cultured Propionibacterium strain, P. UF1, to germfree mice conferred protection against pathogen infection and correlated with profound increases in intestinal Th17 cells. The induction of Th17 cells was dependent on bacterial dihydrolipoamide acetyltransferase (DlaT), a major protein expressed on the P. UF1 surface layer (S-layer). Binding of P. UF1 to its cognate receptor, SIGNR1, on dendritic cells resulted in the regulation of intestinal phagocytes. Importantly, transfer of P. UF1 profoundly mitigated induced NEC-like injury in neonatal mice. Together, these results mechanistically elucidate the protective effects of HBM and P. UF1-induced immunoregulation, which safeguard against proinflammatory diseases, including NEC.


Talk with an expert

If you have a question, need to check the status of an order, or are interested in purchasing an instrument, we're here to help.