Menu
July 19, 2019

De novo assembly of two Swedish genomes reveals missing segments from the human GRCh38 reference and improves variant calling of population-scale sequencing data.

The current human reference sequence (GRCh38) is a foundation for large-scale sequencing projects. However, recent studies have suggested that GRCh38 may be incomplete and give a suboptimal representation of specific population groups. Here, we performed a de novo assembly of two Swedish genomes that revealed over 10 Mb of sequences absent from the human GRCh38 reference in each individual. Around 6 Mb of these novel sequences (NS) are shared with a Chinese personal genome. The NS are highly repetitive, have an elevated GC-content, and are primarily located in centromeric or telomeric regions. Up to 1 Mb of NS can be assigned to chromosome Y, and large segments are also missing from GRCh38 at chromosomes 14, 17, and 21. Inclusion of NS into the GRCh38 reference radically improves the alignment and variant calling from short-read whole-genome sequencing data at several genomic loci. A re-analysis of a Swedish population-scale sequencing project yields > 75,000 putative novel single nucleotide variants (SNVs) and removes > 10,000 false positive SNV calls per individual, some of which are located in protein coding regions. Our results highlight that the GRCh38 reference is not yet complete and demonstrate that personal genome assemblies from local populations can improve the analysis of short-read whole-genome sequencing data.


July 19, 2019

Genome organization and DNA accessibility control antigenic variation in trypanosomes.

Many evolutionarily distant pathogenic organisms have evolved similar survival strategies to evade the immune responses of their hosts. These include antigenic variation, through which an infecting organism prevents clearance by periodically altering the identity of proteins that are visible to the immune system of the host1. Antigenic variation requires large reservoirs of immunologically diverse antigen genes, which are often generated through homologous recombination, as well as mechanisms to ensure the expression of one or very few antigens at any given time. Both homologous recombination and gene expression are affected by three-dimensional genome architecture and local DNA accessibility2,3. Factors that link three-dimensional genome architecture, local chromatin conformation and antigenic variation have, to our knowledge, not yet been identified in any organism. One of the major obstacles to studying the role of genome architecture in antigenic variation has been the highly repetitive nature and heterozygosity of antigen-gene arrays, which has precluded complete genome assembly in many pathogens. Here we report the de novo haplotype-specific assembly and scaffolding of the long antigen-gene arrays of the model protozoan parasite Trypanosoma brucei, using long-read sequencing technology and conserved features of chromosome folding4. Genome-wide chromosome conformation capture (Hi-C) reveals a distinct partitioning of the genome, with antigen-encoding subtelomeric regions that are folded into distinct, highly compact compartments. In addition, we performed a range of analyses-Hi-C, fluorescence in situ hybridization, assays for transposase-accessible chromatin using sequencing and single-cell RNA sequencing-that showed that deletion of the histone variants H3.V and H4.V increases antigen-gene clustering, DNA accessibility across sites of antigen expression and switching of the expressed antigen isoform, via homologous recombination. Our analyses identify histone variants as a molecular link between global genome architecture, local chromatin conformation and antigenic variation.


July 7, 2019

A novel Tn3-like composite transposon harboring blaVIM-1 in Klebsiella pneumoniae spp. pneumoniae isolated from river water.

We present a new plasmid (pOW16C2) with a novel Tn3-like transposon harboring blaVIM-1 from a Klebsiella pneumoniae strain isolated from river water in Switzerland.Complete nucleotide sequence of pOW16C2 was obtained using a Pacific Biosciences SMRT sequencing approach and coding sequences were predicted.The 59,228?bp sequence included a typical IncN-like backbone and a mosaic structure with blaVIM-1, aacA4, aphA15, aadA1, catB2, qnrS1, sul1, and dfrA14 conferring resistance to carbapenems and other ß-lactam antibiotics, aminoglycosides, chloramphenicol, quinolones, sulfonamides, and trimethoprim, respectively. Most of these resistance genes were inserted in a class 1 integron that was embedded in a novel Tn3-like composite transposon.IncN plasmids carrying carbapenemases are frequently isolated from K. pneumoniae strains in clinical settings. The dissemination of K. pneumoniae harboring blaVIM-1 in surface water is a cause for increased concern to public health.


July 7, 2019

Complete annotated genome sequence of Mycobacterium tuberculosis (Zopf) Lehmann and Neumann (ATCC35812) (Kurono).

We report the completely annotated genome sequence of Mycobacterium tuberculosis (Zopf) Lehmann and Neumann (ATCC35812) (Kurono), which is a used for virulence and/or immunization studies. The complete genome sequence of M. tuberculosis Kurono was determined with a length of 4,415,078 bp and a G+C content of 65.60%. The chromosome was shown to contain a total of 4,340 protein-coding genes, 53 tRNA genes, one transfer messenger RNA for all amino acids, and 1 rrn operon. Lineage analysis based on large sequence polymorphisms indicated that M. tuberculosis Kurono belongs to the Euro-American lineage (lineage 4). Phylogenetic analysis using whole genome sequences of M. tuberculosis Kurono in addition to 22 M. tuberculosis complex strains indicated that H37Rv is the closest relative of Kurono based on the results of phylogenetic analysis. These findings provide a basis for research using M. tuberculosis Kurono, especially in animal models. Copyright © 2014 Elsevier Ltd. All rights reserved.


July 7, 2019

Genome sequence of Serratia nematodiphila DSM 21420T, a symbiotic bacterium from entomopathogenic nematode.

Serratia nematodiphila DSM 21420(T) (=CGMCC 1.6853(T), DZ0503SBS1(T)), isolated from the intestine of Heterorhabditidoides chongmingensis, has been known to have symbiotic-pathogenic life cycle, on the multilateral relationships with entomopathogenic nematode and insect pest. In order to better understanding of this rare feature in Serratia species, we present here the genome sequence of S. nematodiphila DSM 21420(T) with the significance of first genome sequence in this species. Copyright © 2014 Elsevier B.V. All rights reserved.


July 7, 2019

Prevalence of subtilase cytotoxin-encoding subAB variants among Shiga toxin-producing Escherichia coli strains isolated from wild ruminants and sheep differs from that of cattle and pigs and is predominated by the new allelic variant subAB2-2.

Subtilase cytotoxin (SubAB) is an AB5 toxin produced by Shiga toxin (Stx)-producing Escherichia coli (STEC) strains usually lacking the eae gene product intimin. Three allelic variants of SubAB encoding genes have been described: subAB1, located on a plasmid, subAB2-1, located on the pathogenicity island SE-PAI and subAB2-2 located in an outer membrane efflux protein (OEP) region. SubAB is becoming increasingly recognized as a toxin potentially involved in human pathogenesis. Ruminants and cattle have been identified as reservoirs of subAB-positive STEC. The presence of the three subAB allelic variants was investigated by PCR for 152 STEC strains originating from chamois, ibex, red deer, roe deer, cattle, sheep and pigs. Overall, subAB genes were detected in 45.5% of the strains. Prevalence was highest for STEC originating from ibex (100%), chamois (92%) and sheep (65%). None of the STEC of bovine or of porcine origin tested positive for subAB. None of the strains tested positive for subAB1. The allelic variant subAB2-2 was detected the most commonly, with 51.4% possessing subAb2-1 together with subAB2-2. STEC of ovine origin, serotypes O91:H- and O128:H2, the saa gene, which encodes for the autoagglutinating adhesin and stx2b were significantly associated with subAB-positive STEC. Our results suggest that subAB2-1 and subAB2-2 is widespread among STEC from wild ruminants and sheep and may be important as virulence markers in STEC pathogenic to humans. Copyright © 2014 Elsevier GmbH. All rights reserved.


July 7, 2019

Complete genome sequence of Enterobacter cloacae GGT036: a furfural tolerant soil bacterium.

Enterobacter cloacae is a facultative anaerobic bacterium to be an important cause of nosocomial infection. However, the isolated E. cloacae GGT036 showed higher furfural-tolerant cellular growth, compared to industrial relevant strains such as Escherichia coli and Corynebacterium glutamicum. Here, we report the complete genome sequence of E. cloacae GGT036 isolated from Mt. Gwanak, Seoul, Republic of Korea. The genomic DNA sequence of E. cloacae GGT036 will provide valuable genetic resources for engineering of industrially relevant strains being tolerant to cellular inhibitors present in lignocellulosic hydrolysates. Copyright © 2014 Elsevier B.V. All rights reserved.


July 7, 2019

Burkholderia pseudomallei sequencing identifies genomic clades with distinct recombination, accessory, and epigenetic profiles.

Burkholderia pseudomallei (Bp) is the causative agent of the infectious disease melioidosis. To investigate population diversity, recombination, and horizontal gene transfer in closely related Bp isolates, we performed whole-genome sequencing (WGS) on 106 clinical, animal, and environmental strains from a restricted Asian locale. Whole-genome phylogenies resolved multiple genomic clades of Bp, largely congruent with multilocus sequence typing (MLST). We discovered widespread recombination in the Bp core genome, involving hundreds of regions associated with multiple haplotypes. Highly recombinant regions exhibited functional enrichments that may contribute to virulence. We observed clade-specific patterns of recombination and accessory gene exchange, and provide evidence that this is likely due to ongoing recombination between clade members. Reciprocally, interclade exchanges were rarely observed, suggesting mechanisms restricting gene flow between clades. Interrogation of accessory elements revealed that each clade harbored a distinct complement of restriction-modification (RM) systems, predicted to cause clade-specific patterns of DNA methylation. Using methylome sequencing, we confirmed that representative strains from separate clades indeed exhibit distinct methylation profiles. Finally, using an E. coli system, we demonstrate that Bp RM systems can inhibit uptake of non-self DNA. Our data suggest that RM systems borne on mobile elements, besides preventing foreign DNA invasion, may also contribute to limiting exchanges of genetic material between individuals of the same species. Genomic clades may thus represent functional units of genetic isolation in Bp, modulating intraspecies genetic diversity. © 2015 Nandi et al.; Published by Cold Spring Harbor Laboratory Press.


July 7, 2019

Accumulation-associated protein enhances Staphylococcus epidermidis biofilm formation under dynamic conditions and is required for infection in a rat catheter model.

Biofilm formation is the primary virulence factor of Staphylococcus epidermidis. S. epidermidis biofilms preferentially form on abiotic surfaces and may contain multiple matrix components, including proteins such as accumulation-associated protein (Aap). Following proteolytic cleavage of the A domain, which has been shown to enhance binding to host cells, B domain homotypic interactions support cell accumulation and biofilm formation. To further define the contribution of Aap to biofilm formation and infection, we constructed an aap allelic replacement mutant and an icaADBC aap double mutant. When subjected to fluid shear, strains deficient in Aap production produced significantly less biofilm than Aap-positive strains. To examine the in vivo relevance of our findings, we modified our previously described rat jugular catheter model and validated the importance of immunosuppression and the presence of a foreign body to the establishment of infection. The use of our allelic replacement mutants in the model revealed a significant decrease in bacterial recovery from the catheter and the blood in the absence of Aap, regardless of the production of polysaccharide intercellular adhesin (PIA), a well-characterized, robust matrix molecule. Complementation of the aap mutant with full-length Aap (containing the A domain), but not the B domain alone, increased initial attachment to microtiter plates, as did in trans expression of the A domain in adhesion-deficient Staphylococcus carnosus. These results demonstrate Aap contributes to S. epidermidis infection, which may in part be due to A domain-mediated attachment to abiotic surfaces. Copyright © 2015, American Society for Microbiology. All Rights Reserved.


July 7, 2019

In-depth determination and analysis of the human paired heavy- and light-chain antibody repertoire.

High-throughput immune repertoire sequencing has emerged as a critical step in the understanding of adaptive responses following infection or vaccination or in autoimmunity. However, determination of native antibody variable heavy-light pairs (VH-VL pairs) remains a major challenge, and no technologies exist to adequately interrogate the >1 × 10(6) B cells in typical specimens. We developed a low-cost, single-cell, emulsion-based technology for sequencing antibody VH-VL repertoires from >2 × 10(6) B cells per experiment with demonstrated pairing precision >97%. A simple flow-focusing apparatus was used to sequester single B cells into emulsion droplets containing lysis buffer and magnetic beads for mRNA capture; subsequent emulsion RT-PCR generated VH-VL amplicons for next-generation sequencing. Massive VH-VL repertoire analyses of three human donors provided new immunological insights including (i) the identity, frequency and pairing propensity of shared, or ‘public’, VL genes, (ii) the detection of allelic inclusion (an implicated autoimmune mechanism) in healthy individuals and (iii) the occurrence of antibodies with features, in terms of gene usage and CDR3 length, associated with broadly neutralizing antibodies to rapidly evolving viruses such as HIV-1 and influenza.


July 7, 2019

Complete genome sequence of the Clostridium difficile laboratory strain 630¿ erm reveals differences from strain 630, including translocation of the mobile element CTn 5.

Background Clostridium difficile strain 630¿erm is a spontaneous erythromycin sensitive derivative of the reference strain 630 obtained by serial passaging in antibiotic-free media. It is widely used as a defined and tractable C. difficile strain. Though largely similar to the ancestral strain, it demonstrates phenotypic differences that might be the result of underlying genetic changes. Here, we performed a de novo assembly based on single-molecule real-time sequencing and an analysis of major methylation patterns.ResultsIn addition to single nucleotide polymorphisms and various indels, we found that the mobile element CTn5 is present in the gene encoding the methyltransferase rumA rather than adhesin CD1844 where it is located in the reference strain.ConclusionsTogether, the genetic features identified in this study may help to explain at least part of the phenotypic differences. The annotated genome sequence of this lab strain, including the first analysis of major methylation patterns, will be a valuable resource for genetic research on C. difficile.


July 7, 2019

Genetic determinants of reutericyclin biosynthesis in Lactobacillus reuteri.

Reutericyclin is a unique antimicrobial tetramic acid produced by some strains of Lactobacillus reuteri. This study aimed to identify the genetic determinants of reutericyclin biosynthesis. Comparisons of the genomes of reutericyclin-producing L. reuteri strains with those of non-reutericyclin-producing strains identified a genomic island of 14 open reading frames (ORFs) including genes coding for a nonribosomal peptide synthetase (NRPS), a polyketide synthase (PKS), homologues of PhlA, PhlB, and PhlC, and putative transport and regulatory proteins. The protein encoded by rtcN is composed of a condensation domain, an adenylation domain likely specific for d-leucine, and a thiolation domain. rtcK codes for a PKS that is composed of a ketosynthase domain, an acyl-carrier protein domain, and a thioesterase domain. The products of rtcA, rtcB, and rtcC are homologous to the diacetylphloroglucinol-biosynthetic proteins PhlABC and may acetylate the tetramic acid moiety produced by RtcN and RtcK, forming reutericyclin. Deletion of rtcN or rtcABC in L. reuteri TMW1.656 abrogated reutericyclin production but did not affect resistance to reutericyclin. Genes coding for transport and regulatory proteins could be deleted only in the reutericyclin-negative L. reuteri strain TMW1.656?rtcN, and these deletions eliminated reutericyclin resistance. The genomic analyses suggest that the reutericyclin genomic island was horizontally acquired from an unknown source during a unique event. The combination of PhlABC homologues with both an NRPS and a PKS has also been identified in the lactic acid bacteria Streptococcus mutans and Lactobacillus plantarum, suggesting that the genes in these organisms and those in L. reuteri share an evolutionary origin. Copyright © 2015, American Society for Microbiology. All Rights Reserved.


July 7, 2019

Complete genome of Geobacter pickeringii G13T, a metal-reducing isolate from sedimentary kaolin deposits.

We used PacBio sequencing to assemble the genome of the pristine freshwater isolate Geobacter pickeringii G13(T) into a single 3,618,700-bp circular chromosome polished to 99.999% accuracy (quality value [QV], 50). This isolate shares several features with other Geobacter spp., including genes for degradation of aromatics and an abundance of multiheme c-type cytochromes. Copyright © 2015 Badalamenti and Bond.


July 7, 2019

Short communication: Single molecule, real-time sequencing technology revealed species- and strain-specific methylation patterns of 2 Lactobacillus strains.

Pacific Biosciences’ (Menlo Park, CA) single molecule, real-time sequencing technology was reported to have some advantages in generating finished genomes and characterizing the epigenome of bacteria. In the present study, this technology was used to sequence 2 Lactobacillus strains, Lactobacillus casei Zhang and Lactobacillus plantarum P-8. Previously, the former bacterium was sequenced by an Applied Biosystems 3730 DNA analyzer (Grand Island, NY), whereas the latter one was analyzed with Roche 454 (Indianapolis, IN) and Illumina sequencing technologies (San Diego, CA). The results showed that single molecule, real-time sequencing resulted in high-quality, finished genomes for both strains. Interestingly, epigenome analysis indicates the presence of 1 active N(6)-methyladenine methyltransferase in L. casei Zhang, but none in L. plantarum P-8. Our study revealed for the first time a completely different methylation pattern in 2 Lactobacillus strains. Copyright © 2015 American Dairy Science Association. Published by Elsevier Inc. All rights reserved.


Talk with an expert

If you have a question, need to check the status of an order, or are interested in purchasing an instrument, we're here to help.